Drug Hunter (drughunter.com) is an essential web-based knowledge platform for drug discovery and development innovators turning molecules into medicines. The Scientific Content team at Drug Hunter distills the science and technology behind emerging drugs into concise searchable reports and resources with relevant transferable insights. Drug Hunter members include many leading biotechnology and pharmaceutical companies, as well as top-tier investment firms.
We are seeking a highly motivated Data Engineer to extend and develop our internal data platform, and contribute to novel and innovative Drug Hunter current and future products.
Job Summary:
As a Data Engineer, you will design and maintain data pipelines, integrate chemical, biological and clinical data sources, and optimize databases for analytics and machine learning applications. You will work closely with other data scientists, and software engineers to develop scalable solutions that enable low latency data integration in the fast-paced setting of drug discovery information.
Key Responsibilities:
-
Develop, maintain, and optimize ETL/ELT pipelines for chemical, biological, clinical, and other research data.
-
Build scalable and secure data architectures using cloud platforms
-
Integrate diverse data multimodal sources, including structured and unstructured data (documents, images, etc),
-
Design and manage relational (PostgreSQL) and NoSQL (MongoDB) databases.
-
Implement best practices for data governance, security, and licensing compliance. Be familiar with FAIR data concepts.
-
Automate data workflows and develop APIs for data access and integration.
-
Monitor and troubleshoot data pipeline performance, ensuring reliability and scalability.
Required Qualifications:
-
Bachelor's or Master's degree in Computer Science, Data Science, or a related field.
-
5+ years of experience in data engineering, preferably in life sciences or healthcare.
-
Proficiency in Python, SQL, and big data technologies.
-
Experience with cloud data services (e.g. AWS).
-
Strong understanding of data modeling, warehousing, and pipeline automation.
-
Familiarity with chemical/biological data sources and formats and scientific informatics tools.
-
Experience with containerization and orchestration (Docker, Kubernetes).
Preferred Qualifications:
-
Experience with graph databases and knowledge graphs for biomedical data.
-
Experience with ontologies and semantic data standards and tools.
-
Background in computational drug discovery/chemistry/biology or bioinformatics.
-
Experience in NER/NLP techniques, OCR (e.g. Tesseract).
Application Instructions:
Please include a list of publications, conference presentations, and GitHub repository link (if applicable) in your application submission.
Compensation:
Drug Hunter takes a market-based approach to pay. The candidate's starting pay will be determined based on job-related skills, experience, qualifications, interview performance, and work location.
Total Compensation includes the following:
-
Competitive salary, variable compensation, and equity
-
Broad range of medical, dental, vision, and life insurance plans for employees and their dependents
-
Supplemental insurance including disability, cancer, and critical illness
-
Paid parental leave and childcare FSA plan
-
401(k) + employer match
-
Home office set up stipend for remote employees
-
Learning and development support
-
Generous and flexible vacation
We are an equal opportunity employer, which means we don't discriminate on the basis of race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, or disability status. We also consider for employment qualified applicants with arrest and conviction records, consistent with applicable federal, state and local law, including but not limited to the San Francisco Fair Chance Ordinance.
Please be aware that Drug Hunter will never request personal information, payment, or sensitive details outside of iSolved or via email. All official communications will come from an @drughunter.com email address or from an approved vendor alias.