Johnson & Johnson is seeking to improve healthcare outcomes worldwide by standardizing and connecting biomedical and clinical data, and is looking for a Knowledge Graph Engineer to join their Data Strategy and Products team to achieve this goal.
Requirements
- Programming background in parser combinators, natural language processing, and linked data (RDF Triple Stores and property graphs).
- Demonstrated experience in large-scale knowledge graphs construction, ontology development, pharmaceutical or healthcare domains integration.
- Proficiency in semantic web technologies (SPARQL, RDF, OWL), familiarity with graph databases (Neo4j, Amazon Neptune).
- Proven work with complex biomedical datasets, including genomics, proteomics, and high-throughput screening data.
- Experience in CI/CD implementations, git usage, CI/CD stacks (Jenkins, GitLab, Azure DevOps), DevOps tools, metrics/monitoring, and containerization technologies (Docker, Singularity).
- Strong skills in analysis, problem-solving, organizational change, project delivery, and managing external vendors.
- Familiarity with various data storage solutions (SQL, key-value, column, document, graph stores) and data modeling techniques (semantic data, ontologies, taxonomies).
Responsibilities
- Contribute to the design and implementation of a scalable knowledge graph infrastructure focused on data standardization and interoperability.
- Curate and extend ontologies for clear mapping into established biomedical ontologies and controlled terminologies using RDF standards.
- Apply graph-based data modeling for efficient organization, integration and retrieval to ensure system flexibility and long-term maintainability.
- Stand up SPARQL/GraphQL/REST services; develop ingestion and curation pipelines to ingest, normalize and map concepts across data sources.
- Extend and curate ontologies (e.g., diseases, drugs, targets, pathways, etc.) and maintain synonyms, cross-references, and provenance.
- Partner with cross-functional teams to enable NLP/RAG over graphs, features for predictive modeling and terminology services for search and study design tools.
- Work with IT and DevOps teams to deploy and manage the graph database infrastructure, focusing on high availability, scalability, and recovery operations.
Other
- Desired Ph.D. or master's degree in bioengineering, computer science, IT, bioinformatics, physics, mathematics, or related fields, emphasis on semantic technologies and biomedical application.
- At least 5 years professional experience in health informatics, or at least 7 years of professional experience or with additional consideration for candidates with graduate degrees or equivalent experience.
- Ability to multi-task, prioritize work, exhibit organizational skills and flexibility to deliver maximum business value.
- Capacity to translate discussions into user requirements and project plans.
- Willingness to travel less than 25% to conferences and internal meetings.