Mass General Brigham needs to modernize the data warehouse of the Nurses’ Health Studies (NHS), a legacy system whose age is hampering productivity and data sharing. The growing scale and scope of NHS data require advanced methodologies for storage, access, sharing, and analysis.
Requirements
- In-depth knowledge of GraphQL and/or other graph query languages
- Experience with Gen3 query development and data submission workflows
- Experience with Kubernetes management and extensive knowledge of PostgreSQL
- PostgreSQL and Kubernetes experience is mandatory
- Extensive UNIX scripting experience for Kubernetes and file operations
- Proficient in Python and/or Java, including the use of APIs or web services
- Experience with Pandas and large-scale data movement, as well as data modeling and architecture
Responsibilities
- Design and implement scalable data pipelines and storage solutions to accommodate the growth of NHS
- Optimize data access and retrieval methods for maximum efficiency
- Develop and oversee an integration plan for new processes and tools
- Meet regularly with stakeholders to assess data requirements and validate solutions
- Combine strategic planning with hands-on work when delivering data and analytics action plans to stakeholders
- Establish data governance practices to ensure data quality, security, and compliance
- Understand and maintain health data compliance requirements
Other
- Master’s or PhD in Computer Science, Data Science, Statistics, or related field
- 3+ years of data engineering experience
- Promote a collegial, team-oriented work style
- Work with IT and Project Management personnel to establish strategy, deadlines and resource needs
- Foresee obstacles, identify workarounds, leverage resources, and rally teammates