Modernizing the world of Healthcare IT by building and maintaining the infrastructure that empowers analysts and data scientists to drive insights.
Requirements
- 7+ years experience in Data Engineering.
- Strong knowledge of SQL and relational databases.
- Deep understanding and prior experience with Spark.
- Deep understanding and prior experience with Spark/pySpark.
- Understanding of data warehouse table design, star schemas, etc.
- Previous experience architecting and building data pipelines from 1st party and 3rd party data sources.
- Experience using AWS cloud services (S3, Athena, EMR, Kinesis).
Responsibilities
- Develop and maintain data pipelines while achieving high reliability and efficiency.
- Help guide data infrastructure design and develop proof of concepts for recommended solutions.
- Liaison with engineering and product to help the Research team develop new data-products.
- Maintain documentation for the Data Warehouse and other data products.
- Design, develop, QA and maintain code related to data engineering.
- Provide thought-leadership and dependable execution on diverse project.
Other
- Masters in Computer Science or equivalent work experience.
- Strong problem-solving skills, adaptable, proactive, and willing to take ownership.
- Strong commitment to quality, architecture, and documentation.
- Experience with data pipeline technologies a plus (Airflow, Luigi).
- Experience with business intelligence tools a plus (Tableau, Qlik).
- Experience with Databricks.
- Hybrid role (3 days a week from the office).