Healthcare quality is declining and soaring costs are crushing American families and businesses.
Requirements
- Mastery of Python, as well as SQL for data extraction, transformation, and analysis
- 3+ years of experience working in the cloud (AWS preferred)
- 3+ years of experience building ETL/ELT pipelines with tools like Apache Spark, Apache Beam, AWS Glue, etc.
- 2+ years managing pipeline workflows with tools such as Airflow, Argo, Prefect, etc.
- 2+ years of experience working with Apache Iceberg, AWS Lake Formation, Apache Hudi or equivalent
- 2+ years of experience leveraging analytical solutions such as Snowflake, Redshift, BigQuery, or Trino for large-scale data processing and analytics
- Preferred experience with Kubernetes
Responsibilities
- Build and maintain high quality data systems that calculate Garner’s provider rankings
- Implement algorithms to improve the accuracy of our provider rankings data while lowering total associated costs and resource utilization
- Partner with engineers across Garner to define and implement coding standards, common libraries, and shared tooling
- Partner with Data Science, Researchers, and Product to convert ideas from research into reliable, performant production systems
- Drive the technical strategy in the team to ensure we are building maintainable, scalable, resilient software
- Help the team grow by mentoring junior members and raising the engineering bar
Other
- 10+ years of experience, with the 3+ most recent years specifically in Data Engineering or distributed Software Engineering
- Must have a Computer Science degree or equivalent experience demonstrating CS fundamentals
- A willingness to “roll up your sleeves” and do whatever is necessary to ensure company success
- Experience working in a rapidly evolving startup environment
- A desire to be a part of our mission to improve the U.S. healthcare system
- Must be willing to work in the office 3 days per week on Tuesday, Wednesday and Thursday