Zocdoc is looking to improve the healthcare experience for patients and providers by building reliable, scalable, and cost-effective data systems.
Requirements
- Expertise in Python or Scala, and strong proficiency in SQL for data modeling and optimization
- Deep experience with data warehouse technologies like Snowflake, including clustering, performance tuning, query profiling, and access management
- Experience with data lake and lakehouse architectures such as Databricks, Delta Lake, Iceberg, or Apache Hudi, and query engines like Athena or Presto
- Proven ability to design and implement scalable ETL pipelines using technologies like dbt for transformation and Databricks for large-scale processing
- Familiarity with managing infrastructure-as-code, job orchestration (Dagster, Airflow), and CI/CD workflows
- Experience implementing row-level security and data masking for PHI/PII use cases
- Exposure to governance tools (e.g., Collibra, Amplitude, Looker admin, Unity Catalog)
Responsibilities
- Designing and maintaining scalable data pipelines for ingestion, transformation, and delivery across multiple data sources
- Collaborating with Analytics Engineers and Product teams to curate datasets and establish data contracts that improve transparency and reliability
- Developing and managing modern data architectures, such as lakehouses and medallion layers, using tools like Databricks, Delta Lake, or Iceberg
- Optimizing Snowflake usage and performance, ensuring data quality and cost efficiency
- Supporting and scaling orchestration platforms (like Dagster), metadata systems (like Unity Catalog or Collibra), and monitoring tools (like Datadog)
- Collaborating with data engineering, analytics engineering, and security teams to deliver stable and efficient infrastructure for diverse workflows
- Building tools, alerting systems, and documentation that ensure reliable operation and developer self-service across our data stack
Other
- 5+ years of experience in data engineering or platform/infrastructure roles, with a focus on scaling tools and systems
- Autonomous, proactive, and eager to solve complex problems with practical, scalable solutions
- Excellent collaboration and communication skills to support cross-functional teams and data consumers
Benefits
- Unlimited PTO
- 100% paid employee health benefit options
- Employer-funded 401(k) match
- Corporate wellness programs with Headspace and Peloton
- Parental leave
- Cell phone reimbursement
- Commuter benefits
- Catered lunch every day, along with snacks (when back in office)
- Convenient SoHo location