Oura is looking for a Senior Data Engineer to join their Data & ML Platform team to develop, maintain, and support Oura's Datalake and adjacent systems, collaborating with data scientists, analysts, and other departments that utilize the Datalake.
Requirements
- 5+ years of experience developing and operating production systems.
- Experience running, monitoring, and debugging production systems at scale on a public cloud. We rely on AWS but experience with other cloud platforms counts too.
- Good architectural understanding of event-driven architectures, workflow engines, database, and data warehouse systems.
- Ability to design robust data models and pipelines, with attention to data accuracy and integrity.
- Experience in building infrastructure and components following best practices such as CI/CD and infrastructure as code.
- Demonstrated skills in writing well-structured, testable, and high-performance code.
- Familiarity with some of the following: Workflow engines, Stream processing, MLops, Spark, Athena, SQL, dbt, Databricks.
Responsibilities
- Build, operate, and improve Oura’s Datalake consisting of systems to move, process, and store large amounts of data (Terabyte–Petabyte scale) leveraging tools such as AWS Kinesis, S3, Spark / Glue, Athena, dbt, Iceberg, Snowflake, Docker, workflow engines, and more.
- Design and implement data models and pipelines with a strong focus on data quality, reliability, and governance.
- Ensure that our data platform and processes meet privacy, compliance, and security requirements.
- Build components that support the handling of datasets, training, testing, and release of new on-device and cloud-based ML models.
- Independently collaborate with different stakeholders including Data Scientists, Testing, Firmware, and Hardware to define and implement improvements and new functionality and support internal Datalake consumers in their day-to-day work.
- Write code, mostly in Python and SQL with occasional additions Typescript, Java or some other language.
Other
- This is a remote U.S. role with a strong preference for candidates based in the East Coast.
- You are self-motivated, proactive, and able to lead and manage your own work effectively.
- A pragmatic and goal oriented approach to development.
- Real-life experience with Databricks.
- Familiarity and experience with the Data Mesh approach