This role supports Tendo's strategic data engineering solutions by ingesting, transforming, and warehousing healthcare-related data from various sources to support advanced analytics and AI/ML model development.
Requirements
- 7+ years of experience in data engineering
- Extensive experience in the design, build, and maintenance of data ETL pipelines
- Extensive knowledge of coding in Python or Scala with a focus on data processing
- Experience using Apache Spark (PySpark or Scala)
- Experience with the AWS technology stack (S3, Glue, Athena, EMR, etc.)
- Experience with data and entity relationship modeling to support data warehouses and analytics solutions
- Deep understanding of relational and non-relational databases (SQL/NoSQL)
Responsibilities
- Ingest, transform, and warehouse healthcare-related data from various sources
- Produce quality data flows and transformations that support advanced analytics and AI/ML model development
- Develop tools and solutions to facilitate data integration, data warehousing, and data modeling
- Enable Data Engineers and Data Scientists to experiment with and train machine learning models that produce useful insights for Tendo’s customers
- Collaborate with Data Scientists and Business Intelligence Analysts to ensure efficient and effective data processing and analysis
- Optimize data infrastructure and processes for performance and scalability
- Develop and maintain data documentation and data lineage
Other
- Candidates may be located in any one of our hub locations.
- Stay current with emerging technologies and industry trends related to data engineering.
- Experience working in a professional software environment using source control (Git), issue tracking and documentation tools (Jira, Confluence, etc.), continuous integration, code reviews, and an agile development process (Scrum/Lean).
- Familiarity with basic data privacy and security principles.
- Interest and/or experience in AI/ML applications, including support for model development or deployment workflows.