Apple is seeking a software/data ETL engineer to design, build, and implement robust and efficient ETL processes for large datasets.
Requirements
- High proficiency in Python
- Experience with data quality and validation techniques
- Proven experience working with large datasets
- Experience with Big Data formats and tooling such as Parquet, Iceberg
- Experience with media data formats (i.e. photos and video) and metadata formats such as CSV and JSON
- Expertise in at least one cloud platform (e.g., AWS, Azure, GCP) and relevant services
- Experience with Dagster and other data orchestration frameworks
Responsibilities
- Design, develop, and implement ETL processes using a suitable framework
- Collaborate with the data processing team, receiving technical instructions and specifications
- Provide documentation for ETL processes, including data lineage and transformation logic
- Troubleshoot and resolve data-related issues, utilizing strong analytical and problem-solving skills
- Monitor and maintain the performance of ETL pipelines, identifying and resolving bottlenecks
- Participate in code reviews and contribute to best practices within the team
- Contribute to the improvement of our data infrastructure and processes
Other
- Bachelor's degree in Computer Science, Engineering, or a related field
- 5+ years of experience in data engineering, with a strong focus on ETL/ELT processes
- Strong communication and collaboration skills
- Excellent problem-solving and analytical skills
- Ability to work effectively with limited contextual information and follow detailed technical specifications