The Apple Data Platform (ADP) group is looking to build the data platform that enables the next generation of intelligent experiences on all Apple products and services by solving data problems at scale and improving ML experience for Apple customers.
Requirements
- Experience with containerization and orchestration technologies, such as Docker and Kubernetes
- Experience in delivering data and machine learning infrastructure in production environments
- Experience configuring, deploying and troubleshooting large scale production environments
- Experience in designing, building, and maintaining scalable, highly available systems that prioritize ease of use
- Experience with alerting, monitoring and remediation automation in a large scale distributed environment
- Extensive programming experience in Java, Python or Go
- Understanding of the ML lifecycle and state of the art ML Infrastructure technologies
Responsibilities
- Designing, implementing, and maintaining distributed systems to build world-class ML platforms/products at scale
- Diagnose, fix, improve, and automate complex issues across the entire stack to ensure maximum uptime and performance
- Design and extend services to improve functionality and reliability of the platform
- Monitor system performance, optimize for cost and efficiency, and resolve any issues that arise
- Build relationships with stakeholders across the organization to better understand internal customer needs and enhance our product better for end users
Other
- B.S., M.S., or Ph.D. in Computer Science, Computer Engineering, or equivalent practical experience
- Strong collaboration and communication (verbal and written) skills
- 5+ years of experience in distributed systems with deep knowledge in computer science fundamentals
- Comprehensive medical and dental coverage, retirement benefits, a range of discounted products and free services
- Relocation might be eligible for discretionary bonuses or commission payments