Improve stability and performance for private cloud compute and implement new functionality for generative AI as part of the Private Cloud Compute component of Apple Intelligence
Requirements
- Experience programming in: Swift, C, C++, iOS/macOS or XCode
- Practical experience running machine learning models and evaluating them for quality and performance metrics
- Familiar with Apple ML stack (ANE, CoreML, MPS/Metal), high-level general distributed ML stack (PyTorch-distributed, NCCL) and high throughput inter-chip communication systems
- Familiar with on-device iOS development
Responsibilities
- Engineer continuous improvements in stability and performance for private cloud compute
- Implement entirely new functionality as it emerges from the research community
- Write performant and scalable frameworks to distribute and coordinate ML inference tasks to different hardware acceleration IP blocks on different SoCs
- Integrate inference code into a full service stack to ensure that user traffic is served reliably and performantly
- Develop code that is easy and safe to develop, update and monitor
Other
- Quality focus - produce reliable, maintainable, deliverable software
- Comfortable diving deep - working across multiple levels of abstraction
- Good at handling relationships & communication - collaborate well with colleagues across a wide range of functions
- B.S. in Computer Science or other numerate subject, with 6+ years experience; or - M.S. in Computer Science or other numerate subject, with 4+ years experience