Apple is looking to enhance the intelligence of Siri and its products by building groundbreaking technology for search, natural language processing, artificial intelligence, and machine learning. The Information Intelligence Infrastructure team needs to develop and maintain a robust infrastructure that powers large foundation models and various Apple services, ensuring low latency and efficient compute utilization.
Requirements
- Strong background in computer science: algorithms, data structures and system design
- 10+ year experience on large scale distributed system design, operation and optimization
- Familiar with one of the popular ML Frameworks like Pytorch, Tensorflow
- Proficient in building and maintaining systems written in modern languages (e.g. Golang, Python)
- Familiar with fundamental Deep Learning architectures such as Transformers, Encoder/Decoder models.
- Familiarity with Nvidia TensorRT-LLM, vLLLM, Nvidia Triton Server etc.
Responsibilities
- Design, build and maintain infrastructure to support features that empower billions of Apple users.
- Processes billions of requests every day across our search and foundation model platform.
- Take full end-to-end ownership of our services, driving them through every stage meticulously, encompassing conception, design, implementation, deployment, and maintenance.
- Work on incredibly complex large scale systems with trillions of records and petabytes of data.
- Work along side Foundation Model Research team to optimize inference for cutting edge model architectures.
- Work closely with product teams to build production grade solutions for millions of customers in real time.
- Optimize billions of parameter language and vision and speech models using state of the art technologies and make it run at scale of Apple.
Other
- Excellent interpersonal skills able to work independently as well as cross-functionally