At Apple, the business problem is to bridge the gap between research advances and practical applications in generative AI and multimodal foundation models to deliver the next groundbreaking Apple products and experiences.
Requirements
- Experience in deep learning with demonstrated work in at least one area of multimodal systems (e.g. vision, language, video, etc.)
- Proficiency in Python and in a modern deep learning framework such as PyTorch or JAX
- Experience with rapid prototyping, reproduction, and validation of research ideas
- Deep expertise in multimodal foundation models, with a focus on practical applications
- Strong applied research experience in at least one major area of model development (data curation, pre-training, fine-tuning, alignment, or evaluation)
- Experience with large-scale training pipelines, including working with large datasets and scaling models across distributed systems
- Experience bridging research ideas with production constraints
Responsibilities
- Evaluating and adapting emerging research
- Conducting applied research experiments
- Working with engineering teams to transform promising approaches into robust solutions
- Taking into account future hardware design and product needs
- Engaging and collaborating with several teams across Apple to deliver the best products
- Crafting upcoming research directions in the field of multimodal foundation models
- Developing state of the art solutions for challenging problems
Other
- Ability to work in a collaborative environment
- Ability to communicate the results of analyses in a clear and effective manner
- BS and a minimum of 3 years relevant industry experience
- Master's or PhD, or equivalent practical experience, in Computer Science, Computer Vision, Machine Learning, or related technical field
- Comprehensive medical and dental coverage, retirement benefits, a range of discounted products and free services