At Apple, the business problem is to bridge the gap between research advances and practical applications in generative AI and multimodal foundation models to deliver the next groundbreaking Apple products & experiences.
Requirements
- Experience in deep learning with demonstrated work in at least one area of multimodal systems (e.g. vision, language, video, etc.)
- Proficiency in Python and in a modern deep learning framework such as PyTorch or JAX
- Experience with rapid prototyping, reproduction, and validation of research ideas
- Deep expertise in multimodal foundation models, with a focus on practical applications
- Strong applied research experience in at least one major area of model development (data curation, pre-training, fine-tuning, alignment, or evaluation)
- Experience with large-scale training pipelines, including working with large datasets and scaling models across distributed systems
- Experience bridging research ideas with production constraints
Responsibilities
- Evaluating and adapting emerging research
- Conducting applied research experiments
- Working with engineering teams to transform promising approaches into robust solutions
- Taking into account future hardware design and product needs
- Engaging and collaborating with several teams across Apple to deliver the best products
- Crafting upcoming research directions in the field of multimodal foundation models
- Developing state of the art solutions for challenging problems
Other
- Ability to work in a collaborative environment
- Ability to communicate the results of analyses in a clear and effective manner
- BS and a minimum of 3 years relevant industry experience
- Master's or PhD, or equivalent practical experience, in Computer Science, Computer Vision, Machine Learning, or related technical field
- Comprehensive medical and dental coverage, retirement benefits, a range of discounted products and free services