Apple is seeking to advance its Apple Intelligence products, particularly Siri, by leveraging machine learning and generative AI to create a highly personalized and effective user experience while maintaining privacy standards. The goal is to evaluate and enhance AI/ML models that will power the next generation of Apple products.
Requirements
- 7+ years of professional work experience applying machine learning to real-world problems and crafting scalable and effective data solutions.
- Experience with managing datasets for ML training and/or evaluation.
- MS/PhD in Machine Learning, Computer Science, or equivalent experience in a related field.
- Excellent programming skills in Python.
- Good Conversational AI domain knowledge.
- Expertise in defining and measuring evaluation coverage for large language models and agents.
- Experience with systems engineering; in-depth understanding of interdependencies of ML and SW components.
Responsibilities
- Deliver offline evaluation insights that drive model development and improvements with wins for the end-user experience.
- Collaborate closely with cross-functional teams to lead the creation and evolution of high-quality datasets for evaluation of state-of-the-art models.
- Leverage large language models (LLMs) to automatically evaluate the impact of model changes on end-user experience.
- Assess the quality and naturalness of conversations with a digital assistant.
- Harness the power of generative AI to create adversarial scenarios that anticipate future user behaviors and edge cases.
- Develop representative datasets for delivering a highly personalized user experience.
- Shape the future of Apple products by ensuring that model improvements translate into tangible benefits for users.
Other
- MS/PhD in Machine Learning, Computer Science, or equivalent experience in a related field.
- Excellent problem solving, critical thinking, and communication skills.
- Enthusiasm and ability for continuing to learn new technologies.
- Experience delivering large-scale, cross-functional product or platform outcomes.
- Ability to collaborate with cross-functional teams.