Apple is looking to build and refine scalable, multilingual NLP solutions that empower users with intelligent text input in their language of choice, incorporating generative AI into its products.
Requirements
- Experience building and maintaining model pipelines end-to-end, from data curation to evaluation
- Ability to design and perform experiments that bring ML and NLP research ideas to production
- Familiarity with LLM techniques such as SFT, RLHF, prompt engineering, data synthesis, automatic evaluation, and RAG
- Strong Python programming skills, with experience developing production-quality Python modules
- Solid background in machine learning, data science, natural language processing, or statistics
Responsibilities
- Development and maintenance of modeling pipelines that scale to multiple languages and production deployment
- Definition of robust automated evaluation metrics to facilitate hill-climbing on model quality
- Failure analysis to understand shortcomings of our models
- Research into techniques for improving model behavior
- Curation and synthesis of representative training and evaluation data
- Implementation of experiments and simulations to assess the value of model changes
- Collaboration with language experts and QA to refine the modeling approach in light of language-specific requirements
Other
- MS or PhD in Computer Science or a related field, with at least 2 years of industry experience
- Background in linguistics, fluency in multiple languages, or a passion for scaling NLP features for global audiences
- Excellent written and verbal communication skills
- History of developing Python packages and supporting other users
- Record of publications, innovations, and/or leadership