Amazon is investing in generative AI (GenAI) and the responsible development and deployment of large language models (LLMs) across all of its businesses. The Applied Scientist will play a critical role in driving innovation and advancing the state-of-the-art in natural language processing and machine learning to tackle challenging problems and elevate the customer experience.
Requirements
- Experience programming in Java, C++, Python or related language
- Experience with neural deep learning methods and machine learning
- Experience with modeling tools such as R, scikit-learn, Spark MLLib, MxNet, Tensorflow, numpy, scipy etc.
- Experience with large scale distributed systems such as Hadoop, Spark etc.
Responsibilities
- Ensure quality of speech/language/other data throughout all stages of acquisition and processing, including data sourcing/collection, ground truth generation, normalization, transformation, cross-lingual alignment/mapping, etc.
- Clean, analyze and select speech/language/other data to achieve goals
- Build and test models that elevate the customer experience
- Collaborate with colleagues from science, engineering and business backgrounds
- Present proposals and results in a clear manner backed by data and coupled with actionable conclusions
- Work with engineers to develop efficient data querying infrastructure for both offline and online use cases
Other
- 3+ years of building machine learning models for business application experience
- PhD, or Master's degree and 6+ years of applied research experience
- Collaborate with colleagues from science, engineering and business backgrounds
- Present proposals and results in a clear manner backed by data and coupled with actionable conclusions
- Amazon is an equal opportunity employer and does not discriminate on the basis of protected veteran status, disability, or other legally protected status.