Amazon is looking to solve the problem of advancing the state-of-the-art in natural language processing and machine learning, specifically in the development and deployment of large language models (LLMs) across all of its businesses.
Requirements
- Experience programming in Java, C++, Python or related language
- Experience in any of the following areas: algorithms and data structures, parsing, numerical optimization, data mining, parallel and distributed computing, high-performance computing
- Experience using Unix/Linux
- Experience in professional software development
Responsibilities
- Ensure quality of speech/language/other data throughout all stages of acquisition and processing, including data sourcing/collection, ground truth generation, normalization, transformation, cross-lingual alignment/mapping, etc.
- Clean, analyze and select speech/language/other data to achieve goals
- Build and test models that elevate the customer experience
- Present proposals and results in a clear manner backed by data and coupled with actionable conclusions
- Work with engineers to develop efficient data querying infrastructure for both offline and online use cases
- Collaborate with colleagues from science, engineering and business backgrounds
Other
- 3+ years of building models for business application experience
- PhD, or Master's degree and 4+ years of CS, CE, ML or related field experience
- Ability to work with cross-functional teams
- Must be able to present proposals and results in a clear manner
- Must be able to work in a fast-paced environment