Moveworks is looking to build cutting edge ML infrastructure for building and serving large language models.
Requirements
- Experience with deep learning framework such as Pytorch or Huggingface or LLM serving frameworks such as vLLM or TensorRT-LLM.
- Experience with building and scaling end-to-end machine learning systems
- Experience building scalable micro services and ETL pipelines
- Expertise in Python and experience with performant language such as C++ or GoLang
- Experience with ML Inference optimization using TensorRT.
- Experience with distributed training frameworks such as Deepspeed.
- Experience in managing and scaling GPU Inference services via Kubernetes
Responsibilities
- Design, build and optimize scalable machine learning infrastructure to support training, evaluation, and deployment of large language models.
- Build abstractions to automate various steps in different ML workflows
- Collaborate with cross functional teams of engineers, data analytics, machine learning experts, and product to build new features
- Leverage experience to drive best practices in ML and data engineering
Other
- 2+ years of industry experience in Machine Learning, Infrastructure or related fields
- Bachelor's in Computer Science, Computer Engineering, Mathematics, or equivalent field.
- A love of research publications in the machine learning and software engineering communities
- Effective communicator with experience collaborating cross-functionally with other teams