Anthropic is looking to build reliable, interpretable, and steerable AI systems that are safe and beneficial for users and society. The Research Engineer role aims to contribute to building these large-scale ML systems from the ground up, focusing on safety, steerability, and trustworthiness.
Requirements
- High performance, large-scale ML systems
- GPUs, Kubernetes, Pytorch, or OS internals
- Language modeling with transformers
- Reinforcement learning
- Large-scale ETL
Responsibilities
- making the cluster more reliable for our big jobs
- improving throughput and efficiency
- running and designing scientific experiments
- improving our dev tooling
- Optimizing the throughput of a new attention mechanism
- Comparing the compute efficiency of two Transformer variants
- Scaling a distributed training job to thousands of GPUs
Other
- Have significant software engineering experience
- Are results-oriented, with a bias towards flexibility and impact
- Pick up slack, even if it goes outside your job description
- Enjoy pair programming (we love to pair!)
- Want to learn more about machine learning research
- Care about the societal impacts of your work
- We require at least a Bachelor's degree in a related field or equivalent experience.
- Currently, we expect all staff to be in one of our offices at least 25% of the time.
- We do sponsor visas!
- We think AI systems like the ones we're building have enormous social and ethical implications.
- We greatly value communication skills.