Google Deepmind Robotics is looking to solve AGI in the physical world and is facing a major bottleneck of high quality data, which this role aims to tackle
Requirements
- C++ and Python programming experience
- Experience with production data systems and quality projects
- Experience in data analysis, debugging, experiment design and evaluation
- Experience with production ML-oriented systems or applications
- Rich experience working with very large datasets using Google data infrastructure and tooling ecosystems like Flume and PLX
- Industry experiences in robotics or automation
Responsibilities
- Design and implement large scale data acquisition, processing and curation pipelines, assuming end-to-end ownership over the full lifecycle of high quality datasets for training advanced robotics foundation models
- Systematically improve data quality by performing sophisticated data analysis, debugging and experiments, developing metrics, tests and monitoring mechanisms, which directly contribute to ML model improvements
- Develop, improve large scale data pipeline infrastructure and tooling. Improve system reliability, usability and scalability, data safety and security over the full data lifecycle
Other
- Bachelor’s degree or equivalent practical experience
- 2+ years of experience with software development in one or more programming languages, or 1 year of experience with an advanced degree
- Strong problem-solving skills
- Product mindset, willingness to learn, work with research and product colleagues, focus on delivering value in real-world applications
- Ability to pass a background check