Roblox is looking to solve the problem of creating safer, more civil shared experiences for its global community of developers and creators by building advanced generative AI safety systems powered by high-quality, multimodal data.
Requirements
- 5+ years of experience building production-grade, scalable, and reliable data systems for AI/ML applications.
- Strong expertise in developing large-scale data pipelines (batch and streaming) with technologies such as Spark, Ray, Kubeflow, or similar.
- Hands-on experience working with data systems at petabyte scale or beyond.
- Approach data as a product—prioritizing quality, discoverability, and reusability.
Responsibilities
- Design, build, and own core components of the data infrastructure and tools to clean, transform, and curate multimodal datasets for AI model training and evaluation.
- Partner with ML Engineers, Data Scientists, and Ops teams to understand evolving data needs, improve workflows, and surface actionable insights that drive improvement in model performance and policy.
- Develop frameworks to capture, organize, and surface critical data, enabling model teams to rapidly identify performance gaps and retrain on high-value examples.
- Amplify the impact of other Safety teams by identifying data bottlenecks and building scalable, generalizable solutions.
- Design high-throughput data storage and retrieval systems that facilitate efficient training and evaluation of large-scale generative models.
- Implement automated systems for data quality, traceability, and governance to ensure accuracy and auditability throughout the ML lifecycle.
Other
- Bachelor’s degree or higher in Computer Science or a related technical field.
- Excellent cross-functional collaboration skills; enjoy building infrastructure and tools that enable and accelerate team productivity.
- Roles that are based in an office are onsite Tuesday, Wednesday, and Thursday, with optional presence on Monday and Friday (unless otherwise noted).
- Roblox provides equal employment opportunities to all employees and applicants for employment and prohibits discrimination and harassment of any type without regard to race, color, religion, age, sex, national origin, disability status, genetics, protected veteran status, sexual orientation, gender identity or expression, or any other characteristic protected by federal, state or local laws.