OpenAI's Sora team focuses on integrating multimodal capabilities into OpenAI's AI products. This role is central to designing and scaling the infrastructure that powers large-scale multimodal training and evaluation.
Requirements
- Have strong experience with distributed systems and large-scale infrastructure, along with a strong interest in data.
- Are detail-oriented and bring rigor to building and maintaining reliable systems.
- Demonstrate excellent software engineering fundamentals and organizational skills.
Responsibilities
- Design, build, and maintain data infrastructure systems such as distributed compute, data orchestration, distributed storage, streaming infrastructure, and machine learning infrastructure, while ensuring scalability, reliability, and security.
- Ensure our data platform can scale by orders of magnitude while remaining reliable and efficient.
- Partner with researchers to deeply understand requirements and translate them into production-ready systems.
- Harden, optimize, and maintain critical data infrastructure systems that power multimodal training and evaluation.
Other
- This role is based in San Francisco, CA.
- We use a hybrid work model of 3 days in the office per week and offer relocation assistance to new employees.
- Are comfortable with ambiguity and rapid change.