OpenAI needs to design and deploy services and infrastructure that empower its models and agents to generate insights and execute tasks grounded in authorized data sources and internal systems. This grounding unlocks higher model accuracy and task performance, and the systems must remain performant and reliable at an unprecedented pace of growth and scale.
Requirements
- Have expertise in deploying, configuring, and operating software systems within customer-managed infrastructure and networks.
Responsibilities
- Design, build, and maintain infrastructure and networking systems that connect structured and unstructured data to LLMs in a performant and reliable manner.
- Design, build, and maintain secure and compliant systems to process sensitive and valuable data.
- Ensure our platform can support use cases at the scale of large enterprise customers, while remaining reliable and efficient.
- Integrate data access and authentication with our production model and agent systems.
- Collaborate with product, research, and forward-deployed engineering teams to work out how to deploy production systems and achieve world-class results on overall task capabilities.
- Own the reliability of the systems you build, including participation in an on-call rotation for critical incidents.
Other
- Own the full lifecycle of your work—from architecture and implementation to production operations and on-call responsibilities.
- Success in this role requires a solid understanding of enterprise data management fundamentals, including security, identity and access, performance, and reliability.
- Are excited to bring the power of AGI to the infrastructure the world runs on today.
- Are comfortable with ambiguity and rapid change.
- Have an intrinsic desire to learn and fill in missing skills, and an equally strong talent for sharing learnings clearly and concisely with others.