Drive the design and operation of Gradient AI platform at DigitalOcean, focusing on delivering a simple and innovative agent development experience with best of breed scale, performance, and predictability.
Requirements
- Hands-on experience designing and operating production-grade AI/ML platforms using the latest GenAI and Agent-development technologies.
- 10+ years in designing and building applications on the cloud with 5+ years experience in AI/ML platforms
- Expertise in driving operational excellence via automation and best practices.
- Strong written and verbal communication skills with a track record of mentoring senior and junior engineers; translating complex concepts across engineering and business teams.
- Experience with cloud-based technologies
- Experience with AI/ML platforms
- Experience with automation and best practices
Responsibilities
- Design and evolve the architecture for our agent development experience including code integration, evaluations, observability, tools, and cross-agent interactions.
- Drive initiatives to deliver an architecture optimized for scalability, reliability, low-latency, and cost efficiency.
- Manage and evolve our benchmarking system to continuously raise the bar on our experience.
- Roll out new services by taking on a hands-on lead role as required to ensure timely delivery.
- Establish and enforce technical standards, coding practices, tooling, and infrastructure guidelines across the AI/ML engineering teams.
- Lead Operations Excellence for our Agent development platform, establish mechanisms and processes that scale to the engineering organization while raising the bar.
- Oversee availability, performance tuning, failover strategies, capacity planning, and disaster recovery.
Other
- Prior experience as a technical visionary in large-scale, mission-critical projects; ability to align technology strategy with business impact.
- Strong written and verbal communication skills with a track record of mentoring senior and junior engineers; translating complex concepts across engineering and business teams.
- Ability to work with product managers, stakeholders, and business leaders to translate strategic objectives into scalable technical roadmaps.
- Ability to guide customer-facing teams (e.g., consultants, support, sales engineers) to shape AI modernization initiatives via agents.
- Bachelor's degree or higher in a relevant field