Riot Games is looking to build a new ML Platform team from the ground up to support high-impact ML use cases across their games and internal products. The goal is to create a robust, cost-efficient, and extensible platform for global ML inference at scale, model deployment, observability, live testing, and model versioning.
Requirements
- Working knowledge of production ML systems, including model inference, CI/CD for ML artifacts, observability, and cost optimization
- Experience with cloud-native orchestration architectures (e.g., Kubernetes, GPU scheduling, container orchestration)
- Experience founding or scaling global infrastructure/platform teams from the ground up
- Demonstrated ability to lead delivery of technical products used by other developers, engineers, or researchers
- Familiarity with MLOps and model serving tools (e.g., MLflow, KServe, BentoML, TorchServe, Seldon Core, DVC, LakeFS)
- Exposure to A/B testing infrastructure, especially in online or latency-sensitive environments
- Prior experience with budgeting, CPU & GPU utilization tracking, and platform efficiency work
Responsibilities
- Build and lead a new ML Platform team from the ground up—recruiting, mentoring, and growing both ICs and future leads.
- Drive the team’s roadmap and execution—balancing foundational investments with fast, iterative delivery of user-visible platform capabilities.
- Own the delivery of the ML Platform’s early feature set: scalable inference serving, model artifact CI/CD, versioning, testing environments (A/B, shadow), and observability.
- Coordinate cross-team dependencies and build strong partnerships with platform engineering, SRE, security, and product stakeholders.
- Ensure the platform meets critical non-functional goals such as cost-efficiency, operational reliability, and regional availability.
- Represent the ML Platform team in broader AI Foundations and Riot-wide planning forums—connecting strategy to execution and ensuring alignment with Riot’s broader technical ecosystem.
- Collaborate with technical leadership (including Riot, AI Foundations, and the ML Platform P5 Principal Engineer) to ensure architectural decisions are grounded in Riot’s long-term needs.
Other
- Set the cultural and operational foundations for a sustainable, inclusive, and high-performing engineering team.
- Align closely with data scientists, ML engineers, and game product teams to deeply understand workflows, pain points, and infrastructure needs.
- Champion developer experience, designing for usability and self-service adoption from day one.
- Strong execution skills: the ability to translate long-term vision into well-scoped milestones, track progress, and unblock teams
- Comfortable making trade-offs between velocity, cost, and maintainability while managing stakeholder expectations