Red Hat OpenShift Service on AWS (ROSA) is a fully-managed, enterprise-grade Kubernetes service that combines the power of Red Hat OpenShift with the flexibility and scale of the AWS public cloud. The company is looking to leverage AI across development, operations, and testing workflows to simplify processes, reduce complexity, and enhance efficiency.
Requirements
- Hands-on experience with container technologies, including Kubernetes and OpenShift
- Experience with multiple hyperscaler platforms, such as AWS, GCP, and Azure
- Deep technical expertise with the ability to navigate from high-level system and software architecture to detailed design, code review, and problem-solving
- Comprehensive understanding of software development life cycle, project management, quality assurance, and customer advocacy in large-scale environments
- Experience leading Site Reliability Engineering (SRE) initiatives, including building reliable, scalable systems, monitoring, and incident response
- Experience applying AI or ML techniques in software development, testing, or operational workflows (e.g., predictive monitoring, intelligent automation, AI-assisted development tools).
Responsibilities
- Lead a global engineering team to design, develop, operate, and deliver the ROSA service and associated features/outcomes
- Drive technical discussions, architecture design, cross-team engineering collaboration, and engagement with customers and partners
- Manage the day-to-day activities of the team, coordinate with other contributing teams, and own the delivery of features, updates, and operational excellence.
- Collaborate with team leads, architects, and engineers on product design, architecture, and technical direction
- Work closely with cross-functional teams—including Product Management, Documentation, and Support—to ensure a high-quality service experience for customers.
- Partner with Red Hat’s global customer and partner support teams to resolve escalated issues efficiently.
- Champion the adoption of AI within the team to improve development, testing, and operational workflows
Other
- 4+ years managing software engineering teams, including development, testing, DevOps, and productization of cloud services using Agile methodologies
- Demonstrated ability to translate business problems into technical solutions and lead teams through ambiguity and change
- Strong organizational skills, including planning and accelerating initiatives, proactive risk mitigation, and leading global engineering teams
- Coach and mentor team members, providing regular feedback and supporting career development and growth.
- Ensure ethical AI use, addressing data privacy, bias mitigation, intellectual property, and responsible disclosure