The UXE Platform team needs to provide core platforms, golden paths, operational integrity, and AI acceleration for exceptional digital experiences. This role will focus on discovering and implementing AI tooling capabilities that guide tactical decisions and drive strategic initiatives.
Requirements
- Deep understanding and extensive hands-on experience with major cloud platforms, specifically AWS and OpenShift, including operational experience from both tactical and strategic viewpoints
- Strong proficiency in multiple programming languages. Experience with Python, Node.js, and/or PHP is a significant plus.
- Extensive full-stack web application development experience and a clear understanding of the request stack, including caching headers, HTTP verbs, and common errors.
- Strong knowledge of monitoring and alerting tools (e.g., Prometheus, Grafana, Datadog) and experience with configuration management tools (e.g., Ansible, Puppet, Chef)
- Exceptional problem-solving, troubleshooting, and analytical skills, with a proven ability to navigate and resolve complex technical challenges in production environments.
- Passion for designing, building, and operating highly reliable, scalable, and secure distributed systems
- Demonstrated proficiency in using LLMs (e.g., Google Gemini), as relevant, for tasks such as brainstorming solutions, deep research, summarizing complex technical documentation, drafting communications, and enhancing problem-solving efficiency across the development lifecycle
Responsibilities
- Provide expert technical leadership and advice to the Director and broader leadership, offering critical insights that guide tactical decisions and ensure technical alignment with overall UXE Platform engineering objectives through the strategic integration and development of AI-powered solutions
- Lead and conduct in-depth technical discovery and analysis for complex platform initiatives, including AI enablement efforts (e.g., architectural discovery for services like the Customer Portal or Data Lake).
- Translate high-level strategic goals into concrete technical requirements, propose robust solutions, and effectively delegate implementation tasks to team members
- Drive initiatives to improve operational efficiency, system reliability, and performance by designing and implementing AI-powered solutions and automation within our core platform software
- Contribute to the strategic enhancement of our core platforms with AI capabilities, supporting the integration and operationalization of AI/ML models, tools, and frameworks to accelerate AI adoption within UXE.
- Lead incident response efforts, conduct thorough root cause analysis, and identify opportunities to automate manual tasks and processes, thereby improving system stability, reducing errors, and minimizing operational toil
Other
- 13+ years of progressive experience in software engineering, with a strong focus on platform engineering, DevOps, Site Reliability Engineering (SRE), or related infrastructure-focused roles
- Proven experience in a technical leadership role, demonstrating the ability to guide teams, provide technical direction, and influence architectural decisions
- Act as a lead mentor and coach for software engineers, fostering a culture of continuous learning, professional development, and the adoption of cutting-edge best practices within the team and across the organization.
- Outstanding written and verbal communication skills to articulate complex technical and architectural concepts to diverse audiences, from engineers to leadership, and foster effective collaboration.
- Explore and experiment with emerging AI technologies relevant to software development, proactively identifying opportunities to incorporate new AI capabilities into existing workflows and tooling