Ensure the consistent operation and reliability of the IKS AIR Platform across Intuit.
Requirements
- Expert knowledge of Kubernetes architecture, networking, troubleshooting tools and automation
- Proven expertise in building, upgrading, and managing Kubernetes clusters
- Experience with the development of applications and services in AWS, leveraging native cloud services and APIs
- Hands-on experience working with Kubernetes, Docker, Springboot, React, and DynamoDB
- Experience in using java unit testing frameworks like JUnit and Mockito
- Knowledge of monitoring and logging tools such as Splunk, Wavefront, AWS CloudTrails, and Micrometer
- Experience working in Java and Linux environments
Responsibilities
- Build, operate, and scale IKS AIR services running on AWS
- Contribute to critical platform components FMEA and Chaos Engineering
- Develop observability components with massive scale for platforms
- Build automation for operational insights and analytics
- Diagnose and troubleshoot build, deployment and infrastructure issues
- Debugging and automating day-to-day operational tasks
- Provide application operational support from inception to retirement
Other
- A Bachelor's degree in Computer Science or a related technical field with a focus on AI, Software Development, and Operations
- Excellent communication skills and the ability to collaborate effectively
- Participate in IKS AIR support rotations along with the IKS Production Engineering Team
- Drive and own Root Cause Analysis (RCA) for specific applications