Deliver cloud infrastructure automation tools, frameworks, workflows, and validation platforms on public cloud platforms such as AWS, GCP, Azure, or Alibaba.
Requirements
- Deep knowledge of programming in Java, Golang, Python, or Ruby
- Experience owning and operating multiple instances of a critical service
- Experience with Agile development methodology and Test Driven Development
- Experience with critical infrastructure services including, monitoring, alerting, logging, and reporting applications
Responsibilities
- Deliver cloud infrastructure automation tools, frameworks, workflows, and validation platforms on our public cloud platforms such as AWS, GCP, Azure, or Alibaba
- Designing, developing, debugging, and operating resilient distributed systems that run across thousands of compute nodes in multiple data centers
- Using and contributing to open source technology (Spinnaker, Zookeeper, etc.)
- Developing Infrastructure-as-Code using Terraform
- Writing micro-services on containerization frameworks such as Kubernetes, Docker, Mesos
- Resolving complex technical issues and drive innovations that improve system availability, resilience, and performance
- Eat, sleep, and breathe services. You have experience balancing live-site management, feature delivery, and retirement of technical debt
Other
- A related technical degree required
- 3+ years backend software development experience
- Participate in the team’s on-call rotation to address complex problems in real-time and keep services operational and highly available