Choice Hotels is looking to maintain system reliability and performance across its cloud platforms. The role works closely with software engineering and infrastructure teams to ensure seamless operations and rapid incident resolution.
Requirements
- Strong proficiency in Python and Java for automation and scripting.
- In-depth experience with AWS services including EC2, S3, Lambda, CloudWatch, and others.
- Expertise in CI/CD tools such as Jenkins, GitLab CI, and Bitbucket Pipelines.
- Proven experience building monitoring dashboards and custom metrics for proactive observability.
- Strong problem-solving and troubleshooting skills across distributed systems and cloud infrastructure.
- Hands-on experience with Terraform or AWS CloudFormation for infrastructure automation.
- Familiarity with Docker, Kubernetes, or other containerization technologies.
Responsibilities
- Design, implement, and manage scalable, reliable, and secure infrastructure solutions on AWS.
- Develop and maintain Lambda functions, manage S3 data storage, and ensure seamless cloud service integration.
- Support high availability and disaster recovery initiatives across production environments.
- Build and maintain CI/CD pipelines using tools such as Jenkins, Bitbucket Pipelines, and GitLab CI.
- Automate deployment, monitoring, and infrastructure provisioning using Python and Java.
- Implement Infrastructure as Code using Terraform or CloudFormation for consistency and repeatability.
- Develop and maintain dashboards and alerts using Datadog, CloudWatch, Grafana, and Prometheus.
Other
- Bachelor’s degree in Computer Science, Information Technology, or related field.
- 2-4 years of hands-on experience in Site Reliability Engineering, DevOps, or a related technical role.
- Excellent communication skills and a collaborative, team-oriented mindset.
- A proactive approach to problem-solving.
- Must be eligible to work in the US without sponsorship.