The Global Technology team at a major investment firm is looking to build and enhance AI/ML infrastructure and application patterns on AWS to power mission-critical applications, ensuring high availability, durability, and resiliency.
Requirements
- Ability to debug, optimize code, and automate routine tasks.
- Experience with infrastructure automation tools such as Puppet, Ansible, CloudFormation, or Terraform.
- Working knowledge of pipeline-automation tools such as Jenkins, CodePipeline, Azure DevOps, or comparable tools.
- Experience using Git for source control management.
- Proficiency writing code in Python, Node.js, Bash (shell), PowerShell, or similar languages.
- Experience using Docker within container orchestration platforms such as AWS ECS, EKS, Google Anthos, or others.
- Understanding of foundational AWS services such as VPCs, EC2, S3, RDS, Auto Scaling Groups, CloudWatch Logs, etc.
Responsibilities
- Focus on optimizing existing systems, building infrastructure, and eliminating manual work through automation.
- Influence application and security architecture and design across multi-cloud and hybrid-cloud platforms.
- Peer-review infrastructure as code (AWS CloudFormation, Python, Terraform, or similar).
- Partner with application and infrastructure teams to develop reusable cloud patterns.
- Deploy and troubleshoot infrastructure code.
- Partner with the Site Reliability Engineering (SRE) team to conduct post-incident reviews and root cause analysis, and build monitoring and automation to prevent future incidents.
- Identify opportunities to build self-service capabilities and automate infrastructure and application deployments.
Other
- 5+ years of experience with Amazon Web Services (AWS).
- Experience working in an Agile/Scrum-focused organization.
- Strong verbal and written communication skills; comfortable explaining technical problems to non-technical audiences.
- One or more Associate- or Professional-level AWS certifications.
- Prior experience on a DevOps, DevSecOps, SRE, or UNIX/Linux sysadmin team.