The Atlantic is looking to maintain infrastructure stability, improve security and developer experience, and optimize cloud costs within their DevOps team
Requirements
- 4+ years of DevOps or related experience
- Solid experience with Terraform (including modules, workspaces) and Kubernetes
- Comfortable writing code
- Deep experience with AWS services (EC2, VPC, RDS, Lambdas)
- Expertise in Fastly CDN and VCL configuration
- Terraform Cloud (workspace management, policies)
- ArgoCD or other GitOps tooling expertise
Responsibilities
- Build and maintain the infrastructure that powers our applications
- Own the stability and security of our AWS-hosted infrastructure—Kubernetes clusters, Terraform stacks, CI/CD pipelines, and monitoring
- Collaborate with our System Architects as well as feature teams to improve observability and incident readiness—training them on how to use our dashboards, logs, and alerts
- Build and maintain internal tools (Python or Go preferred) that make fellow engineers’ lives easier—think chat-ops bots, auto-scaling workflows, and deployment automation
- Keep the lights green: upgrade versions, patch vulnerabilities, optimize cloud spend, and respond to incidents with calm, data-driven fixes
- Evolve our observability stack (DataDog, Grafana, Prometheus) so teams can spot issues before customers do
- Contribute to design docs and runbooks in Confluence; review code in GitHub; ship via GitHub Actions, Terraform Cloud, and Argo CD
Other
- Strong communication skills—you’ll talk with other teams to understand their needs and help build the right solutions
- Share on-call responsibilities (on a supportive, balanced rotation)
- 4+ years of experience
- Bachelor's degree or equivalent experience
- The Atlantic requires all employees to be vaccinated against COVID-19, including subsequent boosters, and submit proof of vaccination status