The Atlantic is looking to maintain infrastructure stability, improve security and developer experience, and optimize cloud costs within their DevOps team
Requirements
4+ years of DevOps or related experience
Solid experience with Terraform (including modules, workspaces) and Kubernetes
Comfortable writing code
Deep experience with AWS services (EC2, VPC, RDS, Lambdas)
Expertise in Fastly CDN and VCL configuration
Terraform Cloud (workspace management, policies)
ArgoCD or other GitOps tooling expertise
Responsibilities
Build and maintain the infrastructure that powers our applications
Own the stability and security of our AWS-hosted infrastructure—Kubernetes clusters, Terraform stacks, CI/CD pipelines, and monitoring
Collaborate with our System Architects as well as feature teams to improve observability and incident readiness—training them on how to use our dashboards, logs, and alerts
Build and maintain internal tools (Python or Go preferred) that make fellow engineers’ lives easier—think chat-ops bots, auto-scaling workflows, and deployment automation
Keep the lights green: upgrade versions, patch vulnerabilities, optimize cloud spend, and respond to incidents with calm, data-driven fixes
Evolve our observability stack (DataDog, Grafana, Prometheus) so teams can spot issues before customers do
Contribute to design docs and runbooks in Confluence; review code in GitHub; ship via GitHub Actions, Terraform Cloud, and Argo CD
Other
Strong communication skills—you’ll talk with other teams to understand their needs and help build the right solutions
Share on-call responsibilities (on a supportive, balanced rotation)
4+ years of experience
Bachelor's degree or equivalent experience
The Atlantic requires all employees to be vaccinated against COVID-19, including subsequent boosters, and submit proof of vaccination status