Software Engineer (Observability & Monitoring) - West Des Moines, IA

AHU Technologies

Salary not specified
Dec 22, 2025
Washington, DC, US

Seeking an experienced Observability and Monitoring Engineer to build and mature enterprise-wide monitoring, logging, alerting, and observability capabilities across an AWS-based technology stack.

Requirements

  • Hands-on experience with monitoring/logging tools such as Zabbix, Graylog, Splunk, SolarWinds, or equivalents
  • 5+ years of hands-on experience with AWS services and architecture
  • Deep understanding of metrics, logs, traces, distributed tracing, and event correlation
  • Experience building dashboards and KPIs for application, infrastructure, and database layers
  • Strong scripting/automation skills (Python, Bash, PowerShell) and familiarity with Terraform or CloudFormation
  • AWS Services (EC2, RDS, S3, Lambda, ECS/EKS, etc.)
  • Monitoring Tools (Dynatrace, CloudWatch, Zabbix, SolarWinds, Graylog, etc.)

Responsibilities

  • You will establish standards for logs, metrics, traces, event correlation, and alerting across multiple environments
  • You will build centralized dashboards and alerting policies that provide unified visibility across applications and services; operating systems; AWS services (EC2, RDS, Lambda, S3, CloudWatch, CloudTrail, etc.); databases (MS SQL Server, PostgreSQL, etc.); file transfer systems (SFTP, managed transfer tools); and batch jobs and scheduled processes
  • You will create actionable and noise-free alerting thresholds, escalation policies, and runbooks
  • You will integrate existing tools (Dynatrace, Graylog, Splunk, SolarWinds, Zabbix) into a cohesive ecosystem
  • You will rationalize tool usage and recommend consolidation or modernization where appropriate
  • You will manage the lifecycle, configuration, tuning, and health of monitoring and logging platforms
  • You will automate monitoring deployments using IaC (CloudFormation) and CI/CD pipelines
  • You will develop reusable templates and standards so teams can onboard new applications quickly
  • You will define SLOs/SLIs and reliability KPIs for critical services

Other

  • Bachelor's degree in Computer Science or related field
  • 5+ years of experience implementing monitoring and observability using Dynatrace
  • Familiarity with ITIL incident/problem management processes
  • Proficiency with AI tools, and responsible use of them to improve observability, preferred
  • Experience with container orchestration and microservices architecture preferred