Natera is looking for a Senior Software Engineer focused on Observability to set observability standards, lead automation efforts, and mentor engineers, ensuring that all monitoring and Datadog configuration changes are implemented as Infrastructure-as-Code (IaC). The role leads the design and management of a code-driven Datadog observability platform that provides end-to-end visibility into Java applications, Kubernetes workloads, and containerized infrastructure, with an emphasis on cost-effective observability at scale.
Requirements
- Proficiency in at least one modern programming language such as Go, Java, C++ or Python, with the ability to design, implement and maintain reusable code libraries, not just scripts
- Proven experience developing and maintaining observability-as-code using tools such as Terraform
- Hands-on experience managing observability platforms such as Datadog, New Relic or Dynatrace as code at scale, using Terraform modules, APIs and CI/CD workflows, with deep expertise in APM, logs, metrics, tracing, dashboards and audit trails
- Experience integrating observability into CI/CD pipelines such as GitLab CI, GitHub Actions or AWS CodePipeline
- Solid understanding of AWS cloud services and monitoring practices for Kubernetes workloads
- Experience designing and implementing custom monitoring libraries, exporters or telemetry pipelines using tools such as OpenTelemetry or Prometheus
- Experience with cost optimization strategies in observability platforms
Responsibilities
- Own and define observability standards for Java applications, Kubernetes workloads and cloud infrastructure
- Configure and manage the Datadog platform using Terraform and Infrastructure-as-Code (IaC) best practices
- Drive adoption of structured JSON logging, distributed tracing and custom metrics across Java and Python services
- Optimize Datadog usage through cost governance, log filtering, sampling strategies and automated reporting
- Collaborate closely with Java developers and platform engineers to standardize instrumentation and alerting
- Troubleshoot and resolve issues with missing or misconfigured logs, metrics and traces, working with developers to ensure proper instrumentation and data flow into Datadog
- Lead incident response efforts using Datadog insights for actionable alerting, root cause analysis (RCA) and reliability improvements
Other
- 6+ years of professional software engineering experience building production-grade systems with emphasis on automation, integrations and infrastructure tooling
- Mentorship experience with the ability to coach teams on observability best practices
- Bachelor’s degree in Computer Science, Engineering, Mathematics, Physics or a related technical field
- The Natera team consists of highly dedicated statisticians, geneticists, doctors, laboratory scientists, business professionals, software engineers and many other professionals from world-class institutions, who care deeply about our work and each other.
- When you join Natera, you’ll work hard and grow quickly.