Site Reliability Engineer

NBCUniversal

$110,000 - $145,000
Sep 24, 2025
Englewood, NJ, USA

NBCUniversal is looking for a Site Reliability Engineer to support live channel distribution on the Video Streaming Engineering team, focusing on maintaining and diagnosing distribution systems, preventing on-air issues, and supporting future technological solutions in the broadcast environment.

Requirements

  • 5+ years of DevOps/SRE experience in the technology sector delivering production-quality software or software-defined infrastructure in a high-traffic, cloud-hosted environment (AWS preferred)
  • Experience with deployment automation within AWS-hosted services (CloudFormation, Terraform, Ansible)
  • Familiarity with containerization and orchestration services such as Kubernetes and Docker
  • Familiarity with CI/CD orchestration tools (e.g., GitHub Actions or Jenkins)
  • Experience with CI/CD build and deployment practices
  • 3-5 years of Linux system administration experience
  • 3-5 years of experience coding in Go, Python, Ruby, Java, or shell languages

Responsibilities

  • Investigate issues within broadcast systems and their integration points to find the root cause of problems or systemic issues.
  • As a Level 2 resource, drive and own investigations related to Broadcast issues and report back findings in a timely manner to leadership and operations.
  • Follow up with team members and third-party vendors when issues cannot be resolved, and push vendors for root cause and solutions where possible.
  • Create comprehensive documentation for each encountered issue, explaining the root cause and the steps for effective resolution.
  • Assist in the deployment and testing of vendor patches or fixes in both the Development and Production environments, through completion and to the satisfaction of the Operations team.
  • Assist in the design, analysis, or evaluation of assigned projects using sound engineering principles and adhering to business standards, practices, procedures, and product / program requirements.
  • Support and participate in On-air systems integration and on-air rollout.

Other

  • Provide 24x7 On-Air systems support and daily operations support; some on-call support may be required from time to time during on-air rollout and special broadcast events.
  • Attend daily maintenance and operations review calls to report to leadership and Operations on findings from new and open issues, their potential fixes, and the planned deployments of those fixes.
  • A passion for investigating issues, driving toward resolution, and solving problems effectively
  • Willingness and ability to prioritize business needs to meet short-term demands
  • An unwillingness to tolerate user-facing downtime