Tubi's Infrastructure team needs to build and operate core platforms that power their services at scale, ensuring reliable, scalable, and developer-friendly systems for compute, networking, observability, and deployment. The Senior Software Engineer will be responsible for ensuring reliable service delivery and efficient traffic management across large-scale Kubernetes environments.
Requirements
- 5+ Years experience in IaC with a Cloud Provider (AWS)
- 3+ Years of experience with production Kubernetes Clusters
- Hands-on experience managing Kubernetes in production environments.
- Strong understanding of service mesh technologies (Istio, Envoy, or similar).
- Expertise in CI/CD workflows and tools such as ArgoCD, FluxCD, GitHub Actions, or Jenkins.
- Solid foundation in Linux, networking, and containerization.
- Programming skills in Go, Python for automation tooling.
Responsibilities
- Manage and scale multi-cluster Kubernetes deployments, ensuring high availability, performance, and reliability.
- Design and implement traffic strategies (e.g., canary releases, blue/green deployments, A/B testing, gradual rollouts) using Istio/Envoy or similar service mesh technologies.
- Build and maintain CI/CD pipelines, automate deployments and rollbacks, and improve release efficiency and reliability.
- Use Terraform and other IaC tools to provision and manage cloud infrastructure, ensuring consistency and auditability.
- Establish monitoring, logging, and tracing solutions; troubleshoot and resolve production issues quickly to maintain system stability.
- Write and maintain clear technical documentation (system architecture, release processes, traffic policies, runbooks, best practices) to enable effective onboarding and collaboration.
- Partner with developers, SREs, and platform teams to design scalable release and traffic strategies, and drive adoption of engineering best practices.
Other
- Strong technical writing skills—able to produce clear, structured documentation for both technical and non-technical audiences.
- Strong problem-solving skills, with proven experience in high-pressure incident response.
- Excellent communication and collaboration skills, with a mindset for driving engineering efficiency and quality.
- LI-Hybrid
- The pay range for this role, with final offer amount dependent on education, skills, experience, and location is listed annually below.