GitHub is looking to improve the reliability, safety, and automation of its compute platform to support internal engineering teams and enable the company to scale.
Requirements
- 4+ years’ technical experience in infrastructure domains (e.g., container orchestration engineering, platform engineering, database engineering, software engineering, network engineering, systems administration, or related field)
- 2+ years building and supporting large, high traffic applications at scale within platform/infrastructure domains
- 2+ years supporting and building cloud native workloads in Azure, AWS or Google Cloud
- 4+ years’ experience with Azure, or any other Cloud Provider
- Experience building or maintaining planetary scale engineering systems
- Experience working with a remote, distributed team
- Demonstrated expertise in working with cloud environments and Cloud Native Compute Foundation (CNCF) concepts
Responsibilities
- Contribute to the design and implementation of reliable, performant, and secure systems that support GitHub’s compute platform (e.g. hypervisor, lifecycle, VM runtime, automation, image pipelines, and configuration tooling)
- Build features and automation that reduce operational toil and increase the predictability and safety of platform operations.
- Maintain and improve existing compute and lifecycle services, including GC2 components, fleet management workflows, and base OS/container image pipelines.
- Write, review, and maintain high-quality code while following GitHub engineering best practices.
- Collaborate with partner engineering teams to help them adopt platform capabilities, debug issues, and integrate with Platform APIs and workflows.
- Participate in on-call rotations for the services and systems owned by the organization, contributing to incident response, reliability improvements, and follow-up work.
- Contribute to clear, thoughtful documentation and to asynchronous communication patterns that support a distributed engineering environment.
Other
- Bachelor's degree in computer science, Information Technology, or related field
- 2+ years’ technical experience in infrastructure domains (e.g., container orchestration engineering, platform engineering, database engineering, software engineering, network engineering, systems administration, or related field)
- Strong written and verbal communication skills
- Ability to work in an environment optimized for asynchronous work and written communication
- Ability to work with a remote, distributed team