xAI is building software, services, and frameworks for its network infrastructure and needs a skilled engineer to design, deploy, operate, and monitor the network.
Requirements
- Python
- Go
- TCP/IP
- BGP
- RDMA
- Expert knowledge of designing scalable and reliable software from the ground up
- Deep experience collaborating daily with network engineers, drawing on extensive knowledge of physical and logical network topologies and of network protocols
Responsibilities
- Building software and tools with extensive metrics coverage for large GPU supercomputing network fabrics
- Implementing infrastructure-as-code (IaC) best practices, enhancing deployment pipelines, and ensuring robust, secure service delivery
- Designing scalable and reliable software from the ground up that can build and orchestrate tens of thousands of network devices
- Collaborating daily with network engineers, drawing on extensive knowledge of physical and logical network topologies and of network protocols
- Creating metrics that help prioritize the team's focus and your own
Other
- Strong communication skills
- Ability to concisely and accurately share knowledge with teammates
- Ability to thrive in ambiguity
- Travel expected to Palo Alto for inter-team collaboration and to the data center
- Bachelor's, Master's, or Ph.D. degree in Computer Science or a related field (implied, not stated explicitly)
- Annual Salary Range: $180,000 - $440,000 USD