Vast.ai is looking to improve the performance, reliability, and capability of its infrastructure and containerization technologies.
Requirements
- Strong programming skills in at least one language, ideally C++
- Extensive knowledge of Linux kernel internals, containerization technologies, and virtualization
- Deep understanding of workload and network isolation techniques in multi-tenant environments
- Experience in securing and hardening cloud infrastructure, particularly in environments with untrusted workloads
- Strong background in network security and cloud-native security practices
- Experience with GPU programming and an understanding of GPU-specific security concerns
Responsibilities
- Expand and extend the GPU cloud daemon
- Design and deploy market-based resource management systems
- Harden code and infrastructure to meet zero-trust standards
- Benchmark, profile, and eliminate bottlenecks across hypervisor, container, and network layers
Other
- Full-time
- On-site at our SF or LA office
- Strong intrinsic drive, a true passion for advancing the state of the art, and a mix of architecture, coding, and communication skills