NVIDIA is seeking a Senior Systems Software Engineer to architect and deliver solutions that improve efficiency, stability, and scalability for their AV testing infrastructure, supporting the growing need for advanced perception and cognitive capabilities in AI-powered applications.
Requirements
- 12+ years designing and building distributed systems with a strong foundation in Linux systems and infrastructure engineering.
- Proficiency in Python and deep experience working with container orchestration and cloud/on-prem environments (Kubernetes, Docker, VMs).
- Familiarity with monitoring, logging, and alerting tools such as Grafana, Prometheus, and ELK stack.
- Proven ability to debug complex systems spanning infrastructure, build processes, and workload execution
- Demonstrated technical leadership and architectural impact on critical, large-scale distributed systems deployed in production.
- Experience building developer tools and automation pipelines that significantly improve build reliability and reduce integration times at scale.
- Experience working in autonomous vehicle or related real-time, safety-critical systems is a plus.
Responsibilities
- Architect, design, and implement distributed infrastructure solutions to support AV software builds, large-scale simulation testing, and real-time observability.
- Innovate developer tooling and automation frameworks to mitigate integration challenges, avert regressions, and uphold quality standards.
- Design comprehensive metrics to monitor system health, workload quality, and resource utilization across complex compute and storage environments.
- Collaborate deeply with multi-functional AI teams to translate their requirements into scalable, future-proof platforms that boost efficiency and accelerate innovation.
- Serve as a technical leader, driving architecture decisions and guiding project execution across the entire platform.
- Partner with software, hardware, safety, and product management teams to translate product visions into actionable architecture and design documentation.
- Take ownership of driving proof-of-concept projects, writing technical proposals, and championing novel infrastructure solutions that tackle complex, unsolved challenges.
Other
- BS or MS or equivalent experience in Computer Science, Computer Architecture, Electrical Engineering or related field.
- Proven capability to collaborate effectively and communicate clearly, with a history of successful partnerships with engineering teams and business partners.
- Demonstrated problem-solving mentality with a drive to own solutions end-to-end and thrive in a fast-paced, innovative environment.
- Strong interpersonal skills, capable of leading across geographies and organizational boundaries
- Passion for innovation — proven ability to lead proof-of-concept projects and write compelling technical proposals that drive new initiatives.