NVIDIA is seeking a Software Architect to define and own the technical vision for the NVIDIA Inference Microservices (NIM) Factory, with the aim of building, deploying, and scaling enterprise-grade AI services.
Requirements
- Deep architectural expertise in cloud-native technologies, including Kubernetes, containers, and microservices.
- Proficiency in languages like Python for building tooling and services.
- Experience architecting solutions for GPU-accelerated or other high-performance computing workloads.
- Hands-on experience with LLM inference stacks (Triton Inference Server, TensorRT-LLM, vLLM).
- Experience optimizing large-model serving (KV cache sharding/paging, tensor/sequence parallelism, speculative decoding, dynamic batching).
- Experience architecting next-generation container build systems or CI/CD platforms at scale.
- Background with workflow orchestration engines (e.g., Temporal, Airflow) for complex, distributed processes.
Responsibilities
- Define the end-to-end technical architecture for the NIM Factory, from container build systems and CI/CD to Kubernetes deployment patterns and runtime optimization.
- Drive technical strategy and roadmap, making high-impact decisions on frameworks, technologies, and standards that empower dozens of engineering teams.
- Architect and influence the design of workflow orchestration systems that underpin the NIM Factory.
- Advocate for software development guidelines covering API composition, automation, observability, and secure supply chain management.
- Collaborate with leadership across research, backend, SRE, and product to align technical vision with product goals and influence technical roadmaps.
- Guide and support senior engineers throughout the organization in building a culture centered on technical excellence and innovation.
Other
- 15+ years of experience building large-scale, production distributed systems.
- Consistent track record in a technical leadership or architect role, both setting technical direction and implementing it.
- Exceptional ability to mentor and grow senior engineers, with a passion for raising the technical bar of the entire organization.
- Excellent communication and collaboration skills, with the ability to articulate complex technical concepts to diverse audiences and drive consensus.
- A BS or MS degree in Computer Science, Computer Engineering, or a related field, or equivalent experience.