Nutanix is hiring for a client-facing role that designs and implements scalable AI/ML infrastructure solutions on its HCI platforms. The role requires expertise in AI/ML infrastructure, client-facing communication, and problem-solving.
Requirements
- 3-5+ years of experience in designing and implementing infrastructure for data-intensive or AI/ML workloads.
- Strong expertise in container orchestration with Kubernetes.
- Knowledge of the deployment lifecycle and resource requirements of AI/ML models, including LLM inference.
- Proficiency in Infrastructure as Code (IaC) tools like Terraform or Ansible.
- Experience with GPU acceleration technologies (e.g., NVIDIA CUDA) in containerized environments.
- Familiarity with monitoring tools such as Prometheus and Grafana.
- Experience with Hyperconverged Infrastructure (HCI) platforms is a strong plus.
Responsibilities
- Design and implement scalable AI/ML infrastructure solutions on Nutanix HCI platforms, tailored to customer requirements.
- Advise clients on best practices for configuring Kubernetes for efficient management of AI/ML workloads.
- Collaborate with cross-functional teams to align infrastructure capabilities with client AI/ML initiatives and strategies.
- Develop and maintain documentation for solution configurations, deployment guides, and knowledge-sharing resources.
- Monitor and optimize infrastructure performance, ensuring high availability and resource efficiency for AI model execution.
- Act as a subject matter expert, providing guidance on GPU acceleration technologies and their integration in AI workflows.
- Engage with clients to troubleshoot issues and provide ongoing support for their AI/ML infrastructures.
Other
- Excellent client-facing communication skills.
- Relentless drive to solve complex problems.
- Achieve measurable improvements in client satisfaction and project outcomes within the first year, as evidenced by successful deployments and positive client feedback.
- Strong problem-solving, analytical, and troubleshooting abilities.
- Proven ability to work independently in a dynamic, customer-facing environment.
- Active Top Secret Clearance is a strong plus.
- Up to 30% travel required.