Super Micro Computer is looking for an innovative leader to fill a Sr. Engineer, software position to develop, test, and deploy AI/ML infrastructure software
Requirements
- Expertise in Nvidia AI/LLM software stack, such as NVAIE, NIM, CUDA toolkit, GPU driver, Nvidia Container Toolkit, Docker Engine, Nvidia System Management (NVSM), Data Center GPU Management (DCGM), etc.
- Experience and demonstrable knowledge of AI/LLM, and familiarity with common frameworks and methodologies utilized in building LLM applications
- Proficiency in Python
- Proven expertise in LLM training, development, and fine-tuning
- Ability to quickly learn new technologies, frameworks, and algorithms
- Strong analytical, problem-solving, and communication skills
- MS or BS in Computer Science, Computer Engineering, or equivalent experience in AI/LLM, deep learning and related software technologies preferred
Responsibilities
- Develop, test, and deploy LLM applications using Nvidia AI/ML software infrastructure and frameworks
- Optimize and fine-tune LLM models for enhanced performance, efficiency, and accuracy
- Develop the technology and create the solution to solve the defined problem
- Manage multiple tasks simultaneously in a fast-paced environment
- Perform all other duties, tasks or projects as assigned
- Collaborate with cross-functional teams to deliver user-friendly and innovative solutions
- Provide timely, accurate, and quality work product
Other
- Regular in-office attendance
- Ability to work in a highly interactive, cross-functional collaborative environment
- Self-driven, collaborative, and committed to delivering high-quality work
- Ability to quickly learn new technologies, frameworks, and algorithms
- Strong analytical, problem-solving, and communication skills