4Minds is seeking to solve the problem of enabling organizations to build and operate private, domain-specific AI systems that can learn continuously from live data and be deployed on-prem or in the cloud. The goal is to empower AI teams and scale engineering teams for rapid AI deployment and ROI.
Requirements
- Proficiency in Python and hands-on experience with TensorFlow and PyTorch for model development and training.
- Demonstrated experience of training and deploying large models on GPUs, ensuring high-performance outputs.
- Strong expertise in Nvidia’s AI stack, including CUDA, TensorRT, and NVLM for model optimization and deployment.
- Experience in setting up and running A/B testing experiments to evaluate and improve AI model accuracy and performance.
Responsibilities
- Develop, train, and fine-tune machine learning models with a focus on improving core AI system functionalities.
- Collaborate with data engineers to preprocess and clean data for efficient model ingestion and training.
- Experiment with Nvidia’s NVLM models to push the boundaries of AI performance and optimize models using TensorRT for efficient deployment on CUDA hardware.
- Design and implement A/B testing frameworks, set up experiments, and analyze results to measure and enhance model performance.
Other
- Master’s degree in Computer Science, Software Engineering, AI, Mathematics, Physics, or a closely related technical field; strong academic background preferred.
- Doctor of Philosophy in Computer Science, Software Engineering, AI, Mathematics, Physics, or a closely related technical field; strong academic background preferred.
- Ability to work on-site 5 days a week, full time from our Dallas, TX office.
- Coachable and humble individual looking to turn themselves into a top 1% engineer.