This role leverages AI models for supply chain systems and products, specifically implementing new LLMs and vision-language models for supply chain applications.
Requirements
- Proficient in PyTorch or TensorFlow, with experience training large models.
- Experience with multimodal tasks (e.g., image-text, vision-audio integration).
- Experience with model fine-tuning techniques like LoRA, QLoRA, and Prompt Tuning.
- Familiar with distributed training frameworks such as DeepSpeed, FSDP, Megatron-LM.
- Hands-on experience with high-performance computing platforms (A100/H100 clusters).
- Familiarity with open-source large multimodal models (LMMs) such as OpenCLIP, LLaVA, MiniGPT, InternVL.
- Knowledge of model compression, AI safety, and data governance.
Responsibilities
- Design, train, and optimize deep learning models, focusing on LLMs, vision models, or generative AI systems, for our in-house applications using Python and Java.
- Build scalable training pipelines, including data preprocessing, distributed training, and model evaluation.
- Develop efficient model inference and deployment systems, improving real-time performance in production.
- Stay up to date with deep learning trends and research, applying model compression, fine-tuning, and distillation techniques.
- Collaborate across teams to implement AI solutions in real-world business scenarios, such as Q&A, content generation, and predictive analytics.
Other
- Master’s degree or above in Computer Science, AI, Electrical Engineering, or a related field.
- 2+ years of experience in deep learning algorithm development.
- Strong engineering mindset and ability to drive project execution independently.
- This position is hybrid, 3 days per week at our Morrisville, NC location.
- Individuals may also be considered for bonus and/or commission.