CAE Defense & Security is looking to enhance its AI & Data Science team by hiring a Machine Learning Engineer to design and deploy scalable NLP systems, optimize LLM performance, and integrate these solutions into real-world products to improve user experience and business outcomes.
Requirements
- Proficiency in ML frameworks (PyTorch, TensorFlow, Scikit-learn, CUDA).
- Good understanding of distributed systems, understanding of microservice architecture and REST APIs.
- Strong understanding of MLOps tools and practices (MLflow, Airflow, DVC).
- Hands-on experience with Hugging Face Transformers, LangChain, and OpenAI APIs.
- Technology proficiency with cloud platforms (AWS, GCP, Azure), Linux, and container orchestration (Docker, Kubernetes).
- Proven track record of deploying ML models in production environments.
- Experience in working with SQL/NoSQL database systems such as MySQL, MongoDB or Elasticsearch.
Responsibilities
- Design, develop, and deploy machine learning models for real-world applications.
- Build scalable data pipelines and model training workflows using modern tools and frameworks.
- Conduct rigorous model evaluation, validation, and performance tuning.
- Monitor and maintain deployed models, ensuring reliability and accuracy over time.
- Design, fine-tune, and deploy LLMs (LLaMA, Mistral, etc.) for various NLP tasks such as summarization, question answering, semantic search, and chatbots.
- Develop scalable and efficient model serving infrastructure using tools like ONNX, TensorRT, DeepSpeed, or vLLM.
- Implement retrieval-augmented generation (RAG) pipelines using vector databases (e.g., FAISS, Weaviate, Pinecone, Milvus).
Other
- This position is onsite with locations in Tampa FL, Arlington TX, or Orlando FL.
- Collaborate with cross-functional teams to understand business requirements and translate them into ML solutions.
- Stay current with the latest research and trends in machine learning, AI & LLMs.
- Mentor junior engineers and contribute to team knowledge sharing.
- Document processes, models, and decisions for transparency and reproducibility.