Cadmus is seeking an experienced AI/ML Engineer to support advanced AI projects that leverage both traditional machine learning and modern generative AI technologies using cloud-native services.
Requirements
- Proficiency in Python and experience building scalable backend services with frameworks such as FastAPI.
- Experience with cloud platforms (AWS, Azure, or GCP) and their AI/ML services including compute services, SDKs, and data storage solutions.
- Experience using Large Language Models along with related tools, libraries and frameworks, and strong expertise in using vector databases and embedding technologies.
- Experience with fine-tuning, evaluating, deploying and monitoring large language models.
- Experience using RAG (retrieval-augmented generation) techniques to implement information retrieval.
- Experience implementing GenAI applications including chatbots and agentic workflows.
- Experience with MLOps and AI model lifecycle management including CI/CD pipelines.
Responsibilities
- Work with cross-functional team members to understand business needs and develop AI/ML solutions incorporating generative AI technologies.
- Design scalable AI/ML systems, and implement and deploy GenAI applications, including large language models, multimodal AI systems, and retrieval-augmented generation (RAG) architectures, for enterprise knowledge management and chatbot applications.
- Develop multi-agent AI systems that coordinate complex agent interactions, using frameworks such as LangGraph.
- Participate in code reviews and establish coding best practices for developing and maintaining clean, efficient, and well-documented Python code.
- Drive the end-to-end model development lifecycle, from research to production deployment, for both traditional ML and GenAI workflows.
- Lead fine-tuning initiatives for large language models and establish evaluation frameworks for model performance assessment and improvement.
- Collaborate with stakeholders to define technical roadmaps and project timelines.
Other
- Bachelor's or Master's degree in Computer Science, Machine Learning, AI, or related field.
- 4-6 years of hands-on experience in AI/ML engineering, including multiple production-level GenAI projects and a proven track record of leading technical initiatives.
- Exceptional analytical and problem-solving skills.
- Excellent communication and teamwork abilities with experience presenting to clients and other stakeholders.
- Candidates must be eligible to work in the United States as a U.S. Permanent Resident or U.S. Citizen.