Citi is seeking to drive the design, development, and integration of state-of-the-art Generative AI solutions across its enterprise Controls Technology platform to enhance automation and operational efficiency.
Requirements
- Strong hands-on experience with LLMs and fine-tuning methods such as LoRA, QLoRA, Adapter/Prefix Tuning, and instruction tuning.
- Practical knowledge of model optimization (compression, quantization) and familiarity with tools such as DeepSpeed, vLLM, GPTQ, or similar.
- Proficiency in prompt engineering and familiarity with prompt design tools/frameworks.
- Experience building RAG systems, including hybrid search and multi-vector retrieval.
- Proficiency with machine learning frameworks (PyTorch, TensorFlow, Keras) and distributed training.
- Strong skills in NLP (NER, Dependency Parsing, Text Classification, Topic Modeling), transfer learning, and advanced learning paradigms.
- Experience with containerization (Docker), orchestration (Kubernetes), and CI/CD pipelines for ML models.
Responsibilities
- Collaborate with AI architects, leads, and stakeholders to design and implement generative AI solutions that address business challenges.
- Develop, fine-tune, and optimize large language models (LLMs), leveraging both parameter-efficient techniques and full fine-tuning where applicable.
- Implement and experiment with advanced generative AI methods, including prompt engineering and Retrieval-Augmented Generation (RAG).
- Support the integration of AI models into production environments, ensuring robust deployment, scalability, and maintainability.
- Contribute to the development and optimization of real-time and streaming AI solutions.
- Stay current with the latest advances in generative AI and actively share knowledge with the team.
- Ensure adherence to ethical AI guidelines, data privacy, and compliance standards.
Other
- Strong collaboration skills to work effectively in cross-functional teams.
- Analytical and proactive approach to problem-solving.
- Clear communication skills for both technical and non-technical audiences.
- Eagerness to learn, innovate, and mentor less experienced developers.
- 5+ years of experience in AI/ML, including significant experience in Generative AI.