Capital One is looking to leverage AI and Generative AI technologies, specifically Large Language Models (LLMs), to transform financial services and create next-generation customer experiences. The AI Foundations LLM Customization team is central to this vision, aiming to apply state-of-the-art AI to business challenges.
Requirements
- At least 2 years of experience leveraging open source programming languages for large scale data analysis
- At least 2 years of experience working with machine learning
- At least 2 years of experience utilizing relational databases
- At least 1 year of experience working with AWS
- At least 5 years’ experience in Python, Scala, or R for large scale data analysis
- At least 5 years’ experience with machine learning
- LLM
- PhD focus on NLP or Masters with 5 years of industrial NLP research experience
- Multiple publications on topics related to the pre-training of large language models (e.g. technical reports of pre-trained LLMs, SSL techniques, model pre-training optimization)
- Member of team that has trained a large language model from scratch (10B + parameters, 500B+ tokens)
- Publications in deep learning theory
- Publications at ACL, NAACL and EMNLP, Neurips, ICML or ICLR
- PhD focused on topics related to guiding LLMs with further tasks (Supervised Finetuning, Instruction-Tuning, Dialogue-Finetuning, Parameter Tuning)
- Demonstrated knowledge of principles of transfer learning, model adaptation and model guidance
- Experience deploying a fine-tuned large language model
Responsibilities
- Partner with a cross-functional team of data scientists, applied researchers, software engineers, machine learning engineers and product managers to deliver AI powered products that change how customers interact with their money.
- Leverage a broad stack of technologies — Pytorch, Hugging Face, AWS Ultraclusters, LangChain, VectorDBs, and more — to reveal the insights hidden within huge volumes of numeric and textual data.
- Be the expert in Natural Language Processing (NLP) to harness the power of Large Language Models (LLMs), adapt and finetune them for business specific applications and features.
- Build NLP models through all phases of development, from design through training, evaluation, and validation; partnering with engineering teams to operationalize them in scalable and resilient production systems.
- Flex your interpersonal skills to translate the complexity of your work into tangible business goals.
Other
- Innovative. You continually research and evaluate emerging technologies.
- Creative. You thrive on bringing definition to big, undefined problems.
- Influential. You are passionate about AI/ML and can bring along a cross functional team in breakthrough innovations.
- You communicate clearly and effectively to share your findings with non-technical audiences.
- At least 1 year of experience managing people