Capital One is looking to leverage the latest in computing and machine learning technologies, specifically Generative AI and Large Language Models (LLMs), to disrupt the financial industry at scale. The goal is to unlock opportunities that help customers save money, time, and reduce financial stress by developing AI-powered products and features.
Requirements
- You are experienced in training language models or large computer vision models as well as have expertise in one or more key subdomains such as: training optimization, self-supervised learning, explainability, RLHF.
- You have an engineering mindset as shown by a track record of delivering models at scale both in training data and inference volumes.
- You have experience in delivering libraries, platforms, or solution level code to existing products.
- At least 1 year of experience working with AWS
- At least 3 years’ experience in Python, Scala, or R
- At least 3 years’ experience with machine learning
- At least 3 years’ experience with SQL
Responsibilities
- Our team creates unprecedented amounts of high quality data for training and testing GenAI models; we care about how it’s created, what’s in those datasets, and the impact they have
- We are invested in building capabilities for evaluating and monitoring generative models; these methods must be state of the art, easy to use, and trusted by our users and contributors
- Horizontal capabilities enable vertical use case work; the team builds search, summarization, RAG, and agentic workflows for integration in production applications across the company
- Partner with a cross-functional team of data scientists, software engineers, machine learning engineers and product managers to deliver AI powered products that change how customers interact with their money.
- Leverage a broad stack of technologies — Pytorch, AWS Ultraclusters, Hugging Face, LangChain, VectorDBs, and more — to reveal the insights hidden within huge volumes of numeric and textual data.
- Be the expert in Natural Language Processing (NLP) to harness the power of Large Language Models (LLMs), adapt and finetune them for customer facing applications and features.
- Build machine learning and NLP models through all phases of development, from design through training, evaluation, and validation; partnering with engineering teams to operationalize them in scalable and resilient production systems that serve 80+ million customers.
Other
- Customer first. You love the process of analyzing and creating, but also share our passion to do the right thing. You know at the end of the day it’s about making the right decision for our customers.
- Innovative. You continually research and evaluate emerging technologies. You stay current on published state-of-the-art methods, technologies, and applications and seek out opportunities to apply them.
- Creative. You thrive on bringing definition to big, undefined problems. You love asking questions and pushing hard to find answers. You’re not afraid to share a new idea.
- A leader. You challenge conventional thinking and work with stakeholders to identify and improve the status quo. You're passionate about talent development for your own team and beyond.
- Influential. You are passionate about AI/ML and can bring along a cross functional team in breakthrough innovations. You communicate clearly and effectively to share your findings with non-technical audiences.