At Capital One, we are creating trustworthy and reliable AI systems, changing banking for good. We are committed to building world-class applied science and engineering teams and continue our industry leading capabilities with breakthrough product experiences and scalable, high-performance AI infrastructure. At Capital One, you will help bring the transformative power of emerging AI capabilities to reimagine how we serve our customers and businesses who have come to love the products and services we build.
Requirements
- Has a deep understanding of the foundations of AI methodologies.
- Experience building large deep learning models, whether on language, images, events, or graphs, as well as expertise in one or more of the following: training optimization, self-supervised learning, robustness, explainability, RLHF.
- An engineering mindset as shown by a track record of delivering models at scale both in terms of training data and inference volumes.
- Experience in delivering libraries, platform level code or solution level code to existing products.
- LLM
- PhD focus on NLP or Masters with 5 years of industrial NLP research experience
- Multiple publications on topics related to the pre-training of large language models (e.g. technical reports of pre-trained LLMs, SSL techniques, model pre-training optimization)
Responsibilities
- Leverage a broad stack of technologies — Pytorch, AWS Ultraclusters, Huggingface, Lightning, VectorDBs, and more — to reveal the insights hidden within huge volumes of numeric and textual data.
- Build AI foundation models through all phases of development, from design through training, evaluation, validation, and implementation.
- Engage in high impact applied research to take the latest AI developments and push them into the next generation of customer experiences.
- Partner with a cross-functional team of data scientists, software engineers, machine learning engineers and product managers to deliver AI-powered products that change how customers interact with their money.
- Experience building large deep learning models, whether on language, images, events, or graphs, as well as expertise in one or more of the following: training optimization, self-supervised learning, robustness, explainability, RLHF.
- An engineering mindset as shown by a track record of delivering models at scale both in terms of training data and inference volumes.
- Experience in delivering libraries, platform level code or solution level code to existing products.
Other
- Flex your interpersonal skills to translate the complexity of your work into tangible business goals.
- You love the process of analyzing and creating, but also share our passion to do the right thing. You know at the end of the day it’s about making the right decision for our customers.
- Innovative. You continually research and evaluate emerging technologies. You stay current on published state-of-the-art methods, technologies, and applications and seek out opportunities to apply them.
- Creative. You thrive on bringing definition to big, undefined problems. You love asking questions and pushing hard to find answers. You’re not afraid to share a new idea.
- A leader. You challenge conventional thinking and work with stakeholders to identify and improve the status quo. You’re passionate about talent development for your own team and beyond.