Job Board
LogoLogo

Get Jobs Tailored to Your Resume

Filtr uses AI to scan 1000+ jobs and finds postings that perfectly matches your resume

DeepMind Logo

Research Scientist, Multimodal LLMs

DeepMind

$166,000 - $244,000
Aug 17, 2025
Mountain View, CA, US
Apply Now

The VIVID team at Google DeepMind is focused on advancing the capabilities of foundation models to enable personalized, multimodal, and agentic experiences. This role aims to develop and advance the next generation of AI models that can seamlessly integrate and reason across different modalities such as text, images, audio, and video.

Requirements

  • PhD in Computer Science, Statistics, or a related field.
  • Strong publication record in top machine learning conferences (e.g., NeurIPS, CVPR, ICML, ICLR, ICCV, ECCV).
  • Expertise in one or more of the following areas: computer vision, natural language processing, machine learning.
  • Experience with training, evaluating, or interpreting large language models.
  • Proven ability to design and execute independent research projects.
  • Extensive experience with deep learning frameworks (e.g. PyTorch, JAX) and large-scale model training.

Responsibilities

  • Develop and implement next-generation agentic reasoning frameworks for multimodal understanding
  • Explore reinforcement learning (RL) to reward detailed reasoning chains
  • Develop robust self-critique mechanisms for error correction
  • Integrate tool-use, enabling models to execute code for interactive video analysis and manipulation
  • Pioneer the extension of visual reasoning from 2D into the 3D domain, enabling a more physically-grounded form of intelligence
  • Leverage deep spatial understanding to develop novel generative capabilities, such as synthesizing photorealistic views of a scene from new perspectives or allowing for the interactive modification of 3D objects and layouts
  • Spearhead the creation of novel, challenging benchmarks to rigorously measure progress and define the future of visual reasoning

Other

  • Excellent communication and collaboration skills.
  • The US base salary range for this full-time position is between $166,000 USD - 244,000 USD + bonus + equity + benefits.
  • Application deadline: Friday, August 29th at 9:00am PDT
  • We value diversity of experience, knowledge, backgrounds and perspectives and harness these qualities to create extraordinary impact.
  • We are committed to equal employment opportunities regardless of sex, race, religion or belief, ethnic or national origin, disability, age, citizenship, marital, domestic or civil partnership status, sexual orientation, gender identity, pregnancy, or related condition (including breastfeeding) or any other basis as protected by applicable law.