Research Scientist Intern - Multimodal Audio Generation - PhD

Meta

$7,650 - $12,134
Sep 3, 2025
Burlingame, CA, USA • Redmond, WA, USA

Meta's Core AI team is seeking a Research Scientist Intern with a focus on audio generation, especially music and song generation from multimodal input. Our team is pioneering AI research across text, audio, and video domains, with a mission to develop AI-driven foundational models and their applications.

Requirements

  • Research experience in one or more of these areas: machine learning, deep learning, generative AI, audio processing or related fields
  • Knowledge of state-of-the-art deep learning methods and neural networks
  • Experience working with machine learning libraries such as PyTorch, JAX, etc.
  • Experience with scripting languages such as Python and shell scripts
  • Experience developing scalable machine learning models in at least one of the following areas: large language models, natural language understanding or generation, efficient training and inference, multimodal models, or related areas
  • Experience with large scale model training, implementing algorithms, and evaluating language systems
  • Proven track record of achieving significant results as demonstrated by publications at leading conferences/journals such as NeurIPS, ICLR, ICML, CVPR, ICCV, ICASSP, Interspeech, AAAI, IEEE TASLP or similar

Responsibilities

  • Lead and contribute to cutting-edge audio (music and song) generation model research that results in publications at top-tier conferences
  • Perform research to tackle unsolved real-world problems and push the state of the art
  • Independently design and implement algorithms, train advanced foundational models on large datasets, and evaluate their performance
  • Define, plan and execute cutting-edge deep learning research to advance product experiences using the audio generation features
  • Communicate experimental results and recommendations clearly, both within the group and to cross-functional groups

Other

  • Currently in the process of obtaining a PhD in Artificial Intelligence or a related field
  • Must obtain work authorization in the country of employment at the time of hire, and maintain ongoing work authorization during employment
  • Intent to return to the degree program after the completion of the internship
  • Experience working and communicating cross-functionally in a team environment
  • Experience solving complex problems and comparing alternative solutions, trade-offs, and diverse points of view to determine a path forward