Building speech AI that actually sounds human for a well-funded speech AI startup
Requirements
- 3+ years of experience in speech synthesis, audio generation, or generative modeling
- Experience with audio generation using LLMs
- Solid background in modern language model architectures
- Proven ability to ship research into production systems
- Experience training large-scale models
- Published research in speech or generative modeling
- Experience with real-time speech systems or multimodal models
Responsibilities
- Conduct research to advance their core speech models and extend product capabilities
- Develop and experiment with new model architectures and training approaches
- Work on large-scale model training and data systems
- Collaborate with the team to take research from concept to deployed systems
Other
- Ideally located in SF, but can also consider remote worldwide
- Comp is up to $250K base DOE, plus equity