Wispr Flow is looking to solve the problem of creating a seamless voice dictation platform that understands users perfectly on the first try, and is building the interaction layer for computers to make human-computer interaction more natural and effortless.
Requirements
- PhD in machine learning or related field (neuroscience, EE, etc)
- Strong publication track record at top conferences (ICML, NeurIPS, ICLR, ICASSP)
- Fluency in Python and LLM development
- Attention to detail and eagerness to learn
- Aptitude and clarity of thought
- Creativity in R&D, excellence in engineering, and code velocity
Responsibilities
- Lead the ML research and engineering teams for development of next generation speech recognition and voice-to-action models
- Technically lead the direction of the ML team
- Be directly hands on involved in aspects of training and serving of models
- Hire and grow the team
- Spearhead work on RL-based personalization of LLMs for post-processing and interpreting user speech
- Train next generation of multimodal speech-LLMs to perform contextual speech recognition
- Use auxiliary signals such as speaker history, current screen context, and more to break the current performance boundary of ASR models
Other
- Experience leading teams of researchers and engineers - ideally of ten or more people
- Opinionated takes on how to make quality breakthroughs in speech recognition
- Degree requirement: PhD
- Travel requirements not mentioned
- Clearance requirements not mentioned