Bixby is an intelligent personal assistant that needs to expand its voice technology and features to include advanced research and projects in Text to Speech synthesis, Automatic Speech Recognition (ASR), and personalization. This involves improving the quality of TTS in various applications and systems, analyzing TTS system performance, and making technological choices for generative AI solutions.
Requirements
- Experience with Tensorflow or Pytorch or similar frameworks
- Worked on advance architectures such as Tacotron, WavNet, Fastspeech, Vall-e and other advanced models for TTS systems
- Experience working in voice cloning, neural style transfer and machine synthesis of speech from speakers
- Experience in Prosody modeling for more natural generation of speech
- Working experience on TTS in large scale production systems
- Working on various vocoder techniques for production
- Knowledge of state-of-the-art Large Language models such as Deepseek, GPT, BERT variants and other deep fusion techniques is essential
Responsibilities
- Architect and design end to end Automatic Speech Recognition products, applications and solutions for specific business needs and provide implementation guidance during delivery
- Leverage, customize and implement TTS models, algorithms, and methodologies to improve the overall quality TTS in various applications and systems
- Analyze and evaluate the performance TTS systems and provide design recommendations
- Analyze and make right technological choices for generative ai solutions
- Design and prototype reusable components for LLM based solutions for TTS
- Architect components of an TTS solution to address Responsible AI & Security
- Harness the power of transformer architecture, a cutting-edge deep learning model widely employed in natural language processing and computer vision, to optimize the language model's performance and efficiency
Other
- MS or Ph.D. in Computer Science or Digital Signal Processing or equivalent combination of education, training, and experience
- 5+ years of relevant professional experience in Machine Learning or relevant field
- Collaborate seamlessly with diverse, cross-functional teams to accurately identify and prioritize requirements, ensuring that the language model meets the needs and expectations of various stakeholders
- Create and maintain comprehensive technical documentation that comprehensibly captures the intricate details of the language model, facilitating seamless understanding, efficient troubleshooting, and future development
- Ensure ethical AI development practices, prioritizing fairness, transparency, and privacy
- Ability to develop project plans and experience to execute them
- Research expertise in ML and written research publications