Member of Technical Staff - Voice & Vision

Microsoft

$137,600 - $294,000

Sep 10, 2025

Mountain View, CA, USA

Microsoft is developing voice and vision capabilities for its Copilot product, with a focus on voice recognition, natural language processing, and computer vision.

Requirements

Coding experience in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python
Experience with audio and video technologies, including super resolution and real-time video streaming
Expertise in natural language processing (NLP) techniques, such as named entity recognition, sentiment analysis, and intent detection
Experience with computer vision algorithms for tasks such as image classification, object detection, and facial recognition
Experience with voice recognition algorithms, including acoustic modeling, language modeling, and speech-to-text conversion
Experience with machine learning frameworks and tools, such as TensorFlow or PyTorch
Experience with cloud-based platforms, such as Azure or AWS

Responsibilities

Work on cutting-edge technologies with a focus on voice and vision, including super resolution and real-time video streaming.
Apply expertise in audio and video technologies to new AI contexts, focusing on traditional methods such as noise suppression and echo cancellation to enhance voice quality across mobile platforms.
Develop and implement advanced techniques for video and image manipulation, including upscaling and super resolution.
Lead the integration and rollout of vision capabilities, ensuring seamless collaboration between the vision and voice teams to deliver next-level improvements and expand functionality across various contexts.
Drive hands-on development of voice-heavy and video-heavy features, pushing the boundaries of generative AI in both audio and video domains.
Design, develop, and optimize voice recognition algorithms, including acoustic modeling, language modeling, and speech-to-text conversion.
Implement and enhance natural language processing (NLP) techniques, such as named entity recognition, sentiment analysis, and intent detection.

Other

Bachelor's degree in Computer Science or a related technical discipline
6 years of technical engineering experience
Ability to work in the office three days a week; candidates living within 50 miles of a designated Microsoft office are required to work from that office at least four days a week
Ability to collaborate with product managers, designers, and other stakeholders to define technical requirements and deliverables
Ability to work in a dynamic team environment