Job Board
LogoLogo

Get Jobs Tailored to Your Resume

Filtr uses AI to scan 1000+ jobs and finds postings that perfectly matches your resume

Microsoft Logo

Member of Technical Staff - Voice & Vision

Microsoft

$137,600 - $294,000
Sep 10, 2025
Mountain View, CA, USA
Apply Now

Microsoft is looking to solve the problem of developing voice and vision capabilities for its Copilot product, with a focus on voice recognition, natural language processing, and computer vision technologies.

Requirements

  • Coding experience in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python
  • Experience with audio and video technologies, including super resolution and real-time video streaming
  • Expertise in natural language processing (NLP) techniques, such as named entity recognition, sentiment analysis, and intent detection
  • Experience with computer vision algorithms for tasks such as image classification, object detection, and facial recognition
  • Experience with voice recognition algorithms, including acoustic modeling, language modeling, and speech-to-text conversion
  • Experience with machine learning frameworks and tools, such as TensorFlow or PyTorch
  • Experience with cloud-based platforms, such as Azure or AWS

Responsibilities

  • Work on cutting-edge technologies with a focus on voice and vision, including super resolution and real-time video streaming.
  • Apply expertise in audio and video technologies to new AI contexts, focusing on traditional methods such as noise suppression and echo cancellation to enhance voice quality across mobile platforms.
  • Develop and implement advanced techniques for video and image manipulation, including upscaling and super resolution.
  • Lead the integration and rollout of vision capabilities, ensuring seamless collaboration between the vision and voice teams to deliver next-level improvements and expand functionality across various contexts.
  • Drive hands-on development of voice-heavy and video-heavy features, pushing the boundaries of generative AI in both audio and video domains.
  • Design, develop, and optimize voice recognition algorithms, including acoustic modeling, language modeling, and speech-to-text conversion.
  • Implement and enhance natural language processing (NLP) techniques, such as named entity recognition, sentiment analysis, and intent detection.

Other

  • Bachelor's Degree in Computer Science, or related technical discipline
  • 6 years technical engineering experience
  • Ability to work in office 3 days a week, with a requirement to work from a designated Microsoft office at least four days a week if living within 50 miles of the location
  • Ability to collaborate with product managers, designers, and other stakeholders to define technical requirements and deliverables
  • Ability to work in a dynamic team environment