Microsoft is seeking to develop next-generation visual/screen models with AI-powered capabilities that are intelligent, secure, and efficient, and is looking for a Senior Applied Scientist to join their team to build cutting-edge visual models optimized for on-device inference.
Requirements
- Solid experience with deep learning frameworks (e.g., PyTorch, TensorFlow) and model compression techniques.
- Experience with model evaluation metrics and benchmarking across platforms.
- Prior experience training models with custom architectures and tuning architectures to optimize for inference time.
- Proven track record in quantization, performance tuning, and deploying models on edge devices.
- Familiarity with silicon-specific optimization strategies.
- Hands-on experience with Qualcomm QNN, Intel OpenVINO, or similar toolchains.
- Experience with techniques like distillation, adapters, and Low Rank Adapters (LoRAs)
Responsibilities
- Be part of a team contributing to pre- and post-training efforts for visual models pertaining to screen understanding.
- Explore and implement Transformer-based architectures and suggest architectural improvements for efficiency and scalability.
- Use techniques like distillation, adapters, and Low Rank Adapters (LoRAs) to build upon existing models.
- Optimize models for on-device inference using techniques such as quantization and debugging models.
- Conduct performance tuning and model evaluation across diverse silicon platforms (e.g., Neural Processing Units--NPUs, GPUs, custom accelerators).
- Collaborate with cross-functional teams to integrate models into production pipelines and validate performance in real-world scenarios.
- Contribute to internal model libraries and tooling for deployment across multiple hardware toolchains.
Other
- Bachelor's Degree in Statistics, Econometrics, Computer Science, Electrical or Computer Engineering, or related field AND 4+ years related experience
- OR Master's Degree in Statistics, Econometrics, Computer Science, Electrical or Computer Engineering, or related field AND 3+ years related experience
- OR Doctorate in Statistics, Econometrics, Computer Science, Electrical or Computer Engineering, or related field AND 1+ year(s) related experience
- Equivalent experience.
- Microsoft will accept applications for the role until September 14, 2025.