Job Board
LogoLogo

Get Jobs Tailored to Your Resume

Filtr uses AI to scan 1000+ jobs and finds postings that perfectly matches your resume

Apple Logo

GPU Software Architecture Engineer

Apple

Salary not specified
Dec 6, 2025
Cupertino, CA, US
Apply Now

Apple is seeking to solve the complex challenge of orchestrating massive network models across server clusters to power Apple Intelligence at unprecedented scale

Requirements

  • Strong knowledge of GPU programming (CUDA, ROCm) and high-performance computing
  • Excellent system programming skills in C/C++, Python is a plus
  • Deep understanding of distributed systems and parallel computing architectures
  • Experience with inter-node communication technologies (InfiniBand, RDMA, NCCL) in the context of ML training/inference
  • Understand how tensor frameworks (PyTorch, JAX, TensorFlow) are used in distributed training/inference
  • Familiar with model development lifecycle from trained model to large scale production inference deployment
  • Proven track record in ML infrastructure at scale

Responsibilities

  • Design and implement tensor/data/expert parallelism strategies for large language model inference across distributed server cluster environments
  • Drive hardware and software roadmap decisions for ML acceleration
  • Expert in designing architectures that achieves peak compute utilizations and optimal memory throughput
  • Develop and optimize distributed inference systems with focus on latency, throughput, and resource efficiency across multiple nodes
  • Architect scalable ML serving infrastructure supporting dynamic model sharding, load balancing, and fault tolerance
  • Collaborate with hardware teams on next-generation accelerator requirements and software teams on framework integration
  • Lead performance analysis and optimization of ML workloads, identifying bottlenecks in compute, memory, and network subsystems

Other

  • Technical BS/MS degree
  • Apple is an equal opportunity employer that is committed to inclusion and diversity
  • Must have excellent communication and collaboration skills to work with cross-functional teams
  • Ability to work in a fast-paced environment and adapt to changing priorities
  • Commitment to promoting equal opportunity for all applicants without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, Veteran status, or other legally protected characteristics