Job Board
LogoLogo

Get Jobs Tailored to Your Resume

Filtr uses AI to scan 1000+ jobs and finds postings that perfectly matches your resume

Motional Logo

Senior Backend Engineer, Data Mining

Motional

$159,000 - $207,000
Oct 24, 2025
Boston, MA, US • Las Vegas, NV, US • Pittsburgh, PA, US
Apply Now

Motional is looking to solve the problem of transforming how autonomous vehicles discover critical intelligence hidden within petabytes of multimodal sensor data, by finding rare edge cases, long-tail scenarios, and model errors that matter most.

Requirements

  • Deep, hands-on expertise with Ray or Spark (or both) for distributed data processing and large-scale inference workloads
  • Expert-level Python proficiency with strong software engineering fundamentals: testing (unit, integration, and end-to-end), CI/CD pipelines, containerization, and code review practices
  • Proven experience optimizing and scaling production data pipelines that process terabytes or petabytes of data
  • Strong SQL and data manipulation skills; comfort with both structured and semi-structured data
  • Experience with cloud infrastructure (AWS preferred: S3, EC2, EKS, EMR, IAM) and infrastructure-as-code patterns
  • Demonstrated track record of shipping robust, well-tested, production-grade systems and mentoring junior engineers
  • Hands-on expertise with distributed systems, data processing, and large-scale inference workloads

Responsibilities

  • Architect the OmniTag Engine: Design and build the high-throughput, low-latency backend systems that execute billion-scale inference across Ray/Spark, transforming raw sensor data into unified multimodal representations.
  • Scale Multimodal Data Pipelines: Own the complete data journey - from ingestion, normalization, and preprocessing of heterogeneous modalities (image, video, LiDAR, audio) through encoding, indexing, and cached embedding storage.
  • Evolve the Vector Search and Retrieval Engine: Enhance our in-house billion-scale vector search engine to power RAG-driven few-shot dataset creation.
  • Own Data Quality and Observability: Build comprehensive monitoring, logging, and alerting for multimodal data preprocessing pipelines.
  • Collaborate on Encoder-Decoder Adaptation: Work closely with ML engineers to support domain-specific fine-tuning workflows, model versioning, and A/B testing of new encoders and decoders.
  • Drive Production Reliability: Establish patterns for graceful degradation, fault tolerance, and cost optimization.
  • Operate OmniTag as a mission-critical data platform serving the entire ML organization, with a focus on reliability, debuggability, and operational excellence.

Other

  • BS in Computer Science or a related field, or equivalent professional experience
  • 6+ years designing, building, and operating large-scale distributed systems in production environments
  • We encourage a hybrid schedule with in-office time at one of our locations in Boston, Pittsburgh, or Las Vegas to support collaboration, or this role can be fully remote.
  • Motional AD Inc. is an EOE. We celebrate diversity and are committed to creating an inclusive environment for all employees.
  • To comply with Federal Law, we participate in E-Verify. All newly-hired employees are queried through this electronic system established by the DHS and the SSA to verify their identity and employment eligibility.