Serve Robotics is looking to scale their robotics data platform to enable real-time robotics operations and analytics at scale, by designing and scaling the systems that power Serve’s robotics data platform and owning the lifecycle of robot and cloud data.
Requirements
- Experience building large-scale distributed systems handling petabytes of data per day
- Proficiency in at least one of Python, Go, or C++.
- Proven experience building and operating large-scale, distributed data systems on a major cloud platform (GCP experience is preferred: GCS, BigQuery, Pub/Sub, Dataflow, etc.).
- Deep understanding of data processing, ETL/ELT, networking principles, and I/O optimization.
- Strong architectural skills, with the ability to design complex systems that are scalable, reliable, and maintainable.
- Hands-on experience with data transformation and modeling tools like dbt.
- Familiarity with containerization and orchestration technologies (Docker, Kubernetes).
Responsibilities
- Define and own the technical roadmap for the data infrastructure platform, making key architectural decisions and ensuring alignment with company objectives.
- Own the development of our resilient data offload system, ensuring the fleet can meet strict SLAs and maximally utilize depots' available bandwidth.
- Drive the creation of a centralized cloud service (Orchestrator) to manage bandwidth, prioritize data events, and coordinate offload activities across the entire fleet.
- Build and maintain the infrastructure for ingesting and transforming robot metrics into queryable tables (e.g., BigQuery) and analytics-ready models using tools like dbt.
- Implement robust monitoring and ensure the entire system is resilient to interruptions, with features like automatic rollover and data integrity verification
- Mentor other engineers on the team, fostering best practices in system design, coding, and operational excellence.
Other
- 8+ years of professional software engineering experience, with at least 2 years in a technical leadership or lead engineer role.
- Demonstrated ability to lead technical projects from conception to production.
- Excellent mentorship skills and a passion for growing the capabilities of your team.
- Strong communication skills and the ability to drive alignment across multiple teams.
- Experience in robotics, IoT, or a similar field involving data offload from fleets of edge devices.
- Experience designing systems that must be resilient to network failures and other real-world interruptions.