Zoox is working to solve the problem of managing the data generated by its autonomous vehicle fleet in order to advance autonomous transportation.
Requirements
- Proficiency in Python and C++, or similar languages, with emphasis on writing clean, maintainable code
- Extensive experience with cloud platforms (AWS preferred), including S3, Aurora, and EC2
- Proven experience designing and operating large-scale data ingestion and storage systems handling petabytes of data
- Knowledge of data cataloging systems, metadata management, storage optimization, and data governance
- Experience with monitoring and observability tools (such as Grafana and CloudWatch)
Responsibilities
- Design, develop, and maintain scalable systems for collecting, cataloging, storing, and processing petabytes of vehicle data
- Architect robust log data storage solutions using AWS S3 and distributed systems to ensure data integrity and availability
- Lead technical design discussions, create design documents, and drive architectural decisions for complex data platform features
- Implement monitoring, alerting, and operational tooling to ensure high availability and reliability of data pipelines processing thousands of runs daily
- Mentor junior engineers, conduct code reviews, and establish best practices for data platform development
- Troubleshoot and resolve production issues in data ingestion and storage systems, participating in on-call rotations
Other
- 8+ years of software engineering experience building large-scale distributed cloud storage systems or data platforms
- Demonstrated strong ownership, driving projects from conception to completion, and clear communication when collaborating with cross-functional teams