Samsara is looking to improve the safety, efficiency, and sustainability of physical operations by harnessing Internet of Things (IoT) data to develop actionable insights and improve operations.
Requirements
- Strong proficiency in one or more common languages (e.g., C++, Golang, Java, Python, Scala).
- Proficiency with common ML tools (e.g., Spark, TensorFlow, PyTorch).
- Experience deploying and iteratively refining models using customer feedback loops.
- Comfortable with full-stack / backend development code to build a strong understanding of underlying data structures and other dependencies.
- Experience with Kubernetes
- Experience with Ray
- Experience with CI/CD pipelines
Responsibilities
- Design and implement scalable machine learning infrastructure using Ray to support model training, deployment, and inference at scale.
- Leverage Kubernetes for orchestration of containerized applications, ensuring seamless deployment, scaling, and management of ML models and associated services.
- Develop and maintain CI/CD pipelines for automated testing, deployment, and management of ML applications and infrastructure.
- Implement robust monitoring, logging, and alerting systems to ensure high availability, performance, and security of the ML platform.
- Collaborate with data scientists and ML engineers to optimize data pipelines and model performance.
- Stay abreast of the latest advancements in machine learning technologies and infrastructure, and advocate for the adoption of best practices and new technologies within the team.
- Provide DevOps/SRE support for the ML platform, including incident response, performance tuning, and disaster recovery planning.
Other
- BS or MS in Computer Science or other relevant field.
- 6+ years of experience as a Machine Learning Engineer, Applied Scientist, or similar role.
- Ability to work in a remote environment
- Ability to work in the US
- Must be able to secure and maintain the legal right to work at the company and in the specified work location, if applicable.