SentinelOne is seeking a Senior Staff Software Engineer to join the team behind Observo.ai, our AI-driven data pipeline optimization platform. In this role, you will lead the architectural design and technical strategy for high-performance systems that process massive volumes of telemetry data while reducing costs and improving insights for enterprise customers.
Requirements
- Expert-level proficiency in Go, Rust, or Java with a deep understanding of system design patterns, software architecture principles, and performance optimization
- Extensive experience with cloud platforms (AWS, GCP, Azure) and container orchestration technologies (Kubernetes, Docker) at enterprise scale
- Proven track record of leading the design and scaling of data pipelines using technologies such as Apache Kafka, Apache Spark, Apache Flink, or similar streaming frameworks
- Deep expertise in database technologies, including both SQL (PostgreSQL, MySQL) and NoSQL (MongoDB, Cassandra, Redis) systems with experience in data modeling and optimization
- Advanced experience with machine learning frameworks (TensorFlow, PyTorch, scikit-learn) and MLOps practices for production ML systems at scale
- Expert knowledge of observability and monitoring tools and practices, with experience architecting solutions using Prometheus, Grafana, the ELK stack, and similar platforms
- Comprehensive understanding of data formats, protocols, and standards used in enterprise observability (OpenTelemetry, StatsD, syslog, JSON, Parquet)
Responsibilities
- Lead the architectural design and technical roadmap for scalable, high-performance data processing pipelines capable of handling petabyte-scale telemetry data (logs, metrics, traces)
- Drive the development and optimization of ML-driven data routing, filtering, and transformation engines to reduce customer data volumes by 80%+ while preserving critical insights
- Architect and implement real-time analytics and anomaly detection systems using advanced machine learning techniques and large language models
- Design cloud-native microservices and APIs that integrate seamlessly with major observability platforms (Splunk, Elastic, Datadog, New Relic)
- Establish robust monitoring, alerting, and observability solutions for distributed systems operating at enterprise scale
- Lead cross-functional technical initiatives, collaborating with Product, Data Science, and DevOps teams to translate strategic vision into technical solutions
- Drive improvements in system performance, cost efficiency, and reliability through advanced profiling, testing, and infrastructure design
Other
- 10+ years of software engineering experience
- Strong leadership and technical communication skills with experience driving technical decisions across multiple teams and stakeholders
- Track record of mentoring engineers and establishing technical standards and best practices in complex engineering organizations
- Experience with technical strategy and roadmap planning for large-scale distributed systems
- Partial on-site presence at our Mountain View, CA headquarters