Spire's Space Reliability Engineering team needs to ensure the reliable operation of its satellite constellation, ground stations, and software by monitoring availability, latency, performance, and capacity, applying Site Reliability Engineering principles to space operations.
Requirements
- 5+ years professional software engineering, devops, or SRE experience
- Familiarity with Linux, including Bash scripting and basic system administration
- Familiarity with data analysis and data analysis tools
- Proficiency using and developing containers for development and production environments
- 5+ years of experience with system development using Python
- Familiarity with data backends like S3, RDS, Postgres, Redis, and Elasticsearch
- Experience implementing monitoring and alerting system using systems like Grafana, Prometheus, or Nagios
Responsibilities
- Develop operational automation for mission execution, anomaly detection and resolution, space situational awareness, and performance monitoring.
- Develop a data platform that allow for flexible data storage and analytics
- Data analysis of the telemetry from Ground and Space Assets to ensure reliable and efficient performance
- Maintain configuration control of all assets and provide for the rollout of updated software and configuration
Other
- Candidates will need to interface with many teams across Spire to adapt user needs into system requirements and have the experience to collaborate and iterate on the resultant projects.
- Spire operates a hybrid work model, and this position will require you to work a minimum of three days per week in the office.
- Access to US export-controlled software and/or technology may be required for this role.
- All candidates who receive a conditional offer will be required to complete a background check. This may include criminal history and employment verification.