Leidos is looking to solve the problem of securely delivering, implementing, and sustaining software in the field by hiring a Site Reliability Engineer (SRE) to join their team and work on real-time hands-on fielding challenges and develop reusable solutions.
Requirements
- Expertise with Linux and Windows operating systems, network administration, and networking protocols/functions (e.g., HTTP, HTTPS, SSL/TLS, SMTP, DNS)
- Expertise provisioning and managing resources within IaaS/Cloud infrastructures (e.g., Azure, AWS, Google Cloud Platform, etc)
- Experience with Terraform, Ansible, Helm, BASH Scripting, CloudFormation, Chef, Puppet, Ansible or similar technologies.
- Expertise with container technologies such as Docker and container orchestration tools like Kubernetes.
- Expertise with Kubernetes kubectl
- Expertise of a version control system (e.g., Git)
- Experience with monitoring and alerting tools such as Grafana, Prometheus
Responsibilities
- Gather and analyze metrics from operating systems as well as applications to assist in performance tuning and fault finding of an microservice enterprise system (cloud and on-premises)
- Partner with development teams to improve services, diagnostics, and deployment tools through gap identification, concept development, and rigorous testing and release procedures
- Participate in system design consulting, platform management, and capacity planning
- Create sustainable systems and services through service automation
- Design, develop, troubleshoot, and debug mission critical infrastructure on-prem and remote
- Manage on-premises and private/public cloud environments via infrastructure-as-code (IaC) and hands-on/client site activites.
- Participate in the concept design of reusable infrastructure components for scalable, highly available, secure architectures for cloud native applications.
Other
- Typically requires a Bachelor’s degree in computer science or computer engineering with 4+ years of experience in a relevant field.
- Must be able to pass an in-depth background check (CBP Public Trust BI)
- Ability to travel up to 70% of times to remote locations, mostly in the US along the border to troubleshoot network and software bugs during initial deployment and sustainment.
- U.S. Citizenship is required
- Ability to work and collaborate effectively within a multi-disciplined engineering team.