NetApp is looking to solve the problem of sustainably scaling systems through automation and driving changes that improve reliability and velocity in their cloud services lifecycle, from design through deployment, operation, and refinement.
Requirements
- 8+ years experience in scripting and infrastructure automation using tools such as PowerShell, Python, Go or Ruby
- Deep working knowledge of Containers, Kubernetes, Serverless computing implementation, and distributed systems design patterns.
- Knowledge of DevOps/SRE development methodologies.
- Proficiency in Linux/Unix and CoreOS.
- Experience with cloud platforms such as AWS, Azure, or Google Cloud.
Responsibilities
- Identify tasks and areas where automation can be applied to achieve time efficiencies and risk reduction.
- Develop software for deployment automation, packaging, and monitoring visibility.
- Work in tandem with other Cloud Infrastructure Engineers and developers to ensure maximum performance, reliability, and automation of our deployments and infrastructure.
- Consult and influence developers on new feature development and software architecture to ensure scalability.
- Undertake debugging and troubleshooting of service bottlenecks throughout the entire software stack.
- Provide advanced tier 2 and 3 support for NetApp's Cloud Data Service solutions.
- Continuously monitor, analyze, and measure system health, availability, and latency using tools like Prometheus, Stackdriver, ElasticSearch, Grafana, and SolarWinds.
Other
- This position involves participation in a rotation-based on-call schedule as part of our global team.
- This position will have ON-CALL rotations as well as an ask to work odd hours.
- Must be a US Citizen or Green Card holder.
- Preference if you possess either an interim Secret clearance (or above) or have recently undergone a Criminal Justice Information Services (CJIS) background check to verify criminal history, employment history, and financial/credit history.
- Ability to lead a scrum team, influence stakeholders to effectively maintain a product backlog, manage sprints.