U.S. Bank is looking to build the platforms and ecosystems that help over 1.5 million customers around the world to achieve their financial goals, and this position will be responsible for the analysis, design, testing, development and maintenance of best in class software experiences.
Requirements
- Proven experience as a Site Reliability Engineer or similar role.
- Strong knowledge of monitoring tools and incident management.
- Proficiency in SQL and scripting languages like Java , SQL scripts.
- Strong experience with AWS or Azure services
- Relevant certifications in AWS, Azure, or containerization technologies
- Experience with Docker and container clustering technologies like AWS ECS or Kubernetes
- Experience with monitoring and logging tools such as Data Dog, Splunk, Elasticsearch, Kibana and CloudWatch
Responsibilities
- Set up and manage monitoring tools, dashboards, and alerts.
- Collaborate with development teams to troubleshoot issues by analyzing the logs.
- Write SQL queries and scripts.
- Maintain documentation and lead knowledge-sharing sessions.
- Ensure system reliability, availability, and performance.
- Lead incident response efforts, troubleshoot issues, and conduct root cause analysis.
- Drive continuous improvement initiatives to reduce Mean Time to Recovery (MTTR)
Other
- Bachelor’s degree, or equivalent work experience
- Three to five years of relevant experience
- Must be open to doing production support and on call rotation.
- Strong communication and collaboration abilities.
- The role offers a hybrid/flexible schedule, which means there's an in-office expectation of 3 or more days per week and the flexibility to work outside the office location for the other days.