At U.S. Bank, the business and technical problem is to develop, maintain, and ensure the reliability, availability, and performance of best-in-class software experiences for over 1.5 million customers globally, enabling them to achieve their financial goals through tailored payment solutions powered by the latest technology.
Requirements
- Proven experience as a Site Reliability Engineer or similar role.
- Strong knowledge of monitoring tools and incident management.
- Proficiency in SQL and scripting languages like Java , SQL scripts.
- Experience with React, Spring Boot, J2EE applications
- Strong experience with AWS or Azure services
- Experience with Docker and container clustering technologies like AWS ECS or Kubernetes
- Experience with monitoring and logging tools such as Data Dog, Splunk, Elasticsearch, Kibana and CloudWatch
Responsibilities
- Set up and manage monitoring tools, dashboards, and alerts.
- Collaborate with development teams to troubleshoot issues by analyzing the logs.
- Debug and troubleshoot Java based applications in production and pre-production environments
- Write SQL queries and scripts.
- Maintain documentation and lead knowledge-sharing sessions.
- Ensure system reliability, availability, and performance.
- Lead incident response efforts, troubleshoot issues, and conduct root cause analysis.
Other
- The candidate is a self-motivated individual who can collaborate with a team and across the organization.
- The candidate takes responsibility of the software artifacts produced adhering to U.S. Bank standards in order to ensure minimal impact to the customer experience.
- The candidate will be adept with the agile software development lifecycle and DevOps principles.
- Must be open to doing production support and on call rotation.
- The role offers a hybrid/flexible schedule, which means there's an in-office expectation of 3 or more days per week and the flexibility to work outside the office location for the other days.