At U.S. Bank, the business problem is to support production applications, automate discoveries, eliminate incidents, and improve application availability, latency, performance, efficiency, and proactive monitoring.
Requirements
- Proven experience as a Site Reliability Engineer
- Strong knowledge of monitoring tools and incident management
- Proficiency in database technologies (DB2, Oracle, Postgres, SQL scripting)
- Strong Linux skills (command line, scripting, cron)
- System administration skills (restarting JVMs, F5 Pool management, autosys, etc)
- Experience with observability, monitoring and logging tools such as Data Dog, Splunk, AppDynamics, Kibana, etc.
- Experience with AWS or Azure services
Responsibilities
- Developing, coordinating, and conducting technical reliability studies on engineering designs to assess the likelihood that a product/process performs its intended function over the intended lifecycle.
- Measuring and analyzing the reliability of the design, materials, processes, cost, and final products of production.
- Recommending design or test methods and statistical process control procedures for achieving required levels of product reliability.
- Completing risk analysis studies of new designs and processes.
- Undertaking testing and analysis on failures, proposing changes in design or formulation to improve system and/or process reliability.
- Improving application availability, latency, performance, efficiency, and effective proactive monitoring.
- Supporting production applications and proactively looking for ways to automate discoveries, eliminate incidents from recurring and/or reduce the time it takes to get customers back up and running.
Other
- Bachelor's degree, or equivalent work experience
- Five to seven years of relevant work experience in business and risk analysis, IT Service Management, production support, product/project management, or application development
- Must be open to doing production support, on call rotation and occasional after-hours work
- Strong communication and collaboration abilities
- Hybrid/flexible schedule, with an in-office expectation of 3 or more days per week