Ensuring the operational stability, availability, and performance of production application flows for JPMorgan Chase's Commercial and Investment Bank.
Requirements
- Formal training or certification in Site Reliability concepts and 2+ years of applied experience
- Experience in troubleshooting, resolving, and maintaining information technology services
- Knowledge of applications or infrastructure in a large-scale technology environment on premises or public cloud
- Exposure to observability and monitoring tools and techniques
- Familiarity with processes in scope of the Information Technology Infrastructure Library (ITIL) framework
- Basic understanding of DevOps and SRE methodologies or processes is a must
- Knowledge of one or more general purpose programming languages or automation scripting
Responsibilities
- Analyze and troubleshoot production application flows to ensure end-to-end application or infrastructure service delivery supporting the business operations of the firm
- Improve operational stability and availability through participation in problem management
- Monitor production environments for anomalies and address issues using standard observability tools
- Assist in the escalation and communication of issues and solutions to business and technology stakeholders
- Identify trends and assist in the management of incidents, problems, and changes in support of full stack technology systems, applications, or infrastructure
Other
- Must be able to multitask in a complex production environment and quickly acquire broad knowledge of applications