Job Board
LogoLogo

Get Jobs Tailored to Your Resume

Filtr uses AI to scan 1000+ jobs and finds postings that perfectly matches your resume

JP Morgan Chase Logo

Lead Site Reliability Engineer

JP Morgan Chase

Salary not specified
Sep 2, 2025
New York, NY, USA
Apply Now

Firm wide Planning & Analysis (FW P&A) at JPMorgan Chase is looking to improve financial reporting, forecasting, budgeting, and strategic oversight by leveraging data engineering and cloud technologies.

Requirements

  • Demonstrated proficiency in reliability, scalability, performance, security, enterprise system architecture, toil reduction, and other site reliability best practices
  • Extensive experience with cloud platform (AWS) in setting up infrastructure using Terraform
  • Fluent in at least one programming language such as: Python, Java/Spring Boot, .Net
  • Advanced knowledge of software applications and technical processes with emerging depth in one or more technical disciplines
  • Proficient knowledge and experience in observability such as white and black box monitoring, service level objective alerting, and telemetry collection using tools such as Grafana, Dynatrace, Prometheus, Datadog, Splunk, and others
  • Proficient with continuous integration and continuous delivery tools like Jenkins, GitLab, or Terraform
  • Proficient with container and container orchestration: (ECS, Kubernetes, Docker)

Responsibilities

  • Consistently models and champions site reliability culture and practices and exerts technical influence throughout your team
  • Leads initiatives to improve the reliability and stability of your team’s applications and platforms using data-driven analytics to improve service levels
  • Drives collaboration with your team to identify comprehensive service level indicators and the stakeholder partners to establish reasonable service level objectives and error budgets with your customers
  • Offers a high level of technical expertise within one or more technical domains and proactively identifies and solves for technology-related bottlenecks in your areas of expertise
  • Serves as the main point of contact during major incidents for your application and have the skills to identify and solve the issue quickly to avoid financial loss to the business
  • Documents and shares knowledge within your organization via internal forums and communities of practice

Other

  • Possess 7+ years of experience, ideally working with Data/Python applications in Production environment
  • Actively self-educates, evaluates new technology, and recommends suitable ones
  • Demonstrate strong knowledge across multiple technical domains and advise others on the technical and business issues facing them
  • Facilitate resiliency design reviews, deconstruct complex problems into digestible work for other engineers, and act as a technical lead for medium to large sized products
  • Hold a leadership role by providing advice and mentorship to other engineers on your team and line of business