JPMorgan Chase is looking to design and deliver reliable data collection, storage, access, and analytics solutions that are secure, stable, and scalable within the Consumer and Community Bank - Connected Commerce Technology
Requirements
- Advanced experience with SQL (e.g., joins and aggregations), and working understanding of NoSQL databases
- Advanced proficiency in at least one programming language including Python, Java or Scala
- Advanced proficiency in at least one cluster computing frameworks including Spark, Flink or Storm
- Advanced proficiency in leveraging Gen AI models from Anthropic (or OpenAI, or Google) using APIs/SDKs
- Advanced proficiency in Gen AI SDKs such as LangChain, LangGraph, LangSmith
- Advanced proficiency in at least one cloud data lakehouse platform such as AWS data lake services, Databricks or Hadoop
- Advanced proficiency in at least one scheduling/orchestration tool such as Airflow, AWS Step Functions or similar
Responsibilities
- Supports review of controls to ensure sufficient protection of enterprise data
- Advises and makes custom configuration changes in one to two tools to generate a product at the business or customer request
- Updates logical or physical data models based on new use cases
- Frequently uses SQL and understands NoSQL databases and their niche in the marketplace
- Develop enterprise data models, Design/ develop/ maintain large-scale data processing pipelines (and infrastructure)
- Lead code reviews and provide mentoring thru the process
- Drive data quality, Ensure data accessibility (to analysts and data scientists)
Other
- Formal training or certification on data engineering concepts and 3+ years applied experience
- Adds to team culture of diversity, opportunity, inclusion, and respect
- Ensure compliance with data governance requirements
- Ensure business alignment (ensure data engineering practices align with business goals)
- Agile methodology, TDD or BDD and CI/CD tools