PNC Bank is seeking a highly skilled Senior Data Engineer with extensive experience in Hadoop-based big data ecosystems to design, develop, and maintain scalable data pipelines and infrastructure that support advanced analytics and business intelligence initiatives.
Requirements
- 5+ years of experience in data engineering, with a strong focus on Hadoop ecosystem tools.
- Strong programming skills in Python, Java, or Scala.
- Deep understanding of HDFS, Hive, Spark, Sqoop, and Kafka.
- Experience with workflow orchestration tools such as Apache Airflow or Oozie.
- Proficiency in SQL and experience working with relational and non-relational databases.
- Experience working in cloud environments (AWS/GCP/Azure) is preferred.
- Familiarity with data governance and security compliance in the financial services industry.
Responsibilities
- Design, build, and optimize large-scale data pipelines using Hadoop and related technologies (Hive, Pig, HDFS, Spark, Sqoop, etc.).
- Collaborate with data architects and business stakeholders to understand data requirements and translate them into scalable data solutions.
- Implement data integration and ETL processes from diverse sources into Hadoop data lake environments.
- Ensure data quality, data security, and governance standards are upheld across systems.
- Troubleshoot and resolve issues in complex data environments.
- Monitor and maintain performance, availability, and reliability of big data platforms.
- Mentor junior engineers and contribute to the development of best practices and engineering standards.
Other
- Bachelor’s or Master’s degree in Computer Science, Information Systems, Engineering, or a related field.
- Work with cross-functional teams to enable data-driven decision-making across the organization.
- Collaborate with DevOps to support CI/CD and automation processes.