GE Vernova is seeking a Data Engineer to design, build, and maintain the infrastructure and data pipelines for AI/ML applications in the energy domain, enabling effective data collection, processing, storage, and analysis to accelerate the path to more reliable, affordable, and sustainable energy.
Requirements
- Proficiency in Python, SQL, and at least one other programming language commonly used in data engineering (e.g., Scala, Java).
- Experience with relational databases (e.g., PostgreSQL) and NoSQL databases (e.g., MongoDB, Cassandra).
- Familiarity with cloud platforms like AWS, Azure, or GCP for deploying and managing data systems.
- Extensive experience with ETL (Extract, Transform, Load) processes and with automating data pipeline workflows.
- Several years (minimum of 3) of experience in data engineering or a related field, with expertise in designing scalable data solutions.
- Familiarity with big data technologies like Hadoop, Kafka, or Spark for processing large-scale data.
- Hands-on experience with GraphDB, SQL/NoSQL databases, and data warehousing technologies like Snowflake or Redshift.
Responsibilities
- Design and maintain database architectures, schemas, and data models tailored to grid innovation and energy system applications.
- Use efficient data storage technologies (e.g., relational databases, data lakes, NoSQL stores) to ensure scalable and secure data access.
- Build, optimize, and maintain reliable data pipelines for data ingestion, cleaning, transformation, and feature extraction from structured and unstructured sources.
- Develop and manage integrations with internal and external data sources and APIs to enable seamless data flow.
- Identify new and relevant datasets to improve product capabilities and decision-making across the business.
- Automate data integration and transformation workflows for diverse data formats and operational needs.
- Monitor performance and scalability of data systems, and implement enhancements to increase efficiency and reliability.
Other
- PhD, Master’s, or Bachelor’s degree in Computer Science, Electrical/Computer Engineering, or a related field with a focus on data engineering or electric power engineering.
- Minimum of 3 years of significant experience in data engineering, with hands-on expertise in building and managing data pipelines.
- Ability to collaborate effectively in a team environment, contributing ideas and taking initiative to solve problems.
- Ability to adapt to a dynamic, multitasking environment and to address evolving challenges.
- Effective communication skills, with the ability to collaborate smoothly with cross-functional teams and resolve conflicts proactively.