GPRS needs to store and analyze large volumes of data to support business needs, including developing a master data management system, integrating data from various sources, and providing qualified data sets at appropriate intervals.
Requirements
- Proficiency in programming languages such as Python, SQL, and Java
- Experience with data processing frameworks like Apache Spark or Hadoop
- Knowledge of database systems (e.g., MySQL, PostgreSQL, MongoDB)
- Familiarity with cloud platforms (e.g., AWS, Azure, Google Cloud)
- Familiarity with the operationalization of machine learning in a business environment
- Minimum 2 years of experience as a Data Engineer, or similar role
- MS Data Analytics, Data Engineering, or similar, preferred
Responsibilities
- Support GPRS’ definition and build-out of a Master Data Management system, including data governance, integration, quality, and system design and architecture
- Maintain appropriate databases, warehouses, lakes, etc., for data integrity and availability with a focus on scalability and optimization
- Build and maintain data pipelines and automated ETL processes for data ingestion and transformation
- Work closely with stakeholders to identify additional data sources that are relevant to support a date-driven organization
- Support the organization with ad hoc data manipulation exercises, especially with M&A activities and data migrations
- Design, develop, and maintain data pipelines that extract, transform, and load (ETL) data from multiple sources into data warehouses or data leaks.
- Partner with business unit stakeholders to define and optimize the data processing systems to the appropriate quality measures.
Other
- Bachelor’s degree in Business, IT, Data Analytics, or similar discipline and/or 4 years of work experience in lieu of degree
- Strong problem-solving skills and attention to detail
- Excellent communication and teamwork abilities