This role addresses the problem of efficiently extracting, transforming, and loading data from various sources into a format usable for analytics, and of designing robust data models to support these processes.
Requirements
- Strong technical capabilities and a good sense of database performance
- Sound understanding of data modelling standards
- Capability to design an efficient way of processing high volumes of data
- Capability to design and implement models, capabilities, and solutions to manage data within the enterprise (structured and unstructured, data archiving principles, data warehousing, data sourcing, etc.)
- Capability to review (profile) a data set to establish its quality against a defined set of parameters and to highlight data that requires corrective action (cleansing)
- Capability to discover, integrate, and ingest all available data from the machines that produce it, as fast as it is produced, in any format, and at any quality
- Understanding of the difference between on-premises and cloud-based data integration technologies
Responsibilities
- Leads the delivery of data extraction, transformation, and loading (ETL) from disparate sources into a form consumable by analytics processes, for projects of moderate complexity, drawing on strong technical capabilities and a good sense of database performance
- Designs, develops, and produces data models of relatively high complexity, applying a sound understanding of data modelling standards to recommend the right model for each requirement
- Designs efficient ways of processing high volumes of data where a group of transactions is collected over a period (batch processing)
- Designs and implements models, capabilities, and solutions to manage data within the enterprise (structured and unstructured, data archiving principles, data warehousing, data sourcing, etc.), including the data models, storage requirements, and migration of data from one system to another
- Reviews (profiles) data sets to establish their quality against a defined set of parameters and highlights data that requires corrective action (cleansing)
- Discovers, integrates, and ingests all available data from the machines that produce it, as fast as it is produced, in any format, and at any quality
- Understands the difference between on-premises and cloud-based data integration technologies
Other
- Excellent interpersonal skills to build a network across a variety of departments in the business, understand their data, and deliver business value
- May interface and communicate with program teams, management, and stakeholders as required to deliver small to medium-sized projects