Rexel is looking to build, operate, and scale the company's data ingestion, data preparation, and data egress layers to centralize source data into Snowflake and transmit transformed data out of Snowflake in a manner that is reliable, fast, accurate, and trusted.
Requirements
- Hands-on experience configuring and operating managed ingestion tools such as Fivetran, including connector setup, incremental syncs, and schema drift handling.
- Strong experience with tools like Census for syncing data from Snowflake to downstream systems
- Experience using dbt Cloud for source configuration, data preparation models, snapshots, and testing.
- Aptitude working within an established dbt project structure and comfort identifying and recommending improvements for simplicity, consistency, and cost
- Working knowledge of Snowflake concepts including databases, schemas, warehouses, performance considerations, time travel, data classification (E.G. PII detection and tagging)
- Fluency working with engineering tools and workflows such as GitHub, code reviews, CI/CD, release management, and deployment pipelines
Responsibilities
- Configure & Operate Ingestion Pipelines: Set up, operate, and monitor Fivetran connections to ingest data from enterprise systems (SQL Server, Oracle, DB2, etc.) into Snowflake.
- Source System Coordination: Partner with IT teams, DBAs, and application administrators to ensure continuous source access and prevention of breaking changes.
- Historical Tracking & Snapshots: Implement efficient patterns to capture change history, soft deletes, and record versioning in support of SCD and time series use cases
- Standardization: Develop and maintain dbt models / macros to standardize raw data, including naming conventions, data type enforcement, document hygiene, and source-level testing.
- Pipeline Monitoring & Incident Response: Leverage tooling to monitor ingestion, preparation, and egress jobs for failures, delays, and data anomalies.
- Data Egress & Activation Snowflake Data Egress: Support operational and data activation use cases by configuring and maintaining Census syncs from Snowflake to internal and third-party destinations such as APIs, file servers, and cloud storage.
- Reliability of Reverse ETL Pipelines: Monitor and troubleshoot Census jobs to ensure dependable delivery of data to consuming systems
Other
- 5+ years of experience in data engineering, analytics engineering, or data platform roles, working with cloud-native, code-first tools
- Demonstrated ability to operate and support data pipelines with a high standard of reliability, accuracy, and responsiveness
- Ability to work effectively with engineers, analysts, and IT partners, clearly communicating requests, dependencies, active issues, mitigation efforts, and post-incident reviews
- Bachelor's Degree or Equivalent - Required
- Computer Science, Engineering, Information Systems, or a related technical field