Geotab is seeking a Data Operations Developer to design, build, and optimize automated end-to-end data pipelines to minimize time from insight to production and ensure data quality and governance.
Requirements
- 3-5 years of experience in Data Engineering or a similar role.
- 3-5 years of technical experience with Structured Query Language (SQL) and building ETL/ELT production pipelines in Python.
- Required knowledge of workflow orchestration tools (e.g., Apache Airflow) and CI/CD processes/tools like GitLab runners.
- Experience working in cloud-based infrastructure, specifically Google Cloud Platform (GCP) and Big Data environments like BigQuery.
- Familiarity with Linux command line, Jira for issue management, and Python package development is highly regarded.
- Highly organized with the ability to engage with all levels of the organization and a flexibility to stay current with technology trends.
- Post-Secondary Diploma/Degree specialization in Computer Science, Engineering, Mathematics, or a related field.
Responsibilities
- Data Pipeline Development & Optimization: Design and build automated ETL/ELT processes using tools like Apache Airflow to extract, transform, and load data into Google BigQuery or data lakes.
- Orchestration and Automation: Automate repetitive tasks within Apache Airflow, such as data ingestion and quality checks, to improve organizational efficiency.
- Performance Optimization: Continually monitor and optimize the performance and reliability of data pipelines to handle growing data volumes.
- Data Quality & Governance: Build automated tests and validation rules to identify and fix data issues, ensuring all data processes comply with regulations and standards.
- Release Management: Coordinate with development and operations teams to manage the smooth deployment of new or updated pipelines using robust version control.
- Collaboration & Support: Work cross-functionally to understand data needs and provide troubleshooting support to resolve data-related issues in a timely manner.
- On-Call Support: Participate in a 24x7 on-call rotating schedule as required to ensure pipeline reliability.
Other
- Post-Secondary Diploma/Degree specialization in Computer Science, Engineering, Mathematics, or a related field.
- Flex working arrangements
- Home office reimbursement program
- Baby bonus & parental leave top up program
- Online learning and networking opportunities