Seaspan Marine Group is looking to expand its data operations through its data platform by developing, testing, and documenting an ETL project and its associated components.
Requirements
- Coursework or project experience in databases, data structures, or software development
- Familiarity with at least one programming language (e.g., Python or SQL)
- Interest in cloud platforms (e.g., Azure, AWS, or GCP)
- Interest in quantitative methods and algorithms in their application in machine learning
- Exposure to IDEs such as Visual Studio or VS Code for development and version control practices
- Exposure to Agile Project Management concepts
Responsibilities
- Develop and implement end-to-end ETL processes for data procurement and integration, aligned with the architectural design and tools available within the Seaspan Marine Group technology landscape
- Actively participate in implementing efficient data engineering workflows
- Support best practices in data modeling methodologies for the design and implementation of effective datasets
- Support the establishment and enforcement of data quality standards and practices to ensure accuracy and reliability throughout the data life cycle
- Develop data quality monitoring, alerting, and profiling processes
- Write and maintain documentation of requirements, data infrastructure, data source catalogs, data models, ETL runbooks, and data quality standards
- Create predictive models to detect trends in ingested data, highlighting potential business risks and opportunities
Other
- Currently enrolled in an undergraduate or graduate degree program in Data Science, Big Data, Machine Learning and Artificial Intelligence or a related area of study
- Strong analytical thinking, problem-solving, and curiosity to learn
- Self-driven, collaborative, and organized
- This position is required to be fully on-site based at 10 Pemberton Avenue, North Vancouver, BC.
- We require a full-time commitment Monday–Friday, 40 hours per week