BlackSky is looking for a Staff Machine Learning Quality Engineer to support the development, operation, and capability evolution of Spectra AI, BlackSky’s cutting-edge AI/ML platform. This role is instrumental in shaping computer vision products by managing the quality of the data used to train AI/ML models and overseeing the entire labeling process.
Requirements
- Minimum of eight (8) years of hands-on experience as a data quality engineer, imagery analyst, GIS analyst, or quality assurance engineer in the space of satellites, computer vision, or machine learning.
- Experience interpreting satellite imagery to identify objects, behaviors, or activities and working with GIS datasets.
- Hands-on experience with annotation platforms such as Labelbox, SuperAnnotate, Amazon SageMaker, or CVAT, and with managing remote annotation workforces.
- Experience with quality assurance practices, model testing & evaluation, and computing metrics & statistics from data.
- Experience working with or managing data labeling teams such as CloudFactory, Hive, or iMerit.
- Experience with AI advancements in image annotation such as the Segment Anything Model (SAM), embeddings, and other AI-assistance tools.
- Experience writing code and scripts, especially for processing large amounts of data.
- Proficiency with Python for automation and data manipulation tasks.
- Knowledge and experience working in an AWS Cloud environment.
Responsibilities
- Own and architect the vision, strategy, and technical roadmap for BlackSky’s data labeling pipeline.
- Lead and manage data labeling teams, providing oversight for labeling campaigns.
- Act as a key technical leader, partnering with stakeholders across the broader team (e.g., computer vision, MLOps, platform) to meet customer-driven timelines and product requirements.
- Conduct research for AI product development: identifying geographic areas where objects exist, developing object ontologies, building annotation instructions, and identifying imagery for annotation.
- Collaborate with machine learning and product teams to define metrics and to assess and report on the quality of machine learning models in development and production using statistical and quality assurance approaches.
- Create processes, tools, and scripts to manage and track data quality and labeling performance.
- Take part in the entire project lifecycle from requirements development to deployment.
Other
- Bachelor’s Degree or higher in one of the following fields: computer science, GIS, data science, mathematics, physics, statistics, or another scientific field, or equivalent experience (four years).
- Able to independently manage work projects: defining, managing, and executing them against fixed objectives.
- Collaborates well with others and is able to communicate ideas to those with other backgrounds.
- Attention to detail and ability to define, enforce, and follow rigorous process controls for managing data.
- The program this role will support requires U.S.-born citizenship.
- Demonstrated ability to lead and manage small teams.