Allen Control Systems (ACS) is developing an autonomous gun turret that uses computer vision and control systems to target and neutralize small drones and loitering munitions. The company needs a Data Engineer to build and manage the data pipelines required for training these computer vision models.
Requirements
- 3+ years of experience in data engineering or equivalent fields.
- Proficient in using AWS for data management and processing.
- Proficient in Python for scripting and data processing; proficient with SQL and Linux.
- Solid understanding of data structures and systems design for orchestrating data-related workflows.
Responsibilities
- Design and own end-to-end image+video pipelines for computer vision model training: multi-source ingestion, QA and visualization, standardization, and organization.
- Develop and use synthetic data generation workflows to create realistic synthetic training data for computer vision models.
- Coordinate collection of real-world data; coordinate label creation and QA with labelers.
- Develop and use data quality tooling: metrics for balance, drift, and annotation error; active-learning sampling to target gaps; feedback loops from production back to curation.
- Implement and own dataset versioning, release management, and lineage+metadata cataloging.
Other
- Proven ability to communicate well across engineering teams, and write and maintain effective documentation.
- Bachelor’s or Master’s degree in Computer Science or a related field.