Rose International is looking for a Machine Learning Data Engineer to work at the intersection of data engineering and applied machine learning to process and transform complex multimedia data into complete machine learning datasets suitable for consumption by researchers.
Requirements
- Strong knowledge of Python in the context of data engineering and data processing (SQL, data cleaning, anomaly detection)
- Working understanding of ML training - specifically how data quality impacts ML training outcomes from PyTorch workflows
- Demonstrable programming experience in Python using common ML and data libraries, i.e. numpy, scipy, pandas
- Proficiency in Linux and shell scripting
- Working knowledge of audio, image and video formats
- Knowledge of multimodal data sets (Audio/Video, Optitrack, multi-Camera/Sensor)
- Experience with relational and graph / NoSQL databases
Responsibilities
- Design, develop, and maintain scalable data-processing pipelines for large volumes of multimedia (audio, video) and sensor data (e.g. IMU), ensuring reliability and reproducibility.
- Gather and interpret processing requirements from stakeholders, translating them into practical technical solutions and devising novel approaches where needed.
- Perform diverse data-processing operations, from mathematical transformations and filtering to feature extraction, synchronization, and inference through ML models.
- Interface with various internal tooling and training frameworks to prepare raw data for machine learning, including validation, transformation, and quality assurance.
- Collaborate with machine learning researchers to integrate research prototypes into production pipelines.
- Ensure compliance with data governance, security, and relevant standards.
- Meet with researchers/research assistants to understand issues/solutions
Other
- Bachelor’s degree in a relevant technical field (e.g. Computer Science, Data Science) with industry experience in machine learning or data engineering; or equivalent combination of education and experience.
- 5+ years of experience
- Only those lawfully authorized to work in the designated country associated with the position will be considered.
- Must be willing to work onsite in Redmond, WA, USA
- Temporary position with an estimated duration of 13 months