COLSA is seeking a candidate to develop and apply interdisciplinary methods, algorithms, statistics, and research techniques to extract knowledge and provide insights from structured and unstructured data.
Requirements
- Familiarity with advanced machine learning, data science techniques, and mathematical approaches
- Working knowledge of current operating systems and programming languages
- 2+ years of programming experience with Python, R, Julia, TensorFlow, CUDA, JavaScript, Scala, Java, Unix/Linux, C, C++
- Experience implementing and using generative AI, such as large language models (LLMs)
- Experience designing, building, and maintaining scalable, reliable data pipelines (ETL/ELT workflows)
- Familiarity with MLOps best practices
- Knowledge of containerization (Docker, Podman)
Responsibilities
- Analyzes data systems, including Big Data systems
- May perform statistical analysis and provide input for reports and dashboards
- Contributes to the development of data products, reports, dashboards, and other display techniques
- Evaluates data, algorithms, and their interaction to improve algorithm performance
- May apply advanced statistics, including natural language processing and machine learning, to create solutions
- May assist in data modeling and data virtualization
- May write code to preprocess and clean data
Other
- Bachelor's degree or higher in computer science, data science, engineering, math, statistics, operations research, or a related field, or equivalent experience
- Minimum of 5 to 7 years of related experience
- U.S. Citizenship required
- Active DoD Top Secret security clearance with eligibility for DIA-SCI access. The selected candidate must pass a DIA CI polygraph within 60 days of hire
- Proactive self-starter capable of finding and solving problems with little guidance