CGI Federal is looking for a Data Scientist to lead state-of-the-art solutions for a gratifying mission with worldwide impact, delivering quality production solutions that embody solid MLOps practices to significantly expand customer capabilities.
Requirements
- 7+ year's experience obtaining data from multiple, disparate data sources including structured, semi-structured and unstructured data.
- Demonstrable expertise in an area of data science or analytics (e.g. machine learning, deep learning, NLP, computer vision, predictive modeling and forecasting, statistics).
- Proficiency in tools and languages such as Graph DBs, PostgreSQL/SQL, Python, or Java.
- Experience using ML algorithms packages contained in AWS SageMaker or open-source libraries (Python, C++, etc.)
- Experience developing, verifying, and monitoring AI/ML Models
- Familiarity with DevOps processes and code versioning tools such as git
- Experience With Hadoop, Spark, Or Other Parallel Computing Processes
Responsibilities
- Support development of Artificial Intelligence (AI)/ML based pipelines for data inference.
- Apply supervised, unsupervised, and NLP Machine Learning (ML) algorithms for clustering, classification dimensionality reduction, and metadata extraction & content analysis.
- Analyze complex datasets, detect trends and patterns, develop conclusions, and provide recommendations to improve data ingestion, increase data scaling, and enhance data exploitation to ensure more interoperable and secure data.
- Verify data quality and/or ensure it via data cleaning for accuracy and completeness.
- Perform statistical analysis and utilize results to improve the model and data pipeline.
- Work with data engineers to design, develop, and maintain ETLs to extract, transform the data into existing data warehouses.
- Automate data analysis to improve and enhance data.
Other
- BS or MS degree in Computer Science, Engineering, Information Technology Management or related technical field.
- Master's degree in Computer Science, Engineering, Information Technology Management or related technical field.
- Demonstrated experience with enhancing, managing, optimizing performance, and processing large volumes of data
- Familiarity with industry best practices for software/hardware development when processing large data sets
- Experience in the following required task areas: machine learning, statistical modeling, time-series forecasting, and/or geospatial analytics