ManTech seeks a motivated, career- and customer-oriented Data Scientist to support our DoD contract out of Reston, VA or Fort Belvoir, VA.
Requirements
- 5+ years of experience in applied data science or ML roles, with strong Python skills and experience in NLP and LLM implementation
- 5+ years of experience with data exploration, data cleaning, data analysis, data visualization, or data mining
- Exposure to production-level systems, data lake environments, and streaming data (e.g. Kafka)
- Experience implementing end-to-end ML workflows, from data prep to deployment and evaluation
- Ability to quickly learn infrastructure or systems concepts (e.g. how pipelines interface with data lakes)
- Experience collaborating with MLOps and infrastructure engineers to ensure robust model deployment, monitoring, and retraining pipelines
- Experience with distributed data/computing tools, including MapReduce, Hadoop, Hive, EMR, Spark, Gurobi, or MySQL
Responsibilities
- Lead and perform hands-on data and threat/intel analysis leading to the development of analytics solutions (e.g. predictive models, visual analytics reports) that support DTRA users in conducting mission-critical activities.
- Demonstrate proficiency in extracting, cleaning, and transforming DTRA transactional and associated data sets within an identified problem space to build predictive models, and develop appropriate supporting documentation.
- Leverage knowledge of a variety of statistical and machine learning techniques to develop, evaluate, and deploy new predictive analytical models that directly inform mission decisions.
- Utilize and explore a variety of statistical/modeling tools and languages to compare and assess the best-performing machine learning results.
- Execute projects, including those intended to identify patterns and/or anomalies in large datasets; perform automated text/data classification and categorization; and perform entity recognition, resolution, extraction, and named entity matching.
- Design, implement, and iterate on ML models for document classification, extraction, summarization, and search.
- Take ownership of data science workflows that interact with a production system streaming millions of documents per week.
Other
- Motivated, career- and customer-oriented
- Must have an active Top Secret/SCI clearance
- Sedentary work