SourceFly is seeking a Data Scientist to support a government customer in developing and delivering high-quality predictive modeling solutions to complex business problems.
Requirements
- Experience with programming languages including: R, Python, JavaScript, Visual Basic
- Experience with creating VBA applications and macros to structure, manage, and wrangle key datasets
- Experience with core data science libraries – Pandas, NumPy, Matplotlib, Plotly, etc.
- Experience with Anaconda distribution of Python for package management and deployment
- Familiarity with command-line shell programming (PowerShell, cmd, etc.)
- Proficiency with SQL programming
- Familiarity with RESTful APIs, web scraping, and processing unstructured data
Responsibilities
- Lead and perform hands-on analysis and modeling involving the creation of intervention hypotheses and experiments, assessment of data needs and available sources, determination of optimal analytical approaches, performance of exploratory data analysis, and feature generation (e.g., identification, derivation, aggregation).
- Collaborate with mission stakeholders to define, frame, and scope mission challenges where big data interventions may offer important mitigations, and develop project plans with key milestones, detailed deliverables, robust work-tracking protocols, and risk mitigation strategies.
- Demonstrate proficiency in extracting, cleaning, and transforming CBP transactional and mission data associated with an identified problem space to build predictive models and develop appropriate supporting documentation.
- Leverage knowledge of a variety of statistical and machine learning techniques and methods to define and develop programming algorithms; train, evaluate, and deploy predictive analytics models that directly inform mission decisions.
- Execute projects including those intended to identify patterns and/or anomalies in large datasets; perform automated text/data classification and categorization as well as entity recognition, resolution and extraction; and named entity matching.
- Brief project management, technical design, and outcomes to both technical and non-technical audiences, including senior government stakeholders, throughout the model development/project lifecycle through written and in-person reporting.
Other
- Bachelor’s Degree (required) in operations research, industrial engineering, mathematics, statistics, computer science/engineering, or other related technical fields with equivalent practical experience.
- 7-12 years of relevant experience
- Selected applicants must be US citizens and able to obtain and maintain a government security clearance
- Active Top Secret clearance desired
- Master’s Degree in mathematics, statistics, computer science/engineering, or other related technical fields with equivalent practical experience desired