Unlocking the secrets held by a data set and solving global challenges using IoT, machine learning, and artificial intelligence at Booz Allen
Requirements
3+ years of experience in applied data science or ML roles, including using Python and NLP and LLM implementation
3+ years of experience with data exploration, data cleaning, data analysis, data visualization, or data mining
Experience with production-level systems, data lake environments, and streaming data, including Kafka
Experience implementing end-to-end ML workflows from data prep to deployment and evaluation
Ability to quickly learn infrastructure or systems concepts, including how pipelines interface with data lakes
Experience with Distributed data or computing tools, including MapReduce, Hadoop, Hive, EMR, Spark, Gurobi, or MySQL
Experience with visualization packages, including Plotly, Seaborn, or ggplot2
Responsibilities
Work closely with clients to understand their questions and needs, and then dig into their data-rich environments to find the pieces of their information puzzle
Guide teammates and lead the development of algorithms and systems
Use the right combination of tools and frameworks to turn sets of disparate data points into objective answers to advise your clients as they make informed decisions
Design, implement, and iterate on ML models for document classification, extraction, summarization, and search
Take ownership of data science workflows that interact with a production system streaming millions of documents per week
Implement end-to-end ML workflows from data prep to deployment and evaluation
Use data exploration, data cleaning, data analysis, data visualization, or data mining to find the answers in the data
Other
TS/SCI clearance
Bachelor's degree
Ability to work with clients to understand their questions and needs
Ability to guide teammates and lead the development of algorithms and systems