MTSI is looking to improve data quality across various datasets, particularly unstructured data, and automate data extraction and transformation processes for information migration.
Requirements
- Proficiency in R and Python, with experience in data manipulation and analysis, particularly with unstructured data.
- Experience using GitHub for version control and collaborative coding.
- applying techniques such as natural language processing (NLP) and text mining to extract relevant insights.
Responsibilities
- Assist with conducting assessments of data quality across various datasets, focusing on unstructured data such as word, pdf, or PowerPoint, images, and multimedia files.
- Assist with information migration by leveraging R and Python to automate data extraction and transformation processes, particularly for unstructured data.
- writing scripts to parse project files, extract relevant information, and formatting it for SharePoint.
- Facilitate collaboration with team members and ensure that all changes are tracked and managed effectively using GitHub for version control of scripts and documentation.
- Assist with data quality efforts by utilizing R and Python to clean, transform, and analyze unstructured data, applying techniques such as natural language processing (NLP) and text mining to extract relevant insights.
- Assist with preparing reports on data quality metrics, findings, and recommendations for improvement.
Other
- Be an advocate of and abide by MTSI's employee first culture and core values.
- Strong attention to detail and analytical thinking skills.
- Experience working collaboratively in a team environment and independently.
- Ability to work multiple projects in parallel.
- Ability to obtain/maintain a government security clearance.
- U.S. Citizenship is required for most MTSI positions.