Amplity is seeking to deliver high-impact data products for clients in the healthcare industry by applying expertise in text mining, natural language processing, and data analysis.
Requirements
- Text Mining & NLP: Proficiency in regular expressions (regex) and NLP tools and techniques; familiarity with Linguamatics I2E preferred.
- Database Management: Skilled in SQL and MongoDB for retrieving, managing, and analyzing datasets.
- Data Visualization: Experience creating impactful visualizations using Plotly or similar tools.
- API Integration: Knowledge of integrating OpenAI API and other large language model (LLM) technologies for data mining and summarization tasks.
- Programming & Data Analysis: Proficiency in Python and Pandas for data wrangling, ETL processes, and statistical analysis.
- Experience with Jupyter Notebooks.
Responsibilities
- Utilize NLP tools to mine and structure insights from a database of 80M+ unstructured medical records.
- Retrieve and analyze raw text and structured data from MongoDB and SQL databases.
- Integrate OpenAI API and other large language model (LLM) technologies for data mining and summarization tasks.
- Transform and clean data for analysis, ensuring accuracy, consistency, and usability.
- Perform data analysis using Python, Jupyter Notebooks, and Pandas, producing descriptive statistics and aggregations.
- Develop impactful visualizations using Plotly or similar tools to communicate key insights.
- Collaborate with client stakeholders to troubleshoot queries and deliver training as needed.
Other
- Bachelor’s degree from an accredited institution.
- 2–5 years of experience in programming and data analysis.
- Willingness to travel 20% or more of the time to client sites and meetings.
- Eastern time zone preferred.
- Must successfully complete a skills assessment as part of the application process.