Data Scientist, RWD & NLP

ExecutivePlacements.com

$135,000 - $145,000
Nov 13, 2025
Columbus, OH, United States of America

Norstella is seeking a data scientist to drive data science initiatives and innovation, particularly the development of rich multimodal real-world datasets that expedite real-world data (RWD)-driven drug development in pharma, by leveraging NLP and language models.

Requirements

  • Deep understanding of, and direct experience (2+ years) in, handling and interpreting either Electronic Health Records (EHR) and laboratory test results or genetic test results
  • Proven experience (2+ years) in NLP, with strong knowledge of techniques such as Named Entity Recognition (NER), text summarization, and topic modeling, and of their applied use in healthcare
  • Expert-level understanding of, and practical experience (1+ years) with, open-source Large Language Models (Llama 2/3, Mixtral, etc.), including prompt engineering, inference, and fine-tuning
  • Proficiency in Python and SQL, with strong experience in NLP libraries such as NLTK, spaCy, and Hugging Face Transformers, and in deep learning libraries such as PyTorch and TensorFlow
  • Experience working with the AWS cloud environment and large databases (e.g., Amazon Redshift)
  • Experience managing the ML lifecycle with open-source tools (e.g., MLflow)

Responsibilities

  • Use NLP and open-source language models such as Llama 2, Mixtral, Qwen, and BERT to extract, process, and interpret unstructured medical data from diverse sources such as EHRs, medical notes, and laboratory reports
  • Collaborate with clinical scientists and data scientists to create efficient NLP models for healthcare, demonstrating an understanding of both the technical and medical aspects of the data
  • Conduct data cleaning, preprocessing, and validation to maintain the accuracy and reliability of insights gathered from NLP processes
  • Validate data findings and present them to stakeholders clearly and effectively

Other

  • Master's or Ph.D. degree in Computational Biology, Computer Science, Data Science, Computational Linguistics, Machine Learning, or a related analytical field
  • Excellent verbal and written communication skills, with the ability to present complex data to non-technical audiences
  • All candidates must be authorized to work in the United States
  • We do not provide visa sponsorship or transfers
  • We are not currently accepting candidates who are on an OPT visa