Designing, building and maintaining document capture applications using Machine Learning NLP Models and Gen AI Models for a company located in Addison, Texas
Requirements
- At least 5 years of programming experience in software development and Agile processes
- At least 5 years of Python (or equivalent) programming experience working with ML/NLP models
- Experience with ML/NLP development pipelines for large data sets, both structured and unstructured
- At least 2 years' experience in designing and developing enterprise-scale ML/NLP solutions
- Knowledge and hands-on experience working with OCR products
- Deep understanding of, and some exposure to, new open-source Gen AI models
- Experience with Named Entity Recognition, Document Classification, Document Summarization, Topic Modelling, Dialog Systems, Sentiment Analysis, OCR text processing
Responsibilities
- Designing, building and maintaining document capture applications
- Building Machine Learning NLP Models
- Working with Gen AI Models
- Setting up supervised and unsupervised learning ML/NLP models
- Data cleaning, data analytics, feature creation, model selection and ensemble methods, performance metrics and visualization
- Designing and developing enterprise-scale ML/NLP solutions
- Working with OCR products
Other
- 7+ years of experience as a Data Scientist or in a related role
- Bachelor's degree in Computer Science, or a related technical field
- Highly motivated, proactive self-starter with a strong sense of ownership and the ability to create and execute plans without daily oversight
- Collaborating with a diverse set of partners and stakeholders from various lines of business
- Master's degree in Computer Science/Data Science, or a related technical field