The company needs to design, build, and maintain document capture applications, and is seeking a Data Scientist with a solid background in software engineering and experience building Machine Learning, NLP, and Gen AI models.
Requirements
- Deep understanding of, and some exposure to, new open-source Gen AI models
- At least 5 years of Python (or equivalent) programming experience working with ML/NLP models
- Experience in setting up supervised and unsupervised learning ML/NLP models
- Experience with ML/NLP development pipelines for large data sets, both structured and unstructured
- Knowledge and hands-on experience working with OCR products
- At least 2 years of experience in designing and developing enterprise-scale ML/NLP solutions
- Programming experience in software development and Agile processes
Responsibilities
- Designing, building and maintaining document capture applications
- Setting up supervised and unsupervised learning ML/NLP models, including data cleaning, data analytics, feature creation, model selection, ensemble methods, performance metrics, and visualization
- Building ML/NLP development pipelines for large data sets, both structured and unstructured
- Designing and developing enterprise-scale ML/NLP solutions in one or more of: Named Entity Recognition, Document Classification, Document Summarization, Topic Modelling, Dialog Systems, Sentiment Analysis, OCR text processing
- Working hands-on with OCR products
- At least 5 years of programming experience in software development and Agile processes
Other
- Bachelor's degree in Computer Science or a related technical field
- 7+ years of experience as a Data Scientist or in a related role
- Highly motivated, proactive self-starter with a strong sense of ownership and the ability to create and execute plans without daily oversight
- Critical thinker able to analyze problems, identify issues, and provide solutions
- Excellent communication and presentation skills