Machine Learning is integral to the continued success of Turnitin, and the company is looking to deliver cutting-edge, well-engineered Machine Learning systems to integrate into a broad suite of learning, teaching, and integrity products.
Requirements
- Experience working with text data to build Deep Learning and ML models, both supervised and unsupervised.
- A strong understanding of the math and theory behind machine learning and deep learning.
- Software engineering background with at least 8 years of experience (we use Python, SQL, Unix-based systems, git, and github for collaboration and review).
- Machine / Deep Learning development skills, including experiment tracking (we use AWS SageMaker, Hugging Face, transformers, PyTorch, scikit-learn, Jupyter, Weights & Biases).
- An understanding of Language Models, using and training / fine-tuning and a familiarity with industry-standard LM families.
- Familiarity in coding for at-scale production, ranging from best practices to building back-end API services or stand-alone libraries.
- Essential dev-ops skills (we use Docker, AWS EC2/Batch/Lambda)
Responsibilities
- Work with subject matter experts and product owners to determine what questions should be asked and what questions can be answered.
- Work with subject matter experts to curate, generate, and annotate data, and create optimal datasets following responsible data collection and model maintenance practices.
- Answer questions and make trainable datasets from raw data, using efficient SQL queries and scripting languages, visualizing when necessary.
- Develop and tune Machine Learning models, following best practices to select datasets, architectures, and model parameters.
- Utilize, adopt, and fine-tune Language Models, including third-party LLMs (through prompt engineering and orchestration) and locally hosted LMs.
- Optimize models for scaled production usage.
- Write clean, efficient, and modular code, with automated tests and appropriate documentation.
Other
- Master's degree or PhD in Computer Science, Electrical Engineering, AI, Machine Learning, applied math or related field, with relevant industry experience, or outstanding previous achievements in this role.
- Excellent communication and teamwork skills.
- Fluent in written and spoken English.
- Showcase previous work (e.g. via a website, presentation, open source code)
- Ability to work with subject matter experts and product owners