Reducto is looking to solve the problem of ingesting real-world enterprise data with state-of-the-art accuracy, specifically extracting information from unstructured file formats like PDFs and spreadsheets.
Requirements
- 2+ years of experience with training, fine tuning, and evaluating ML models used in production systems
- Exceptional skills in Python or similar
- Well-versed in both traditional computer vision and VLMs
- Ability to build data pipelines, evaluate model performance, and integrate models into the product
- Experience with novel techniques to improve LLM accuracy
- Familiarity with Streamlit or similar tools
- Strong understanding of machine learning and AI concepts
Responsibilities
- Training and deploying new state of the art models for parsing and interpreting unstructured data
- Experimenting with novel techniques to improve LLM accuracy
- Build data pipelines, evaluate model performance, and integrate models into the product
- Working directly with the founders and customers to shape the product direction and engineering strategy
- Training, fine tuning, and evaluating ML models used in production systems
- Building tools as needed, like a quick Streamlit app to test hypotheses or create a dataset
- Debugging, experimenting, and iterating fast on the full development lifecycle
Other
- 2+ years of experience
- Philosophy: high bar for quality, ships fast, with high agency, and actively jumps in to fix problems
- Unlimited PTO, lunch, reimbursed transportation, insurance, health and wellness budget, and parental leave
- In-person role at the office in SF, requires working hard and moving quickly
- Degree requirements not specified, but experience and skills are more important