Onto Innovation is looking to solve yield, device performance, quality, and reliability issues in the semiconductor industry by leveraging AI and machine learning technologies.
Requirements
- Python proficiency (data wrangling, APIs, packaging); comfort on Linux and with Git.
- Built at least one LLM app using a framework such as LangChain, LlamaIndex, or Semantic Kernel.
- Hands-on with vector search (e.g., FAISS/Weaviate/Milvus) and embeddings; understands chunking, metadata, and hybrid search basics.
- Familiarity with RAG and prompt engineering; can measure quality (groundedness/relevance) and reduce hallucinations.
- Basic backend skills (REST/JSON, auth, environment secrets); experience containerizing with Docker.
- Comfortable reading technical manuals/logs and collaborating with non-software teammates.
- Experience with agent frameworks (LangGraph, AutoGen, CrewAI) or implemented tool-calling/plan-execute loops.
Responsibilities
- Prototype AI assistants & agents for field workflows: guided recipe setup, log triage, playbook lookups, parts/alarms advice, and fleet-wide health checks.
- Build retrieval systems (RAG): ingest manuals, specs, ticket notes, recipes, logs, and best-practice docs; design chunking, embeddings, and indexing; tune prompts and retrieval for accuracy/latency.
- Connect AI to our tools and data: stand up MCP servers (Model Context Protocol) and other connectors to safely expose internal systems (document stores, MES, issue trackers, telemetry APIs) to LLMs.
- Fine-tune or adapt models (e.g., LoRA/QLoRA) for domain terms, error codes, and tool-specific intents when retrieval alone isn’t enough.
- Evaluate and harden: set up offline & online evals for groundedness/relevance; add guardrails, observability, and traceability; write runbooks.
- Ship small apps: package prototypes behind simple APIs or lightweight UIs that field engineers can use (web chat, Slack/Teams bots, or CLI).
- Data plumbing: parse messy PDFs/images/CSVs; normalize schemas for recipes, events, alarms, SPC/trace data.
Other
- BS in CS/EE/CE/ME (or equivalent experience).
- Ability to work in a collaborative environment and communicate effectively with non-technical teammates.
- Willingness to learn and adapt to new technologies and workflows.
- Ability to document everything (design notes, runbooks, and “how to” guides) and gather field feedback for iteration.
- US Citizenship, US Permanent Residency, or protected individual status may be required for positions involving access to technical data.