ZoomInfo is looking to solve the problem of building a next-generation go-to-market platform using high-quality GTM data, agentic workflows, and a robust intelligence layer to give sales, marketing, and revenue operations teams a competitive advantage.
Requirements
- Proficiency in PyTorch or TensorFlow for model development and fine-tuning
- Experience with vector databases (Pinecone, Weaviate, FAISS, OpenSearch) and hybrid retrieval systems
- Strong software engineering skills in Python; familiarity with Go/Java is a plus
- Knowledge of MLOps tools: Docker, Kubernetes, GitOps, feature stores, model registries
- Hands-on experience with LLM fine-tuning techniques (LoRA, quantization, distillation) is a plus
- Understanding of agentic workflows and multi-agent systems
- Experience building language-agnostic ML solutions and cross-lingual models
Responsibilities
- Improve data quality for ZoomInfo's foundation datasets including firmographics, demographics, C-suite profiles, workforce information, titles, skill sets, scoops, intent signals, and web-extracted data
- Design and implement data validation pipelines and quality metrics to ensure high-fidelity information across millions of records
- Build and fine-tune embedding models using large language models (Llama) and small language models (BERT) for various text understanding tasks
- Develop language-agnostic clustering and classification models using vector search technologies
- Optimize embedding models for production deployment at petabyte scale
- Build high-recall NER models to extract people, organizations, locations, and industry-specific entities from web-extracted data
- Deploy and maintain ML models serving millions of users daily with sub-second latency requirements
Other
- 3 - 5 years (1+ years post-PhD) of hands-on ML/NLP experience with demonstrated impact on production systems
- Ability to work effectively in cross-functional teams and communicate technical concepts to non-technical stakeholders
- Experience mentoring junior team members and contributing to team knowledge sharing
- Strong problem-solving skills and ability to work independently with guidance from team leads
- Preferred Qualifications: Experience processing large-scale unstructured data, background in information retrieval and search systems