Zillow is looking to transform the real estate industry by developing advanced agentic customer experiences using AI technologies to help millions of people find their next home.
Requirements
- Expertise in agentic AI and in the pretraining, fine-tuning, and reinforcement learning of large language models (LLMs).
- Experience deploying and scaling AI services capable of handling hundreds of millions of daily interactions with high availability, low latency, and robust fault tolerance.
- Experience with frontier multimodal LLMs.
- Experience with agent-based systems, multi-agent collaboration, or similar paradigms.
- Experience with LangGraph or similar frameworks.
- Experience with data pipelines and dataset management.
- Experience with evaluation frameworks and metrics for model quality.
Responsibilities
- Build and maintain data pipelines for LLM training and evaluation, curate user-understanding signals (such as intents, preferences, and behavioral features), and ensure data quality, privacy, and proper dataset management.
- Develop and manage labeling and feedback loops, including heuristics, annotation jobs, and prompt-based labeling, to create high-quality corpora, collaborating with Data Engineering and Applied Science partners to improve data coverage and reduce noise.
- Design, prototype, and ship agentic AI solutions to production, including multi-agent systems built with frameworks like LangGraph, and implement context-aware features in partnership with senior engineers.
- Implement an evaluation framework to measure model quality on offline test sets (accuracy, bias, safety, user-intent coverage), and build dashboards to track improvements over time.
- Lead and contribute to experimentation by implementing metrics, A/B tests, and monitoring, helping to harden prototypes for reliable rollouts.
- Collaborate with senior engineers and cross-functional partners to select the right technologies, participate in code reviews, and share best practices (including mentoring interns or new hires as needed).
- Summarize research findings and model evaluations into clear write-ups and demos for the team and cross-functional stakeholders.
Other
- A master’s degree or above in Computer Science, Electrical Engineering, or a related field, or equivalent experience.
- 3+ years of hands-on experience building large-scale, high-impact solutions.
- Ability to work remotely and collaborate with cross-functional partners.
- Strong communication and collaboration skills.
- Ability to summarize research findings and model evaluations into clear write-ups and demos.