ZoomInfo is seeking a Senior Data Engineer to design and expand enterprise-level data infrastructure that enables internal teams to interact with data comprehensively through various platforms, including AI-powered systems like LLMs.
Requirements
- Expert-level SQL for building performant, scalable queries and transformations on massive datasets.
- Strong Python programming skills with a focus on distributed computing, data manipulation, and building robust APIs.
- Production-level experience for large-scale batch and streaming data processing.
- Hands-on experience with DBT (Data Build Tool) for advanced data modeling and transformations in a modern data stack.
- Deep knowledge of Snowflake data warehouse design, optimization, and cost modeling.
- Experience implementing Model Context Protocol (MCP) or similar architectures to feed structured and unstructured data into LLM-powered systems.
- Strong understanding of data architecture concepts including data lakes, event-driven architectures (e.g., Kafka), ETL/ELT, and data mesh.
Responsibilities
- Design, develop, and maintain high-performance, product-centric data pipelines using Airflow, DBT, and Python.
- Architect and optimize the massive-scale data warehouse and lakehouse that serves as our single source of truth for all customer data, primarily using Snowflake.
- Lead the integration of diverse structured and unstructured data sources (e.g., web data, third-party APIs) into our data ecosystem, ensuring high-quality and reliable ingestion.
- Implement and enforce Model Context Protocol (MCP) or similar architectures to feed accurate and contextual data into our LLM-powered products for applications like Retrieval Augmented Generation (RAG) and advanced search.
- Define, monitor, and enforce data quality SLAs across all pipelines and products, ensuring data accuracy and lineage are a top priority.
- Mentor and coach junior engineers, promoting best practices in code quality, data architecture, and operational excellence.
- Participate in architectural decisions and long-term strategy planning for our enterprise-wide data infrastructure, with a focus on cost, performance, and reliability.
Other
- Excellent communication skills – ability to explain complex technical concepts to both engineering teams and non-technical stakeholders.
- Strategic & Product-Oriented Thinking – can translate business objectives and customer needs into scalable, high-impact data solutions.
- Leadership & Mentorship – experience guiding and uplifting engineering teams to achieve their full potential.
- Stakeholder Management – able to collaborate effectively across departments (Product, Engineering, Sales, Compliance).
- Agility & Adaptability – thrives in ambiguous, evolving environments and can rapidly prototype and iterate on solutions.