PitchBook is looking to deliver AI-powered features that extract meaningful insights from its vast datasets, including reports, news, and other textual content. The goal is to enhance customer value by improving the speed, discoverability, quality, and quantity of insights available on the platform.
Requirements
- Demonstrated expertise in natural language processing (NLP) and machine learning, including hands-on experience with classifiers, transformer models, large language models (LLMs), and widely used ML and data science libraries such as scikit-learn, pandas, numpy, TensorFlow, and PyTorch
- Experience delivering production-grade GenAI or LLM-based systems with measurable business impact
- Familiarity with the LangChain ecosystem, including tools such as LangSmith and LangGraph, and experience using these tools in production environments are a strong plus
- Deep proficiency in building and maintaining scalable data pipelines and distributed systems using technologies such as Apache Kafka, Airflow, and cloud data platforms like Snowflake
- Strong programming skills in Python and SQL, with working knowledge of additional languages such as Java or Scala considered a plus
- Practical experience with cloud-native development, containerization, and orchestration technologies such as Docker and Kubernetes
- Demonstrated ability to solve complex technical problems, contribute to architectural decisions, and deliver high-performance, reliable solutions
Responsibilities
- Deliver high-impact AI and ML capabilities that drive insight generation on the PitchBook Platform. Ensure your work contributes to broader business goals and is aligned with the team's strategic priorities
- Provide hands-on expertise in designing, building, and deploying AI/ML models and services with a focus on NLP, summarization, semantic search, classification, and prediction. Contribute to the development of scalable, high-performance systems that meet production-grade reliability and efficiency standards
- Support a culture of technical excellence by mentoring peers, sharing knowledge, and participating in code and design reviews. Promote innovation and continuous improvement through collaborative engineering practices
- Build and optimize models that leverage classifiers, transformers, LLMs, and other NLP techniques to generate meaningful insights from structured and unstructured data. Integrate these models into the broader AI/ML infrastructure in collaboration with partner teams
- Collaborate with engineering, product management, and data collection teams to ensure models are informed by high-quality data and support strategic product goals
- Explore and experiment with emerging technologies, methodologies, and tools in the fields of GenAI, NLP, and search. Translate research findings into practical solutions that enhance PitchBook’s AI capabilities
- Contribute to best practices in model transparency, monitoring, evaluation, and compliance. Help maintain high standards of security, data integrity, and responsible AI use across your projects
Other
- Must be authorized to work in the United States without the need for visa sponsorship now or in the future
- The job conditions for this position are those of a standard office setting. Employees in this position use a PC and phone on an ongoing basis throughout the day. Limited corporate travel to remote offices or other business meetings and events may be required.
- This role is expected to be in the office 5 days a week.
- Excellent communication and collaboration skills, with experience working cross-functionally with product managers, engineers, and data scientists in globally distributed teams
- Experience working in fast-paced, data-driven environments. Prior exposure to fintech or financial data platforms is a strong advantage