ServiceNow is looking to build Small Language Models (SLMs), open-weight models like Apriel, and traditional machine learning algorithms and pipelines to power their AI-enhanced technology offerings and deliver powerful AI capabilities to their customers.
Requirements
- Deep technical expertise in modern machine learning and deep learning, including a strong understanding of LLM architectures, training paradigms (e.g., pre-training, supervised fine-tuning), and evaluation methodologies.
- Proven track record of building and deploying production-level AI models at scale.
- Hands-on experience with major deep learning frameworks like PyTorch or TensorFlow
Responsibilities
- Lead and scale a team of talented researchers, applied researchers and machine learning engineers.
- Define the strategy and roadmap for model training and evaluation, ensuring we deliver high-quality, performant, and reliable models.
- Drive innovation in training techniques, including developing new methods for fine-tuning, knowledge distillation, and reinforcement learning.
- Establish best practices for model evaluation, including creating robust benchmarks, metrics, and frameworks to ensure model quality and integrity.
- Collaborate with cross-functional teams across product, engineering, and research to translate business needs into technical requirements and deliver impactful solutions.
- Stay ahead of the curve by researching and implementing the latest advancements in large language models and AI research.
Other
- Experience in leveraging or critically thinking about how to integrate AI into work processes, decision-making, or problem-solving.
- Extensive experience in leading and managing high-performing machine learning or AI teams, with a focus on model training and evaluation.
- Strong leadership and management skills with the ability to inspire and grow a world-class team, with extensive experience managing large organizations of 50 or more engineers and researchers.
- Excellent communication and collaboration skills to work effectively with stakeholders at all levels, from technical teams to senior executives.
- A Ph.D. or Master's degree in Computer Science, Machine Learning, or a related field is a plus.