ServiceNow's PLATO organization is looking to build an AI and Machine Learning (ML) platform to transform user experience and workflow efficiency for enterprise services, requiring high-performance inferencing capabilities for global enterprise customers.
Requirements
- Experience in leveraging or critically thinking about how to integrate AI into work processes, decision-making, or problem-solving.
- Low Latency Optimization: Experience in optimizing models for low latency inference, important for real-time applications.
- High Throughput Optimization: Knowledge of maximizing inference throughput.
- Real-time Systems: Understanding the constraints of real-time systems on model inference.
- Model Quantization and Compression: Practical experience in reducing model size and computational cost.
- Proficient in prompt engineering and developing LLM based features
- Experience in using AI productivity tools such as Cursor, Windsurf, etc
- Proficiency in Python and Golang, with a strong grasp of software engineering principles.
- Hands-on experience with prompt engineering: ability to craft, test, and optimize prompts for task accuracy and efficiency.
- Knowledge of unit testing, profiling, and code tuning
Responsibilities
- Utilize your expertise in Python and Golang to develop high-performance components of the AI Platform.
- Collaborate with cross-functional teams to integrate AI capabilities seamlessly into workflows and user experiences.
- Ensure reliability and performance of AI models by applying best practices in software engineering and AI inferencing.
- Stay ahead of the curve by quickly learning emerging technologies and applying them to enhance the AI Platform.
Other
- This Role is based in our Santa Clara office and requires two days in the office
- Minimum 5 years of experience working in Software Development role.
- Demonstrated ability to thrive in fast-paced, dynamic environments.
- All qualified applicants will receive consideration for employment without regard to race, color, creed, religion, sex, sexual orientation, national origin or nationality, ancestry, age, disability, gender identity or expression, marital status, veteran status, or any other category protected by law.
- For positions requiring access to controlled technology subject to export control regulations, including the U.S. Export Administration Regulations (EAR), ServiceNow may be required to obtain export control approval from government authorities for certain individuals.