Attentive is looking for a Senior Software Engineer to join their Machine Learning Platform (MLOps) team to build and maintain a foundational ML platform that enables Attentive's AI product suite to train, inference, and deploy ML models with higher velocity and performance, while maintaining reliability.
Requirements
- You have been working in the areas of ML Platform / MLOps / Platform Engineering / DevOps / Infrastructure for 5+ years, and have an understanding of gold standard practices and best in class tooling for ML
- You understand the key differences between online and offline ML inference and can voice the critical elements to be successful with each to meet business needs
- You understand the importance of CI/CD in building high-performing teams and have worked with tools like Jenkins, CircleCI, Argo Workflows, and ArgoCD
- You are passionate about observability and worked with tools such as Splunk, Nagios, Sensu, Datadog, New Relic
- You have worked with tools like Ray, MLFlow, Metaflow, Argo, and Spark
- You have experience with Kubernetes
- You have experience with CI/CD
Responsibilities
- Expand, mature, and optimize our ML platform built around cutting edge tooling like Ray, MLFlow, Metaflow, Argo, and Spark to support traditional, deep learning, and reinforcement learning ML models
- Build and mature capabilities to support CPU / GPU clusters, model performance monitoring, drift detection, automated roll-outs, and improved developer experience
- Build, operate, and maintain a low-latency, high volume ML serving layer covering both online and batch inference use cases
- Orchestrate Kubernetes and ML training / inference infrastructure exposed as an ML platform
- Expose and manage environments, interfaces, and workflows to enable ML engineers to develop, build, and test ML models and services
- Design and implement an online inference pipeline with champion/challenger shadow model testing
- Scale real-time feature streaming use cases to handle low-latency, high-volume RL use cases
Other
- Self-motivated, highly driven
- Passion is exposing platform capabilities through interfaces that enable high performance ML practices, rather than designing ML experiments
- Competitive perks and benefits
- The US base salary range for this full-time position is $170,000 - $230,000 annually + equity + benefits
- Equity is a substantial part of the total compensation package