Transform how MetLife leverages observability, ITSM, and business data to drive predictive insights, enable autonomous operations, and elevate the experience of technology and business partners.
Requirements
- Proven experience in data science, machine learning, or applied statistics, preferably in IT operations or infrastructure domains.
- Strong proficiency in Python, R, or similar languages, and experience with data platforms (e.g., Spark, Databricks, Snowflake).
- Familiarity with observability tools (e.g., Elastic, NexThink, Prometheus, Grafana) and ITSM platforms (e.g., ServiceNow).
- Experience with cloud platforms (AWS, Azure, GCP) and infrastructure automation tools (e.g., Ansible, Terraform).
Responsibilities
- Develop and deploy predictive models using observability, ITSM, and business data to proactively identify and resolve service issues.
- Partner with engineering and operations teams to embed AI/ML capabilities into our AIOps and automation platforms.
- Architect and maintain data pipelines that feed into our experience health data lake, enabling anomaly detection, pattern recognition, and real-time insights.
- Advance our journey toward autonomous infrastructure by contributing to self-healing and event-driven automation initiatives.
- Collaborate with stakeholders to define and track metrics that measure resiliency, reliability, and service health.
- Support the evolution of our governance and analytics capabilities from manual to AI-enabled, with a focus on data quality and prediction accuracy.
Other
- Strong problem-solving skills with the ability to drive a project to conclusion.
- Excellent communication skills and the ability to translate complex data into actionable insights.
- The expected salary range for this position is $100,000 - $115,000.