HamburgerMenu
iimjobs

Posted by

Job Views:  
200
Applications:  104
Recruiter Actions:  0

Posted in

IT & Systems

Job Code

1668824

Solarwinds - Senior Product Manager - Intelligent Events/Health/SLO

Solarwinds India Pvt Ltd.5 - 7 yrs.Bangalore
Posted 2 months ago
Posted 2 months ago

Description:

Job Title

Senior Product Manager Intelligent Events, Health & SLOs

Role Summary

The Senior Product Manager Intelligent Events, Health & SLOs will own the product strategy for defining, measuring, and communicating the reliability and operational health of customer systems within the SolarWinds platform. This role is central to transforming raw observability data into intelligent, actionable events and proactive health insights that enable effective Service Level Objective (SLO) management and incident prevention.

The role will drive the vision for intelligent event correlation, anomaly detection, and system health scoringhelping customers transition from reactive monitoring to proactive reliability management. Success will be measured through adoption of SLO capabilities, reduction in alert noise, improved system health visibility, and meaningful reductions in customer Mean Time to Repair (MTTR).

Key Responsibilities

Product Strategy & Roadmap

- Define and own the product vision, strategy, and roadmap for Service Level Indicators (SLIs) and Service Level Objectives (SLOs).

- Ensure seamless SLO configuration, tracking, alerting, and reporting experiences for enterprise customers.

- Translate customer reliability goals and SRE best practices into scalable, intuitive product features.

Intelligent Event Management

- Lead the evolution of the event management platform, focusing on advanced event correlation, noise reduction, and root cause analysis.

- Drive initiatives that significantly reduce alert fatigue while increasing signal quality and operational confidence.

- Define proprietary approaches to intelligent alert grouping, prioritization, and suppression.

System Health & Risk Scoring

- Establish a unified, platform-wide health and risk scoring framework that synthesizes metrics, logs, and traces into a single, actionable system health view.

- Ensure health scores are transparent, explainable, and trusted by technical and business stakeholders alike.

AI & Anomaly Detection

- Partner closely with engineering and machine learning teams to integrate AI-driven anomaly detection and predictive alerting capabilities.

- Enable proactive identification of emerging issues before they impact customer SLAs or SLOs.

- Champion data-driven experimentation and continuous improvement of detection models.

Cross-Functional Leadership

- Act as the primary voice of the Site Reliability Engineering (SRE) persona across product, engineering, design, and go-to-market teams.

- Drive alignment across stakeholders to deliver high-impact, customer-centric outcomes.

- Communicate product vision, strategy, and outcomes clearly to internal and external audiences.

Required Qualifications

- Deep domain expertise in AIOps, event management, and Site Reliability Engineering (SRE) practices, with hands-on experience defining and launching SLI/SLO-based products.

- 5+ years of product management experience building technical platforms that leverage data science, machine learning, or statistical models for event correlation and anomaly detection.

- Strong working knowledge of observability data (metrics, logs, traces) and how it is used to measure system health and automate operational workflows.

- Proven ability to deliver products that materially improve operational efficiency, reduce alert noise, and lower MTTR for customers.

- Exceptional written and verbal communication skills with the ability to influence senior stakeholders and lead cross-functional teams.

- Strong foundation in product management fundamentals, including roadmap planning, customer discovery, prioritization, and outcome-based delivery.


Didn’t find the job appropriate? Report this Job

Similar jobs that you might be interested in

Posted by

Job Views:  
200
Applications:  104
Recruiter Actions:  0

Posted in

IT & Systems

Job Code

1668824