Senior Product Manager - Intelligent Events, Health, and SLOs
Description:
Role Summary:
This Senior Product Manager will own the strategy for how SolarWinds defines, measures, and communicates the reliability and operational health of customer systems.
The core focus is transforming raw observability data into actionable, intelligent events and proactive health scores, directly enabling customer Service Level Objective (SLO) management and incident reduction.
This role drives the product vision for event correlation and anomaly detection, ensuring our platform moves customers from reactive monitoring to proactive incident prevention.
Success will be measured by the adoption of SLO features, the reduction in alert volume for customers, and the quality of system health scoring.
Core Responsibilities:
- Define the product strategy and roadmap for Service Level Indicators (SLIs) and Service Level Objectives (SLOs), ensuring seamless configuration, tracking, and reporting for our enterprise users.
- Lead the development of the event management platform, focusing on advanced correlation, root cause analysis, and proprietary noise reduction techniques to combat alert fatigue.
- Establish a unified, platform-wide health and risk scoring model that synthesizes metrics, logs, and traces into a single, understandable view of system wellness.
- Collaborate with machine learning and engineering teams to integrate AI-driven anomaly detection and predictive alerting features for the proactive identification of impending issues.
- Act as the voice of the Site Reliability Engineering (SRE) persona, translating industry best practices into powerful, simple-to-use product features.
Required Qualifications:
- Deep domain expertise in AIOps, event management, and Site Reliability Engineering (SRE) practices, with specific experience in defining and launching SLO/SLI features.
- 5+ years of experience in product management of technical platforms that leverage data science, machine learning, or statistical models for event correlation and anomaly detection.
- Strong working knowledge of how observability data (metrics, logs, traces) is used to calculate system health and drive automated workflows.
- Impeccable oral and written communication skills, exceptional leadership capabilities to drive cross-functional alignment, and proven expertise in product strategy and product management fundamentals.
- Demonstrated ability to deliver features that fundamentally improve a customer's operational efficiency and reduce Mean Time to Repair (MTTR).