Overview
Baseline alerts use a moving average to establish a historical norm for a metric, then compare the current value against that baseline. If the current value deviates from the baseline by more than a configured percentage, the alert fires. This approach detects anomalies without requiring you to set a fixed numeric threshold.
Every baseline alert rule uses the Baseline Metric rule type. Baseline alerts are supported for Browser and APM products.
Available metrics
Browser baseline metrics
| Metric | Unit |
|---|---|
| Page Load Time | % |
| Page Rendering Time | % |
| Page View Throughput | % |
| JS Error Count | % |
| AJAX Total Time | % |
| AJAX Response Time | % |
| AJAX Callback Time | % |
| AJAX Throughput | % |
APM baseline metrics
| Metric | Unit |
|---|---|
| Web Response Time | % |
| Web Throughput | % |
| Exception Count | % |
| External Request Response Time | % |
| External Request Throughput | % |
| Database Query Time | % |
| Database Throughput | % |
All baseline metrics use % as the threshold unit because you are configuring the maximum allowed percentage deviation from the moving average. For example, a threshold of
20means the alert fires when the current value deviates by more than 20% from the baseline.
Targets
Baseline alert rules use appId for targeting, which scopes the rule to a specific project. The baseline and current values are computed for the selected project only.
Filters
Baseline rules inherit the standard filter mechanism. Filters narrow the data set before the baseline and current values are computed.
Group By
Group By is not typically used with baseline alerts. The rule evaluates the aggregate metric value for the targeted project.
Evaluation logic
Baseline evaluation runs in three steps:
1. Query historical and current values
The alerting engine queries ClickHouse for two values:
previousValue(baseline): The metric value over the baseline period, defined bybaselineMovingAvgDurationandbaselineMovingAvgDurationUnit.currentValue: The metric value over the current evaluation period.
2. Calculate percentage deviation
The engine computes how far the current value has deviated from the baseline:
percentage = (currentValue - previousValue) / previousValue * 100
A positive percentage means the current value is higher than the baseline. A negative percentage means it is lower.
3. Compare against threshold
The comparison depends on the operator:
| Operator | Triggers when | Use case |
|---|---|---|
above |
percentage >= threshold |
Current value exceeds baseline by at least the threshold percentage |
below |
percentage <= (-1 * threshold) |
Current value drops below baseline by at least the threshold percentage |
equal |
Either above or below condition is met |
Detect any significant deviation in either direction |
Configuration fields
| Field | Type | Description |
|---|---|---|
baselineMovingAvgDuration |
Number | How far back to look for the baseline value |
baselineMovingAvgDurationUnit |
String | Unit for the duration: minutes or hours |
operator |
String | Comparison direction: above, below, or equal |
threshold |
Number | Percentage deviation from baseline that triggers the alert |
Examples
Response time deviation alert
Detect when web response time spikes significantly compared to the recent historical average, which may indicate a performance regression.
| Field | Value |
|---|---|
| Product | APM |
| Metric | Web Response Time |
| Baseline duration | 2 hours |
| Operator | above |
| Threshold | 50 % (50% deviation above baseline) |
| Target | Production application appId |
This rule computes the average web response time over the past 2 hours as the baseline. If the current response time is 50% or more above that baseline, an incident is opened. For example, if the 2-hour baseline is 200 ms and the current value is 320 ms, the deviation is 60%, which exceeds the 50% threshold.
Throughput drop detection
Alert when page view throughput drops significantly, which may indicate an outage or routing issue preventing users from reaching the application.
| Field | Value |
|---|---|
| Product | Browser |
| Metric | Page View Throughput |
| Baseline duration | 1 hour |
| Operator | below |
| Threshold | 30 % (30% deviation below baseline) |
| Target | Production application appId |
This rule computes the average page view throughput over the past hour. If the current throughput drops to 30% or more below that baseline, an incident is opened. For example, if the 1-hour baseline is 1,000 page views per minute and the current value is 650, the deviation is -35%, which exceeds the -30% threshold.
+1-415-800-4104