Alert keeps firing even after the issue has been resolved

The app is running on Kubernetes as a StatefulSet, and the issue was resolved by a new instance. Even though the alert’s conditions are no longer met (or at least should not be), the alert is still firing, presumably because it’s set to "noDataState": "Alerting" and the old instance is no longer reporting any data (that’s the only explanation I can think of). I’ve attached the panel screenshot and the alert’s config.

Can the alert’s config be improved, or is this a genuine bug? I’d like to keep the "noDataState": "Alerting" setting.


Zoomed in to confirm that the metric value is indeed > 0:

(The app has a low load and the graph is squished in the first screenshot, as I wanted to capture everything in one image.)

modify-export-Infra-eeewt539dnym8f-1744799754862.json (1.6 KB)
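Roughly, the relevant parts of the export (field names follow Grafana’s rule export format; apart from the 6h time range, the metric, and noDataState, the values here are placeholders):

{
  "title": "Docs indexed",
  "condition": "C",
  "data": [
    {
      "refId": "A",
      "relativeTimeRange": { "from": 21600, "to": 0 },
      "model": { "expr": "rate(docs_indexed{service=\"app\",environment=\"production\"}[5m])" }
    }
  ],
  "noDataState": "Alerting",
  "for": "5m"
}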

Related: Grafana sends the firing status although the alert is resolved

Resolved by changing the ‘Time range’ from 6h to 2h ("relativeTimeRange" in JSON). 6h was the default inherited from the panel this alert belongs to.
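In the exported JSON the change boils down to the query’s relativeTimeRange (values are in seconds); before, with the 6h panel default:

"relativeTimeRange": { "from": 21600, "to": 0 }

and after, with 2h:

"relativeTimeRange": { "from": 7200, "to": 0 }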

This being “solved”, a follow-up question: to avoid false alarms, what should an alert’s time range be set to, especially in environments such as Kubernetes where targets come and go? Should it simply always equal the evaluation period (plus some buffer)?

Make sure that the alert query returns a single time series. Right now you have two time series (they have different instance labels).


Thanks! 🙏 Would this be the correct query?

sum (
  rate(docs_indexed{service="app",environment="production"}[5m])
)

I could sum by (instance), but this would still alert for each pod separately (there’s pod="app-0", pod="app-1", etc. in the labels).