Grafana Alerting issue in HA mode when Alert gets updated

xakraz · January 9, 2025, 4:50pm

Hi there

I have a question regarding Grafana Alerting behavior in HA mode

Nominal usage

we have 2 Grafana instances with their Alert-managers peered
An alert goes into the “firing” state
- We can see in the Alert History State transition related to both instances
- In our ContactPoint backend, we see only 1 Alert, so Alert-managers deduplication works as expected

→

Issue

However, when we update the Alert definition (from 1 Grafana instance given the load-balancer),

The Alert state of this instance is reset to the “Normal” state:
The alert in our ContactPoint backend gets closed:
BUT a new Alert with the same name is created in our ContactPoint backend again a few seconds/minutes later…

Hypothesis

The second Grafana instance still has the previous Alert definition and creates a new Alert in our ContactPoint backend …
Then, at the next evaluation interval, the Grafana has the new Alert definition and sends a “Close” event as expected

The issue is that the evaluation interval can be several hours depending on the Alert

→ This means the team gets notified 2x and the alert stays opened (in fact gets re-created) during the whole Evaluation interval despite having been updated …

Questions

Does anyone face the same issue?
If so, have you done something to address it?

Many thanks for your feedback

jangaraj · January 9, 2025, 5:27pm

Did you verify ha setup?

xakraz · January 10, 2025, 8:48pm

Hello @jangaraj ,

Yes we did (or did I miss something?) and the HA regarding alerting works, as detailed in the initial post

The issue we faced is when an alert definition gets updated

xakraz · February 6, 2025, 9:33am

Hello there

Does anyone have a suggestion about that?

Would adding an external Alert-Manager help?
(As mentioned here: Configure high availability | Grafana documentation)

yuriy.tseretyan · February 7, 2025, 8:29pm

resetting all alert instances to Normal is expected after you changed the rule. In other words, if a rule that has instances in “Alerting” state gets updated, all those instances are resolved. Then at the next evaluation, the result produces instances in Alerting state, and you get the notification about it again.

Depending on what version you use, there can be fields that are ignored, i.e. any changes of those fields do not cause the state to be reset.

Topic		Replies	Views
Grafana alerting ha duplicate alert history Alerting	5	1286	June 7, 2023
Slack notifications spamming for 1 and only alert 🤔 Grafana Alerting in HA mode? Alerting ha-grafana , slack	0	71	November 21, 2024
Control alert in grafana HA Grafana	2	343	May 5, 2022
Non-alerting Grafanas wont update status of the alerts Alerting alerting , grafana	1	210	October 23, 2024
Alerts/notifications are not deduplicated when using HA unified alerting Configuration ha-grafana , unified-alerting	3	2902	April 14, 2023

Grafana Alerting issue in HA mode when Alert gets updated

Nominal usage

Issue

Hypothesis

Questions

Related topics