CPU and memory usage of Grafana surge after adding more alerts

Hi there, we’re using Grafana v10.0.2 (b2bbe10fbc) with unified alerting, running 2 pods on our Kubernetes cluster. Grafana started struggling with CPU and memory after we added 500+ alert rules. Scaling out to more replicas doesn’t help, so our only option was to increase the memory limit.

Normally CPU usage was about 0.2 cores and memory around 200 MiB, but after adding the alert rules, CPU usage is around 3 cores and memory usage is around 1.2 GiB.

Based on this code, I suppose each Grafana instance fetches all alert rules from the DB and evaluates them in parallel, so adding instances wouldn’t help? I’m wondering if this is the expected resource usage for unified alerting. Is scaling up the pod our only option?
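For context, a few settings in the `[unified_alerting]` section of `grafana.ini` influence how much work the scheduler does per instance (the values below are illustrative, not recommendations; check the defaults for your Grafana version):

```ini
[unified_alerting]
# Minimum allowed evaluation interval for rules; a larger value
# caps how frequently any rule can be evaluated.
min_interval = 1m

# Abort a single rule evaluation if it runs longer than this.
evaluation_timeout = 30s

# Number of times an evaluation is attempted before giving up.
max_attempts = 1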

Are you using SQLite as Grafana’s database?

No, we’re actually using Postgres.