Grafana Alerting Scalability v8.0.0+

  • What Grafana version and what operating system are you using?
    v8.0.0 and above

  • What are you trying to achieve?

My team would like to use Grafana native alerting in the latest vanilla Grafana image (so this may be 9.x going forward), and we want to know whether any stress-testing or benchmarking data is available. We’ve searched online and asked an account rep, but haven’t been able to find any information on, for example, how many alerts it can handle.
We have nearly 25,000 alerts in production, and this number will likely grow. Do you think native alerting will work well at this scale, or should we expect to run into issues?

Thank you!

I’d like to add to this and ask about alerting scale-out plans in Grafana, that is, sharding the alerting workload across multiple instances instead of executing all alerts on every instance. Is that a planned feature?
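
To make the sharding idea concrete, here is a minimal sketch of one common way to split rule evaluation across instances: hash each rule's identifier and take it modulo the instance count. This is only an illustration of the concept being asked about, not a description of anything Grafana implements; the function names, the `rule-N` identifiers, and the instance count of 4 are all hypothetical.

```python
import hashlib


def shard_for_rule(rule_uid: str, num_instances: int) -> int:
    """Map a rule identifier to an instance index using a stable hash.

    Hypothetical helper: sha256 is used because Python's built-in hash()
    is randomized per process and would not agree across instances.
    """
    if num_instances <= 0:
        raise ValueError("num_instances must be positive")
    digest = hashlib.sha256(rule_uid.encode("utf-8")).digest()
    # Use the first 8 bytes of the digest as an unsigned integer.
    return int.from_bytes(digest[:8], "big") % num_instances


def rules_for_instance(rule_uids, instance_index, num_instances):
    """Return the subset of rules that a given instance would evaluate."""
    return [
        uid for uid in rule_uids
        if shard_for_rule(uid, num_instances) == instance_index
    ]


if __name__ == "__main__":
    # Assumed numbers: 25,000 rules (as in the question) over 4 instances.
    rule_uids = [f"rule-{i}" for i in range(25_000)]
    num_instances = 4
    for idx in range(num_instances):
        owned = rules_for_instance(rule_uids, idx, num_instances)
        print(f"instance {idx}: {len(owned)} rules")
```

With this scheme every rule is owned by exactly one instance, so each instance evaluates roughly 1/N of the rules. The trade-off is that plain modulo hashing reassigns most rules whenever the instance count changes; consistent hashing is the usual way to reduce that churn.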

Bump… I’m currently looking to scale alerts as well. I’m using Grafana-managed alerts with InfluxDB. The options I’m weighing are Grafana Mimir (I would have to rewrite/downsample InfluxDB points to Prometheus) or InfluxDB Tasks.
