Grafana docker container has CPU spikes and failes to load huge dashboards from multiple datasources

malinamihuc · December 15, 2025, 10:29am

Hello,

We are running in a clustered environment with grafana and prometheus.

Lately we have some issues with huge dashboards which are not loading (only in a few minutes sometimes) and in cadvisor we see grafana container has spikes in cpu over 100%. But the actual server doesn’t struggle with cpu saturation.

In the logs, we see these errors:

Failed creating data source proxy error="validation of data source URL \"\" failed: empty URL string" traceID=

level=error msg="Request Completed" method=GET path=/api/datasources/proxy/uid/...../api/v1/status/buildinfo status=500 remote_addr=ip time_ms=1 duration=1.923898ms size=61 referer="https://url/dashboard?from=now-1m&orgId=1&to=now" handler=/api/datasources/proxy/uid/:uid/*

This loading issues are not common on all dashboards.

What can we do to debug or increase performance?

Grafana version: v10.0.3 (eb8dd72637)

Thank you!

jangaraj · December 15, 2025, 11:55am

Why it is a huge dashboard?

Problem is usually ineficient panel query - e.g. query a lot of data into Grafana, chew it and then visualize result as a single number. So what your queries are doing?

malinamihuc · December 15, 2025, 12:08pm

I wouldn’t say it contains complex queries, I am mostly displaying count of a query and I am querying over the last minute and no refresh - loads in 3-4minutes / it looks like in tries multiple times to display the values in the dashboard and only after a while it succeeds. Otherwise, it never loads.

jangaraj · December 15, 2025, 12:11pm

Be exact, pls: Datasource type and queries.

malinamihuc · December 15, 2025, 12:16pm

datasource type: prometheus → dashboard loads data from 3 DS.

Example of queries:

count(sum by (cluster) (up{job=”sso-exporter", env=“prd”,cluster != “”}) == 0 ) or on() vector(0)
max by (domain) (abs(agroal_active_count{env=“prd”,job=“quarkus-exporter”, domain=“domain”}))
count by () (

(max by (application) (ibm_mq_queue_manager_status{env=“prd”}) != 1)

or

(max by (application) (up{job=“IBMMQ”,env=“prd”}) != 1)

)

or on() vector(0)
i have 17 vizualizations which contain queries similar to the ones above

jangaraj · December 15, 2025, 12:21pm

Queries don’t look heavy. Any transformations? Dashboard autorefresh? How your Prometheus behave when there is 17 parallel queries - enable lazy loading (you have quite old version, so check doc how)?

yosiasz · December 16, 2025, 12:11am

also what backend is grafana using? sqlite? mysql?

malinamihuc · December 16, 2025, 7:01am

We have mariadb with galera for clustering for backend.

malinamihuc · December 16, 2025, 7:10am

I have reduce transformation on almost all visuzlizations (Reduce → Series to Rows / Calculations Total).

I set dashboard autorefresh to OFF, otherwise it doesn’t load…

malinamihuc · December 17, 2025, 7:39am

we noticed the issue in the dashboards was caused by multiple panels which filter alerts - alert list type of panel. But we still encounter slowness sometimes, on some dashboards, event after removing some of the alert list panels. What performance issues can cause the Alert List panels?

Topic		Replies	Views
I am unable to reduce the load of Grafana Alerting	13	1410	June 28, 2024
Grafana not responding properly	9	6611	January 13, 2021
Grafana dashboard taking more time to load Prometheus	0	720	July 13, 2020
Issue with Grafana's dashboard not loading properly Dashboards	5	5004	May 26, 2023
Grafana Dashboard displaying empty graphs, not data coming Configuration	8	25624	February 2, 2024

Grafana docker container has CPU spikes and failes to load huge dashboards from multiple datasources

Related topics