Grafana Mimir/Prometheus long range sum

mihalyrprospect · January 16, 2025, 5:37pm

On Grafana Cloud, I have a Stat panel with the following Prometheus Instant query:

sum_over_time(sum(increase(processed_task_seconds_count{environment=~"$environment", instance=~"$instance"}[$__rate_interval]))[$__range:$__rate_interval])

The Stat panel then takes the Last* value to display the number of invocations of a processed_task method over the selected range.

This works fine up to 32 day range. But I needed to display the value for the previous fiscal quarter in which case Grafana throws an error and shows No data.

The error is the following:

execution: the query time range exceeds the limit (query length: 768h16m3.934s, limit: 768h0m0s) (err-mimir-max-query-length). To adjust the related per-tenant limit, configure -querier.max-partial-query-length, or contact your service administrator.

I realized that the problem is passing a too long $__range value to the sum_over_time function. So, I tried an alternative approach, instead of using an Instant query, I used a Range query and configured the Stat panel to show me the Total value instead of Last*. In theory, this should have worked, but the returned value is completely wrong.

Here is my alternative Range query:

sum(increase(processed_task_seconds_count{environment=~"$environment", instance=~"$instance"}[$__rate_interval]))

This does not give me the err-mimir-max-query-length error, but the Total calculated from this by the Stat panel is far from reality.

When comparing the the two over a shorter interval of 1 day, where both work, here are the results:

Instant sum_over_time = 482K
Range with Total calculated by the panel = 1.93M

For short ranges, the Instant sum_over_time works well. But once in a while I need to look beyond 32 days and it seems I am left with either a broken panel or a panel that shows totally unrelated numbers, which is arguably even worse than the broken panel.

I am fairly new to Prometheus, any advice is appreciated.

davidellis · April 7, 2025, 8:28pm

@mihalyrprospect , try using $__interval instead of $__rate_interval in your alternative query. Your results show that you were getting exactly 4x the expected result, which makes sense, since $__rate_interval == 4 * $__interval

From Prometheus template variables | Grafana documentation
“”" The value of $__rate_interval is defined as max($__interval + Scrape interval, 4 * Scrape interval) “”"

Topic		Replies	Views
Issue with Prometheus Query in Grafana Dashboards templating	1	179	July 9, 2024
Displaying Mimir metrics with $__rate_interval always shows "No data" Time Series Panel	1	2032	June 8, 2023
Displaying results of increase() for every time window	4	21658	February 15, 2022
[SOLVED] "This week" timerange strange behavior ($__range) with slow counter Stat Panel	1	1363	December 14, 2018
Facing an issue with time ranges in queries Dashboards loki	0	300	August 15, 2023

Grafana Mimir/Prometheus long range sum

Related topics