I’m using Grafana alerting with Loki as the data source.
In my LogQL query, I apply topk() over a metric extracted using unwrap.
Now I want to filter the result of topk to exclude any series where the label status = "start".
Is there a way to filter out those series after topk(), either in a Grafana alert rule or using a recording rule?
If so, how can I implement that?
My goal is to monitor the final state of containers and only alert on those that have ended with a DIE or OOM status. So I'm using this query to extract and filter only the relevant failure cases.
My LogQL metric query looks like this:
topk(1,
  last_over_time(
    {container_name=~"docker-events-logger.*"}
      | json
      | Type = `container`
      | status =~ `die|oom|start`
      | unwrap timeNano [123s]
  ) by (Actor_Attributes_name, Actor_Attributes_image, host_name, host_ip, status)
) by (Actor_Attributes_name, Actor_Attributes_image, host_name, host_ip)
A bit confused: normally if you want to filter something out you'd do it before any other calculation, to avoid performing unnecessary work. Is there any reason not to just remove start from the status matcher?
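If that's acceptable, a minimal sketch (untested) would simply drop start from the regex so those series never reach topk():

topk(1,
  last_over_time(
    {container_name=~"docker-events-logger.*"}
      | json
      | Type = `container`
      | status =~ `die|oom`
      | unwrap timeNano [123s]
  ) by (Actor_Attributes_name, Actor_Attributes_image, host_name, host_ip, status)
) by (Actor_Attributes_name, Actor_Attributes_image, host_name, host_ip)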
Thanks for the reply.
Actually I want to capture each Docker container's last status among DIE, START, and OOM.
by (Actor_Attributes_name, Actor_Attributes_image, host_name, host_ip, status)
→ the last event for each container and status
) by (Actor_Attributes_name, Actor_Attributes_image, host_name, host_ip)
→ the last status for each container
So overall the query does capture each container's last status, but I don't want to get an alert when that status is "start".
That's why I'm struggling to find a way to do this with a log query + metric query…
Your objective is to be alerted if your containers aren’t healthy, yes? If so, it may be a better idea to implement some sort of poke test, or monitor the containers directly via something like cAdvisor and alert from there.
If you have to do it from logs, then your only real option is to alert when a container is die or oom. With any container platform you should reasonably expect containers to restart themselves, which is to say start should always follow die or oom, so you can simply ignore the start status altogether (unless for some reason this isn't the case for you). You also shouldn't treat the start status as an indication that a container is healthy, because it could fail to actually start. This is why monitoring the containers directly is a better idea.
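For example, if you happen to scrape cAdvisor with Prometheus, a rough sketch of a direct check could look like this (container_last_seen is cAdvisor's standard metric; the my-app.* name pattern is just a placeholder for your own containers):

# fires for containers that cAdvisor hasn't seen for more than 60s
time() - container_last_seen{name=~"my-app.*"} > 60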
If you really want to do what you originally set out to do, then you can use last_over_time to determine the last/latest value of an entry. The problem is it only works with a metric value, so you’d have to first convert your status into some sort of number. Something like this (not tested):
sum by (Actor_Attributes_name, Actor_Attributes_image, host_name, host_ip) (
  last_over_time(
    {container_name=~"docker-events-logger.*"}
      | json
      | Type = `container`
      | status =~ `die|oom|start`
      | label_format status_code=`{{ if eq .status "die" }}1{{ else if eq .status "oom" }}2{{ else }}0{{ end }}`
      | unwrap status_code
      [123s]
  )
)
Formatting the label would be great, but I couldn't find a way to convert a label value conditionally within the metric query / log query.
Maybe I should add code to the container agent (promtail or cAdvisor) instead.