LogQL queries to identify "log spam" -- frequently repeating log line causing logfile growth

nparrish42 · October 20, 2022, 5:41pm

I am monitoring the size of all logfiles I’m sending to loki using a very simple shell script and Textfile collector along the lines suggested by Monitoring directory sizes with the Textfile Collector – Robust Perception | Prometheus Monitoring Experts

so now I can pretty quickly identify surprising log growth with a panel like this:

next I would like to be able to find examples of the log lines responsible for this growth.
I fear this is something I can’t really express in LogQL – “give me the most common log line in this stream – allowing for minor variations, e.g. some integer/hex changing.”

to make this concrete, here’s what’s responsible for the spike above:

[2022-10-20 04:05:56,636] INFO [source_20_34_shard_002|task-0] Streaming requested from LSN LSN{3B25/C54273D0}, received LSN LSN{3B25/C5463900} identified as already processed (io.debezium.connector.postgresql.connection.AbstractMessageDecoder:45)
[2022-10-20 04:05:56,636] INFO [source_20_34_shard_002|task-0] Streaming requested from LSN LSN{3B25/C54273D0}, received LSN LSN{3B25/C5463A40} identified as already processed (io.debezium.connector.postgresql.connection.AbstractMessageDecoder:45)
[2022-10-20 04:05:56,636] INFO [source_20_34_shard_002|task-0] Streaming requested from LSN LSN{3B25/C54273D0}, received LSN LSN{3B25/C5463AF0} identified as already processed (io.debezium.connector.postgresql.connection.AbstractMessageDecoder:45)

I can think of ways to do this programmatically outside of loki/grafana, like simple sampling of log lines, light parsing to recognize lines identical but for one or two fields. but I can’t see how that fits into the loki framework…

thanks in advance for your ideas!

system · October 20, 2023, 5:42pm

This topic was automatically closed 365 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Missing log lines when logging identical lines at the same time Grafana Loki loki	1	143	May 17, 2025
Counting comma separated list lines Grafana Loki	2	88	February 9, 2026
Get count of each log contents Grafana Loki loki , promql , grafana	2	5597	April 4, 2024
How to use pipeline in promtail/loki to count specific loglines (noob question) Grafana Loki loki	1	880	September 17, 2020
Duplicate log lines Grafana Loki loki	3	2984	July 28, 2021

LogQL queries to identify "log spam" -- frequently repeating log line causing logfile growth

Related topics