How do I count lines by labels?

hiltonadmin · December 16, 2023, 2:39am

I have an apache log and I want to know which urls are requested most often. So I already have the following

{filename="/var/log/apache2/other_vhosts_access.log"} | regexp `(?P<domain>[^:]+).* "(?P<method>[A-Z]+) (?P<url>(/[^/? ]+){0,2}).*`

which is working fine (the labels have the right values)
I now want the equivilant of the SQL Query select count(*) from ... group by url, domain, method;
I tried the following:

count by(domain, url, method) (rate({filename="/var/log/apache2/other_vhosts_access.log"} | regexp `(?P<domain>[^:]+).* "(?P<method>[A-Z]+) (?P<url>(/[^/? ]+){0,2}).*` [$__range]))

but this does not give me a single value per label tupel, but with

count by(domain, url, method) ({filename="/var/log/apache2/other_vhosts_access.log"} | regexp `(?P<domain>[^:]+).* "(?P<method>[A-Z]+) (?P<url>(/[^/? ]+){0,2}).*`)

I get the error: parse error at line 1, col 155: syntax error: unexpected )

tonyswumac · December 18, 2023, 4:35pm

Try something like this:

sum by (domain, url, method) (
  count_over_time(
    <YOUR_QUERY>
  [$__interval])
)

You can even slap topk on top:

topk(10, sum by (domain, url, method) (
  count_over_time(
    <YOUR_QUERY>
  [$__interval])
))

If you can share an example log line I can test for you in the logql analyzer as well.

hiltonadmin · December 18, 2023, 4:54pm

The problem is that this is still displayed as graph over time and not as list where the time is ignored:

adityapw · September 2, 2024, 5:35pm

Hey, I’m also trying the same thing. Any luck so far?
I’ve tried this:
count_over_time({filename=“/var/log/nginx/access.log”} |~ “(GET|POST) /geoserver/web” [30d])

galaxy · October 14, 2024, 7:29am

refer this article

this is what my log looks like:

{"name":"NSPanel","id":"10017b****","data":{"action":"update","deviceid":"10017b****","apikey":"f47a5333-****-****-****-************","userAgent":"device","d_seq":218583,"params":{"temperature":22.6,"humidity":"blank","tempUnit":0},"seq":"166"}}

my LogQL:

count by(name)(count_over_time({service_name="ewelink", ext="WSP_MSG"}| json[$__range]))

replace $__auto to $__range and set Options.Type from Range to Instant

and I get what I want:(a list instead of a time-seriers data)

Topic		Replies	Views
Loki group by label value Grafana Loki	6	15294	May 8, 2024
Loki status graph from access.log Grafana Loki	2	1297	May 24, 2023
Group by label value without timestamps Grafana Loki	1	179	November 7, 2024
Sum total number of items returned by a LogQL qery Grafana loki , query-help	14	236	September 2, 2024
Get count of each log contents Grafana Loki loki , promql , grafana	2	5449	April 4, 2024

How do I count lines by labels?

Related topics