Describe the bug
The loki-compactor active_retention metrics are 0 and some of them are missing related to namespace: loki_cpmpactor, but we confirmed that the deletions actually happen in the S3 bucket.
To Reproduce
Steps to reproduce the behavior:
- Started Loki 3.1.0
- scrape metrics from loki compactor service
- found that metrics are missing
- example: loki_compactor_apply_retention_last_successful_run_timestamp_seconds 0, loki_compactor_apply_retention_operation_total no-data
Expected behavior
Retention metric should be available because retention is happening
Environment:
- Infrastructure: [Kubernetes]
- Deployment tool: [helm]
Screenshots, Promtail config, or terminal output
Configuration that we used for the compactor:
rentention_period: 183d #this is for Global and for some tenants we apply 14d
compactor:
compaction_interval: 10m
delete_request_cancel_period: 24h
retention_enabled: true
retention_delete_delay: 2h
delete_request_store: s3
Not seeing this problem. I just checked and I see values for both of those metrics, running Loki 3.2.1. Couple of questions and things to try:
- Are you running simple scalable mode?
- How many compactors do you have?
- If you just hit the
/metrics
endpoint on your compactor, are you not seeing those metrics still? If you have multiple compactors be sure to do it on all of them.
Thanks @tonyswumac.
I try things that asked.
- Are you running simple scalable mode? We are using Loki Distributed
- How many compactors do you have? 1 Compactor
- If you just hit the /metrics endpoint on your compactor, are you not seeing those metrics still? If you have multiple compactors be sure to do it on all of them. 1 Compactor only running and still missing
We are using the same configuration deployment on another environment, and it is working fine there. However, we are not able to find with the provided configuration why these metrics are not coming here.
Interesting. I am not sure what’s wrong. One more thing I’d try, since you mentioned that you have another cluster that’s functioning properly, perhaps try hitting the /config
endpoint on compactors from both cluster, then do a diff and see what is different.
Hi @tonyswumac . i didn’t find any difference in the config.
I have a question: can data volume impact no retention metrics? retention is happening but maybe not correctly happening?
I am not sure, but if it’s not working properly you should be seeing some sort of error logs.