Promtail: old logs

marchenkoalexandr · April 30, 2024, 2:00pm

Whenever promtails starts first time on a server it goes crazy in attempt to send everything it found to Loki.

The fun fact is that Loki by default wont accept old logs and respond with an error.

The pain here is that while it happens promtail eats CPU resource quite noticeably.

Did not found any workarounds for this, I was expected to something like, either:

configuration option like in Kafka topic - aka read log from beginning or from the end in case if there is no stored position
configuration option to drop old log lines the same way as it is configured in Loki

Wondering if there are some well known workarounds

osobistezakupy · May 1, 2024, 10:32am

In this article it explained. But for my also it is not clear. From one side we can see that LOKI accepts old logs only when 1h behind. But on the othe side following configuration is set by default:

reject_old_samples: true
reject_old_samples_max_age: 1w

marchenkoalexandr · May 1, 2024, 11:45am

Not exactly, as mentioned, Loki is already rejecting old logs - it is completely fine

The problem is that now, I’m deploying Promtail for a very first time on server

and it starts to process months of logs as crazy eating CPU just to receive an 400 logs are too old from Loki

Which is kinda weird

looking for a way to say to promtail - ignore old logs and do not even bother to preprocess and send them to Loki

tonyswumac · May 2, 2024, 11:57pm

It may be less obvious, but you can use the drop stage in promtail to drop logs older than specified time period. See drop | Grafana Loki documentation.

jsabater · December 28, 2024, 8:33am

In the current version of Loki, 3.3.1 as of this writing, do not forget this bit, as the default value is 2 hours:

# Configure the ingester and how the ingester will register itself to a key value store.
ingester:
  # The maximum duration of a timeseries chunk in memory. If a timeseries runs for
  # longer than this, the current chunk will be flushed to the store and a new
  # chunk created. Default is 2h.
  max_chunk_age: "8766h"

Once old logs are imported, changing this configuration option should not be required anymore.

plaskplask · January 7, 2025, 3:49pm

What about the warning from the original article?

The default here is two hours. I strongly discourage increasing this value to create a larger out-of-order window. At Grafana Labs we do not run Loki this way so I can’t easily tell you what challenges you may face. I recommend trying to find a way to use labels to separate the streams so they can be ingested separately.

I am currently struggling with extending the time window for old logs. I would rather not touch this setting - in our environment, the situation where we need to ingest the logs from (for example) last 10 days may happen pretty often, so i would like to set it once and for good. I tried to set reject_old_samples_max_age to 3w and 5000h (while reject_old_samples is set to true), but the Loki still reports:

level=warn ts=2025-01-07T14:33:51.108061786Z caller=grpc_logging.go:76 method=/logproto.Pusher/Push duration=6.108444ms msg=gRPC err="rpc error: code = Code(400) desc = entry with timestamp 2024-12-19 16:10:27.142 +0000 UTC ignored, reason: 'entry too far behind, entry timestamp is: 2024-12-19T16:10:27Z, oldest acceptable timestamp is: 2025-01-07T13:30:25Z'

In other words, Loki still does not accept anything older than 1 hour…

Any word of advise?

Topic		Replies	Views
Promtail - only get/process the latest log after Promtail is started Grafana Loki	3	1281	February 23, 2022
Loki/Promtail : parsing timestamp that are too old Grafana Loki	4	14450	January 23, 2022
Promtail log synchronization data loss Grafana Loki loki , promtail	4	619	June 7, 2024
Pushing an offline logs from the past Grafana Loki loki	3	2305	November 15, 2023
Getting "entry too far behind" even if reject_old_samples and reject_old_samples_max_age should allow Grafana Loki loki	3	1181	March 19, 2025

Promtail: old logs

Related topics