Loki uses full CPU for a long time and needs a restart to work again

christoforosd · August 30, 2023, 7:12am

Hi, at random times Loki's CPU usage hits 100% for a long time and everything stops working; I need to restart Loki to get it working again. The CPU was at 100% for 15 minutes before I restarted the container.

I have one container only, with boltdb and S3.

Any idea what I should look at to find the issue, or why this might be happening?

tonyswumac · August 30, 2023, 3:59pm

What is your log volume looking like? Perhaps it’s time to look into simple scalable deployment.

But if you want to just run a single instance, I’d recommend trying to reduce the number of chunks written to S3 and avoiding splitting queries too much.

christoforosd · August 31, 2023, 2:33pm

How can I find my volume?

It is small, maybe 5-6 GB per day. Right now it runs on one EC2 instance with 8 CPUs. I am planning to move to a small scalable deployment, but I cannot do that now, so I need to find the issue with this setup, and so far I have had no luck.

Average CPU usage per day is ~25%, with some spikes to 50-60%.

tonyswumac · August 31, 2023, 5:38pm

Check your storage and see how many files are written per day. If you are running a single instance I’d try to aim for having as few files as possible.
Also reduce query splitting.

christoforosd · September 4, 2023, 2:41pm

Hi Tony, thank you for your reply.

Storage files, from the S3 bucket metrics: 30000 (I am not sure how to check this from Loki metrics).

"If you are running a single instance I’d try to aim for having as few files as possible."

How can I have fewer files? By deleting the old ones?

"Reduce query split."

I will try changing this (not sure if this is the right setting):

limits_config:
  split_queries_by_interval

tonyswumac · September 5, 2023, 3:47pm

There are a couple of configurations that you can use to control how many chunks get written, such as max_chunk_age, chunk_idle_period, and chunk_target_size. Make sure to also adjust query_ingesters_within to match.
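As a rough sketch of where those settings live in the Loki config file: the values below are illustrative assumptions, not recommendations from this thread, and should be tuned against your own log volume. The general idea is that longer-lived, fuller chunks mean fewer objects in S3, and query_ingesters_within should stay at least as large as max_chunk_age so that chunks not yet flushed are still queried.

```yaml
# Illustrative values only (assumed for this example).
ingester:
  chunk_idle_period: 1h       # flush a chunk after it receives no new logs for this long
  max_chunk_age: 4h           # flush a chunk once it reaches this age, even if not full
  chunk_target_size: 1572864  # target compressed chunk size in bytes (about 1.5 MB)

querier:
  query_ingesters_within: 5h  # keep this >= max_chunk_age
```

Raising these keeps more data in ingester memory before it is flushed, so watch memory usage on a single instance after changing them.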

As for splitting queries, split_queries_by_interval is what you want. You can change that to 12h or 24h for example and see how it performs.
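For example, a minimal fragment using the 24h value mentioned above (assuming a Loki version where this setting sits under limits_config; check the configuration reference for your version):

```yaml
limits_config:
  # Larger interval means fewer, larger sub-queries per request.
  split_queries_by_interval: 24h
```

On a single instance this reduces how many sub-queries run at once, at the cost of less parallelism per query.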

