[loki-distributed] Does Loki in microservices mode require external storage?

justinstauffer · February 1, 2022, 3:11pm

I’ve been spending the last couple of weeks absolutely struggling to get Loki (and Tempo) up and running properly in an on-premises Kubernetes cluster.

My first attempt was to use the “single binary” approach, but the Helm chart (https://github.com/grafana/helm-charts/tree/main/charts/loki) lets you attempt to scale it even though that is not appropriate for this mode, and I ran into problems because it got confused by having multiple instances. For details, see my post in the Tempo forum: “Traces and Logs intermittently disappearing from Tempo and Loki”.

My production use case is going to involve a lot of logging, so I don’t think the “single binary” approach is appropriate, as I need the ability to scale up. My impression was that running Loki in distributed/microservices mode would be the way to go. However, I do NOT want to rely on external storage like S3, GCS, or Azure. After actually testing the loki-distributed Helm chart with filesystem storage, it seems this does NOT work (details in my “Logs disappearing” post).

The problem is that I am seeing conflicting info in the documentation that is confusing me. I thought it was possible to use file system storage with boltdb-shipper because the Architecture → Storage → Single Store says:

Loki stores all data in a single object storage backend. This mode of operation became generally available with Loki 2.0 and is fast, cost-effective, and simple, not to mention where all current and future development lies. This mode uses an adapter called boltdb_shipper to store the index in object storage (the same way we store chunks).

OK, sounds like boltdb-shipper is the way to go! All future development lies here…

Oops, but here on this Single Store Loki (boltdb-shipper index type) page it doesn’t mention anything about using local storage with boltdb-shipper.

Then on the Filesystem Object Store page it explicitly states filesystem doesn’t work with scaling…

So perhaps I’ve been thoroughly confused by the Helm chart’s inappropriate flexibility, but can someone who knows for sure please clarify whether external storage (S3, GCS, Azure, etc.) is required for Loki in distributed mode if you actually want to take advantage of the scaling capabilities? I have the same question for Tempo as well.

justinstauffer · February 4, 2022, 2:37pm

Based on further testing, it seems that in order to run Loki in distributed mode, you do need to use external storage. If you don’t want to offload your storage to a cloud provider, however, it seems you can run MinIO locally on your on-premises cluster, since MinIO has an Amazon S3-compatible API.
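For anyone landing here later, a minimal sketch of what pointing Loki at an in-cluster MinIO could look like. This is not a tested configuration from this thread: the endpoint, bucket name, credentials, and local directories are all hypothetical placeholders, and the key names follow the Loki 2.x configuration reference as I understand it, so check them against the docs for your version.

```yaml
# Sketch only: Loki 2.x storage settings using boltdb-shipper with an
# S3-compatible backend (MinIO). All values below are assumed placeholders.
schema_config:
  configs:
    - from: 2022-01-01            # assumed start date for this schema period
      store: boltdb-shipper       # index is shipped to the object store
      object_store: s3            # chunks go to the S3-compatible backend
      schema: v11
      index:
        prefix: index_
        period: 24h               # boltdb-shipper requires a 24h index period

storage_config:
  boltdb_shipper:
    active_index_directory: /var/loki/index   # hypothetical local path
    cache_location: /var/loki/cache           # hypothetical local path
    shared_store: s3
  aws:
    endpoint: minio.minio.svc.cluster.local:9000  # hypothetical MinIO service address
    bucketnames: loki-data                        # hypothetical bucket, must already exist
    access_key_id: REPLACE_ME
    secret_access_key: REPLACE_ME
    s3forcepathstyle: true   # MinIO is typically addressed path-style, not by bucket subdomain
    insecure: true           # plain HTTP inside the cluster; set false if MinIO serves TLS
```

The point of the sketch is that every Loki component reads and writes the same bucket, which is what the filesystem store cannot offer once the components run as separate pods.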

system · February 4, 2023, 2:38pm

This topic was automatically closed 365 days after the last reply. New replies are no longer allowed.
