Multiple data sources blended into a single data source after S3 migration

Hey guys,

we’ve been using 3 monolithic Loki servers running on local disk to capture syslog from our network, one server per data hub. Everything was working as intended: we had 3 data sources in Grafana to query, and as you’d expect, each data source only had access to the hosts connected to its own Loki.

Recently we’ve started sending logs to S3 (our local Ceph server); all 3 servers share the same bucket. It seemed to work as intended, but I’ve just realized that whichever data source I use in Grafana, every query hits the “whole” database. Meaning even if I choose Loki1 as the data source, I’m getting results from all 3.

Granted, they now share the same bucket, but I would expect the data to be grouped separately, so that a query against a specific data source only pulls data belonging to that data source.

Do I need to configure something differently, or will Loki/Grafana act as if there is only a single data source as long as they share the same bucket, regardless of the separate Loki servers?

Thank you!

You don’t want to share the same S3 bucket across different Loki instances / clusters.

Is it just against best practice, or could it lead to lost/duplicate/corrupt data?

I’ve added a unique label in Promtail on all 3 servers to distinguish/filter the logs, but they are still using the same bucket/data source.
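For reference, it’s roughly like this on each server (a sketch, not my exact config; the `hub` label name, its value, and the listener port are just examples, with a different value per server):

```yaml
# promtail-config.yaml (sketch; "hub: hub1" becomes hub2/hub3 on the others)
server:
  http_listen_port: 9080

positions:
  filename: /tmp/positions.yaml

clients:
  - url: http://localhost:3100/loki/api/v1/push   # this server's local Loki

scrape_configs:
  - job_name: syslog
    syslog:
      listen_address: 0.0.0.0:1514
      labels:
        job: syslog
        hub: hub1                      # unique label per Loki/Promtail pair
    relabel_configs:
      - source_labels: ['__syslog_message_hostname']
        target_label: host             # keep the sending host as a label
```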

As you’ve already discovered, by sharing the backend S3 bucket between Loki clusters you are effectively sharing the data across them, even though they are supposed to be separate clusters. This leads to:

  1. Each cluster being able to see the others’ logs, which is not ideal since you presumably separated the clusters for a reason.
  2. Since each of your clusters runs its own compactor, this can lead to compactor conflicts and potentially data loss (see the sketch after this list).
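To illustrate point 2: right now all 3 servers are effectively running something like the fragment below against the same bucket, so 3 independent compactors keep rewriting the same shared index. This is a sketch for a Loki 2.x-style config; the endpoint, credentials, and bucket name are placeholders.

```yaml
# Fragment of each Loki server's config (sketch; keys/endpoint/bucket are
# placeholders). All 3 compactors operate on the same index in the bucket.
compactor:
  working_directory: /loki/compactor
  shared_store: s3            # Loki 2.x option; newer versions configure this differently

storage_config:
  aws:
    s3: s3://ACCESS_KEY:SECRET_KEY@ceph.internal:7480/loki-logs   # same bucket on all 3
    s3forcepathstyle: true    # typically needed for Ceph RGW endpoints
```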

You can’t change the directory structure Loki uses inside a bucket, so it’s best to just create a separate S3 bucket dedicated to each instance.
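In practice that means each server points at its own bucket, along these lines (again a sketch; the bucket names are placeholders):

```yaml
# Loki server for hub 1 (sketch; repeat with loki-hub2 / loki-hub3 on the
# other two servers, each pointing at its own dedicated bucket)
storage_config:
  aws:
    s3: s3://ACCESS_KEY:SECRET_KEY@ceph.internal:7480/loki-hub1
    s3forcepathstyle: true
```

With one bucket per cluster, each compactor only ever touches its own index, and each Grafana data source only sees its own logs again.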
