Multiple different Loki instances sending to the same bucket

I have three different Loki instances; they are in different regions, but they all write to the same bucket.

The issue is that from the main Loki (the one Grafana queries) it takes a very long time for the logs from the other two Loki instances to show up (usually 50m-1h).

I tried to disable the compactor, but Loki failed. I didn’t join the instances together with memberlist. Not sure if I need to join them for this to work.

Any ideas, please?

compactor:
  working_directory: /data/loki/boltdb-shipper-compactor
  shared_store: aws

table_manager:
  retention_deletes_enabled: true
  retention_period: 720h

Thanks :slight_smile:

The Loki querier queries the writer instances for recent data within the configured query_ingesters_within window. If your querier can’t reach the remote writers, then that would explain it.
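For reference, the setting lives in the querier block; a minimal sketch (the 3h value is only illustrative, it happens to be the default):

querier:
  # How far back the querier also asks ingesters (writers) for data
  # that may not have been flushed to object storage yet.
  query_ingesters_within: 3h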

You could try to lower it, but a better option would be to redesign your architecture. You should keep the Loki cluster contained within one location, then from your remote accounts set up Promtail or Fluentd forwarders to forward logs to your Loki cluster. The connectivity can be facilitated with a VPC endpoint or transit gateway.
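As a rough sketch of the forwarder side, each remote account could run Promtail with a client pointing at the central Loki (the hostname below is hypothetical):

# promtail.yaml (sketch)
clients:
  - url: http://loki-central.internal.example:3100/loki/api/v1/push

scrape_configs:
  - job_name: system
    static_configs:
      - targets: [localhost]
        labels:
          job: varlogs
          __path__: /var/log/*.log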

I only query the main instance. But if I keep the Loki cluster contained in one location, then the price will be a lot more. And the security groups will be a nightmare to manage.

Does Loki not support this architecture?
Writing from multiple Loki instances to the same bucket; if I use DynamoDB, should it be OK?

I changed query_ingesters_within to 10m on all three instances, but I still have the issue, maybe 30m now. If I change from boltdb-shipper to DynamoDB, will the issue be fixed?

I don’t understand why this is happening. Is it because the writer instances, given the query_ingesters_within config, take too much time to ingest?

There is a limit on how often writers flush to chunk storage; it’s controlled either by size, idle period, or max wait period. Typically you’d configure query_ingesters_within in line with the max wait period.
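Roughly, the relevant knobs on the writer side look like this (a sketch only; the values are illustrative, not recommendations):

ingester:
  chunk_target_size: 1536000   # flush once a chunk reaches roughly this size
  chunk_idle_period: 30m       # flush if a stream receives no logs for this long
  max_chunk_age: 1h            # flush no later than this (the max wait period)

Until one of these triggers a flush, the logs only exist on the ingester, which is why the querier has to be able to reach it.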

I don’t know why the cost would be a lot more; you’d be running the same number of hosts (maybe even fewer), and I am not sure why it would be a nightmare to manage.

If I have Promtail send all logs to one region, then I will pay for all the traffic passing through the transit gateway.

VPC endpoint service might be an alternative then.

It’s a trade-off, if you think about it. You either accept that there will always be some logs living on the ingesters for a period of time before they are sent to chunk storage, or you try to justify the cost.


Thank you a lot for your help :slight_smile:

Stupid question: if I switch the index storage from boltdb-shipper to DynamoDB and all three nodes write to DynamoDB, will this issue get better or stay the same?

I was thinking that the writers write to chunk storage instantly, and that it’s just the search that takes time to go through the storage and index? :confused:

@tonyswumac how do I configure the querier to query the ingesters? How does the querier learn about the ingesters? Over cluster gossip? I can’t figure this part out. I’m able to query logs from MinIO but not from the ingesters. I’ve tried everything I can think of and nothing has worked for me. My query_ingesters_within is the default of 3h.

It probably wouldn’t make any difference, your queriers still need to be able to connect to ingesters for logs that haven’t been written to chunk storage.

Please see my reply to another topic here: Loki 2.4.1 empty ring Code(500) error for "GET /loki/api/v1/labels" API on AWS ECS - #10 by tonyswumac

It doesn’t directly apply to you (unless you are using ECS as well), but I did explain how service discovery works and how the memberlist ring is configured there.
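For context, the memberlist ring part of the config usually looks something like this (a minimal sketch; the hostnames and addresses are placeholders, and the common.ring block assumes Loki 2.4-style configuration):

memberlist:
  join_members:
    - loki-1.internal.example:7946
  # Needed when the address/port the process binds to is not what
  # other members should dial (e.g. behind port translation).
  advertise_addr: 10.0.1.15
  advertise_port: 7946

common:
  ring:
    kvstore:
      store: memberlist

Note that gossip only handles ring membership; the querier then dials each ingester over its gRPC port (9095 by default), so that port has to be reachable as well, not just the gossip port.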

When you say “it doesn’t propagate the port which means you have to use native port (meaning no bridge) for your writers.”, what writer port are you referring to? The gRPC port, or the gossip port? I’m assuming gRPC. I have all memberlist members communicating successfully over gossip. We are using Podman with slirp4netns network mode and port translation. For gossip to work, we had to configure advertise_addr and advertise_port. I just can’t seem to figure out how to get the querier to query the ingesters.

Just in general, you’d want all open ports from writers to be directly accessible within the cluster. I see you created another topic, perhaps we should move this conversation there (just tag me) instead of hijacking someone else’s topic.

Also please include your Loki configuration (particularly the memberlist ring part), and test with telnet from your querier to a writer to see whether the connections work or not.
