A node in Loki’s memberlist cluster suddenly exited without any abnormalities in memory and CPU, while the other three nodes were running normally without exiting.
The log error which exiting node is as follows:
Segmentation fault once in a while isn’t particularly concerning in my opinion. If you only have this problem with one container you might want to check your Loki metrics and make sure all nodes are load balanced properly.
goroutine 0 gp=0xc000007180 m=3 mp=0xc0000b5008 [idle]:
goroutine 33 gp=0xc000682000 m=3 mp=0xc0000b5008 [GC worker (active)]:
created by runtime.gcBgMarkStartWorkers in goroutine 1
goroutine 1 gp=0xc0000061c0 m=nil [select, 64917 minutes]:
goroutine 2 gp=0xc000006700 m=nil [force gc (idle), 64917 minutes]:
created by runtime.init.6 in goroutine 1
goroutine 3 gp=0xc000006c40 m=nil [GC sweep wait]:
created by runtime.gcenable in goroutine 1
goroutine 4 gp=0xc000006e00 m=nil [sleep]:
created by runtime.gcenable in goroutine 1
goroutine 17 gp=0xc000102380 m=nil [finalizer wait]:
created by runtime.createfing in goroutine 1
goroutine 18 gp=0xc000102540 m=nil [GC worker (idle)]:
created by runtime.gcBgMarkStartWorkers in goroutine 1
goroutine 5 gp=0xc000007340 m=nil [GC worker (idle)]:
created by runtime.gcBgMarkStartWorkers in goroutine 1
goroutine 19 gp=0xc000102700 m=nil [GC worker (idle)]:
created by runtime.gcBgMarkStartWorkers in goroutine 1
goroutine 6 gp=0xc000007500 m=nil [GC worker (idle)]:
created by runtime.gcBgMarkStartWorkers in goroutine 1
goroutine 49 gp=0xc000702000 m=nil [GC worker (idle)]:
created by runtime.gcBgMarkStartWorkers in goroutine 1
goroutine 50 gp=0xc0007021c0 m=nil [GC worker (idle)]:
created by runtime.gcBgMarkStartWorkers in goroutine 1
goroutine 34 gp=0xc0006821c0 m=nil [GC worker (idle)]:
created by runtime.gcBgMarkStartWorkers in goroutine 1
goroutine 7 gp=0xc000802380 m=nil [select, 64917 minutes]:
created by github.com/baidubce/bce-sdk-go/util/log.NewLogger in goroutine 1
goroutine 20 gp=0xc000c01500 m=nil [select]:
created by go.opencensus.io/stats/view.init.0 in goroutine 1
goroutine 138 gp=0xc000c01180 m=nil [chan receive]:
created by github.com/grafana/loki/v3/pkg/util/log.newPrometheusLogger.WithFlushPeriod.func2 in goroutine 1
goroutine 171 gp=0xc000c461c0 m=nil [select, 1 minutes]:
created by github.com/uber/jaeger-client-go.NewRemotelyControlledSampler in goroutine 1
goroutine 173 gp=0xc000c46380 m=nil [select]:
created by github.com/uber/jaeger-client-go/utils.newReconnectingUDPConn in goroutine 1
goroutine 174 gp=0xc000c46540 m=nil [select]:
created by github.com/uber/jaeger-client-go.NewRemoteReporter in goroutine 1
goroutine 181 gp=0xc000c468c0 m=nil [select, 64917 minutes]:
created by google.golang.org/grpc/internal/grpcsync.NewCallbackSerializer in goroutine 1
goroutine 182 gp=0xc000c46c40 m=nil [select, 64917 minutes]:
created by google.golang.org/grpc/internal/grpcsync.NewCallbackSerializer in goroutine 1
goroutine 183 gp=0xc000c46e00 m=nil [select, 64917 minutes]:
created by google.golang.org/grpc/internal/grpcsync.NewCallbackSerializer in goroutine 1
goroutine 812 gp=0xc000c01340 m=nil [select, 64917 minutes]:
created by google.golang.org/grpc/internal/grpcsync.NewCallbackSerializer in goroutine 771
loki version:
loki, version 3.2.0 (branch: k218, revision: 659f5421)
build user: root@003ce357cdf4
build date: 2024-09-18T16:21:52Z
go version: go1.22.6
platform: linux/amd64
tags: netgo