Hello,
I have a web application that generates a unique GUID for each visitor, like f1b4fbc3-f8cb-48b9-ad3a-e93091d01f39.
I get over 10 million of these GUIDs per month.
I am ingesting the logs with each GUID converted into a Loki label.
The label key is ‘visitorguid’ and the value is the GUID itself, so the label can take millions of distinct values.
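To make the labeling concrete, here is a minimal Python sketch of what one pushed log entry would look like, assuming Loki's JSON push format (`/loki/api/v1/push`). The helper name `build_push_payload` and the fixed timestamp are made up for illustration; this is not the actual ingestion code.

```python
import json
import time


def build_push_payload(visitor_guid, line, ts_ns=None):
    """Build a Loki push body with the GUID as a label (hypothetical helper)."""
    if ts_ns is None:
        ts_ns = time.time_ns()
    return {
        "streams": [
            {
                # Each distinct label set is its own stream in Loki,
                # so every new GUID value creates a new stream.
                "stream": {"visitorguid": visitor_guid},
                # Each value is a [nanosecond timestamp as string, log line] pair.
                "values": [[str(ts_ns), line]],
            }
        ]
    }


payload = build_push_payload(
    "f1b4fbc3-f8cb-48b9-ad3a-e93091d01f39",
    "user entered page with url of /gifts",
    ts_ns=1700000000000000000,  # arbitrary example timestamp
)
print(json.dumps(payload, indent=2))
```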
I then want to query via grafana, via a specific ‘guid’ and find all logs related to that guid.
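The query itself is just a label selector on that one GUID. Below is a rough Python sketch of the equivalent request against Loki's `query_range` HTTP endpoint, which is roughly what Grafana sends. The base URL `http://localhost:3100`, the helper name `build_query_url`, and the time bounds are assumptions for illustration only.

```python
from urllib.parse import urlencode


def build_query_url(base_url, visitor_guid, start_ns, end_ns, limit=1000):
    """Build a query_range URL selecting all logs for one GUID (hypothetical helper)."""
    # LogQL stream selector matching the label exactly.
    logql = '{visitorguid="%s"}' % visitor_guid
    params = {"query": logql, "start": start_ns, "end": end_ns, "limit": limit}
    return base_url.rstrip("/") + "/loki/api/v1/query_range?" + urlencode(params)


url = build_query_url(
    "http://localhost:3100",  # assumed local Loki address
    "f1b4fbc3-f8cb-48b9-ad3a-e93091d01f39",
    start_ns=1700000000000000000,  # arbitrary example range
    end_ns=1700003600000000000,
)
print(url)
```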
However, after my initial testing, queries are so slow that they are effectively unusable.
How can I index these GUIDs in Loki so that it's fast when I query them?
Keep in mind that each GUID can have multiple log lines.
For example:
f1b4fbc3-f8cb-48b9-ad3a-e93091d01f39 - user entered page with url of /gifts
f1b4fbc3-f8cb-48b9-ad3a-e93091d01f39 - user bought a panda bear
etc.
Some people might say that I need to narrow the search down by time range or by other means, but that is not something I want to do. I just want to retrieve all the logs labeled with a certain GUID, without having to invest time in working out how to narrow them down.
It used to work flawlessly in Elastic even with this amount of data, so why can't Loki index these labels so that I can query them? Is it because Loki is not designed for querying one very specific set of logs, but for looking at logs from a high level?