I represent an ISP from Riga, Latvia.
I have recently installed Prometheus with Grafana and SNMP export. I am very surprised at how fast it queries and shows me data. But now I am wondering if this fits our network.
We have thousands of switches that I want to query for interface statistics, CPU, memory and maybe temperature. I think it is going to be up to 100k of time series.
I image it to work this way:
- I get info about a new switch that had been installed. For example, querying external DB. At this point I know IP and model.
- Then I put IP address in an appropriate device_list.yml file that is bind to a job of prometheus.yml. I can to it manually now and it is working.
- Grafana/Prometheus sees a new device and automatically links it to a graph template.
Does this sound real? Any suggestions on how to accomplish this?
What if I generate hundreds or even thousands of dashboards in Grafana? E.g. dashboard per switch where are interface statistics graphs. Up to 30 graphs per dashboard.
I am struggling now with drawing delta on my graph. Any suggestions on how to generate delta from two IfInOctets and create a Mbps graph in Grafana?
Thank you very much beforehand!