Table with host downtime and aggregate of downtime across the farm

anon11352080 · November 8, 2018, 11:57pm

Hello friends! I’m using Prometheus and Grafana. I’ve two goals right now. The first is to get a table with a column for the host, and a column of how long the host has been down depending on the time range I’m looking at. With the help of the folks from IRC, I was able to get this query to work, but it only works for a single stat panel:

(1 - avg(avg_over_time(up[__interval]))) * (__interval_ms / 1000)
Option stat set to Total
Unit of time set to Duration (s)
Instant: Turned off

However, trying to apply this same query to a table gives me a column for Time and Value. So, I tried duplicating the single stat column which does work, but seems overkill with these huge blocks for hundreds of hosts.

And of course, once I get this resolved, I’d like to get an aggregate value of all the hosts to see downtime and graph it (I’d like to drill down by customer, region, etc eventually, but that’s in time). Can anyone assist?

adeverteuil · November 9, 2018, 3:57pm

Hi @anon11352080,

I think this panel may be relevant for you. I’m not sure you can get the exact layout you describe, but you can create an array of host status.

Best regards,
--
Alexandre de Verteuil

Topic		Replies	Views
Dashboard chart to show server uptime and downtime history Dashboards panel , prometheus	6	14223	January 3, 2023
Table: Last data timestamp from instance Grafana	2	3786	August 13, 2022
How to monitor host status?	0	4407	December 20, 2019
Creating Singlestat panel to also report unresponsive hosts Stat Panel	0	591	August 8, 2018
Visualizing multiple prometheus datasource queries as a table Grafana	13	1706	October 4, 2020

Table with host downtime and aggregate of downtime across the farm

Related topics