Monitor website is up or down using grafana

jag3773 · December 2, 2019, 4:18pm

I run telegraf on all our nodes with URL checks setup against all our sites. This creates a many to many checking relationship that mitigates bad data.

In grafana, I setup two queries, the first one is graphed and records the average response time:

alias(averageSeries(telegraf.*.GET.success.https:--my_url_com*.*.*.response_time), 'Avg Reponse Time')

The second one is disabled on the graph but records the response code:

averageSeries(telegraf.a_server_com.GET.success.https:--my_url_com*.*.http_response.http_response_code)

Then I setup my alert like this:

Essentially, I get an alert if the average response time is greater than 1 second or if the average HTTP response code is greater than 300 (which could indicate any number of problems). This combination seems to catch the problem situations you’ll run into.

Topic		Replies	Views
Http(s) status monitoring Alerting alerting , plugins	2	2838	January 21, 2022
Dashboard website monitoring is showing only one of two defined urls Prometheus	0	857	June 7, 2022
Node up or down dashboard Grafana	1	2767	August 22, 2020
URL Monitoring For Windows Prometheus grafana	0	288	August 9, 2023
Grafana is showing Website is down while my website is up Prometheus alerting , query-help	3	2240	December 16, 2023

Monitor website is up or down using grafana

Related topics