Thanks for the info much appreciated, have you setup the alerting within Grafana for this?
I have set a query for the panel
|> range(start: v.timeRangeStart, stop: v.timeRangeStop)
|> filter(fn: ® => r["_measurement"] == “ping”)
|> filter(fn: ® => r["_field"] == “reply_received”)
|> aggregateWindow(every: v.windowPeriod, fn: mean, createEmpty: false)
|> yield(name: “mean”)
And the alert does not fire as this covers a few hosts - when the host drops it does not change the metric (tried by stopping telegraf service as this would replicate down) do you have to set this per host?