Good afternoon - I have set up a Grafana dashboard with graphs that monitor memory usage, based on the following Nagios command and service definitions:
define command{
    command_name    check_nt
    command_line    $USER1$/check_nt -H $HOSTADDRESS$ -p 12489 -v $ARG1$ $ARG2$
}
# Monitor TOTAL memory usage with check_nt
define service{
    use                     generic-service
    #host_name              MyServer
    hostgroup_name          MyServers
    service_description     Win_TotMem_NT
    check_command           check_nt!MEMUSE!-w 90 -c 95
}
However, the values returned seem higher than what I see when I remote into the servers individually, so I tried to replicate the above functionality with check_nrpe, using the following:
define command{
    command_name    check_nrpe_totmem
    command_line    $USER1$/check_nrpe -H $HOSTADDRESS$ -p 5666 -c CheckMEM -a MaxWarn=80% MaxCrit=90% ShowAll=long
}
define service{
    use                     generic-service
    #host_name              MyServer
    hostgroup_name          MyServers
    service_description     Win_TotMem_NRPE
    check_command           check_nrpe_totmem
}
The values returned in Nagios for the check_nt and check_nrpe services match each other. However, when I set up graphs for them in a Grafana dashboard, the check_nt charts work fine but the check_nrpe ones pretty much flatline. I have the Y axis unit set to megabytes; check_nt reports values in the range 2.0-3.0 GB, whereas check_nrpe shows only minimal MB values.
I’m assuming that this has something to do with the way the check_nrpe data is being parsed, but I have no idea where to start in resolving this, so any advice would be gratefully received!
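In case it helps to make my assumption concrete, here is a rough Python sketch of what I suspect is going on: the two checks may be writing their performance data with different units, so the same amount of memory ends up as very different numbers on a megabyte axis. The sample strings, the function name perfdata_to_mb and the unit table (including the single-letter K/M/G forms and the 1024-based multipliers) are my own illustrative assumptions, not output captured from my servers.

```python
import re

# Multipliers to convert a perfdata unit into megabytes.
# Assumption: 1024-based multiples; single-letter units (K, M, G)
# are treated as shorthand for KB, MB, GB.
UNIT_TO_MB = {
    "B": 1 / (1024 * 1024),
    "K": 1 / 1024, "KB": 1 / 1024,
    "M": 1, "MB": 1,
    "G": 1024, "GB": 1024,
    "T": 1024 * 1024, "TB": 1024 * 1024,
}

# Matches one perfdata item of the form label=value[unit];
# the label may be single-quoted if it contains spaces.
PERF_ITEM = re.compile(
    r"(?:'([^']+)'|([^\s=']+))=([-+]?\d+(?:\.\d+)?)([A-Za-z%]*)"
)


def perfdata_to_mb(perfdata):
    """Return {label: value_in_MB} for every size-typed item in a perfdata string."""
    result = {}
    for match in PERF_ITEM.finditer(perfdata):
        label = match.group(1) or match.group(2)
        value = float(match.group(3))
        unit = match.group(4).upper()
        # Skip percentages and anything else that is not a size unit.
        if unit in UNIT_TO_MB:
            result[label] = value * UNIT_TO_MB[unit]
    return result


# Hypothetical samples: the same 2.5 GB expressed in three different units.
print(perfdata_to_mb("'Memory usage'=2560.00MB;3686.40;3891.20;0.00;4096.00"))
print(perfdata_to_mb("'physical memory'=2.5G;3.2;3.6"))
print(perfdata_to_mb("'physical memory'=2684354560B;1;2"))
```

If the graphing layer stores the bare number and ignores the unit, a value written as 2.5G would be plotted as 2.5 on a megabyte axis, which would look like a flat line next to 2560. I have not confirmed that this is what is happening here.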
Thanks in advance
Pete