This article covers a version of CentOS that is no longer supported. If you are currently operating a server running CentOS 6, we highly recommend upgrading or migrating to a supported version of CentOS.
Reason: CentOS 6 reached end of life (EOL) on November 30th, 2020 and no longer receives security patches or updates. For this reason, this guide is no longer maintained.
See Instead:
This guide might still be useful as a reference, but may not work on other CentOS releases. If available, we strongly recommend using a guide written for the version of CentOS you are using.
Bash is a popular command processor available on Linux by default.
We have previously covered how to install Nagios monitoring server on CentOS 6 x64.
This time, we will expand on this idea and create Nagios plugins using Bash. These plugins will be running on client VPS, and be executed via NRPE.
rpm -ivh http://pkgs.repoforge.org/rpmforge-release/rpmforge-release-0.5.3-1.el6.rf.x86_64.rpm yum -y install nagios-nrpe useradd nrpe && chkconfig nrpe on
It would be a good idea to keep your plugins in same directory as other Nagios plugins (/usr/lib64/nagios/plugins/ for example).
For our example, we will create a script that checks current disk usage by calling "df" from shell, and throw an alert if it is over 85% used:
#!/bin/bash used_space=`df -h / | grep -v Filesystem | awk '{print $5}' | sed 's/%//g'` case $used_space in [1-84]*) echo "OK - $used_space% of disk space used." exit 0 ;; [85]*) echo "WARNING - $used_space% of disk space used." exit 1 ;; [86-100]*) echo "CRITICAL - $used_space% of disk space used." exit 2 ;; *) echo "UNKNOWN - $used_space% of disk space used." exit 3 ;; esac
We will save this script in /usr/lib64/nagios/plugins/usedspace.sh and make it executable:
chmod +x /usr/lib64/nagios/plugins/usedspace.sh
The entire Nagios NRPE plugin boils down to using exit codes to trigger alerts.
You introduce your level of logic to the script, and if you want to trigger an alert (whether it is OK, WARNING, CRITICAL, or UNKNOWN) - you specify an exit code.
Refer to the following Nagios Exit Codes:
Exit Code | Status |
0 | OK |
1 | WARNING |
2 | CRITICAL |
3 | UNKNOWN |
Delete original /etc/nagios/nrpe.cfg and add the following lines to it:
log_facility=daemon pid_file=/var/run/nrpe/nrpe.pid server_port=5666 nrpe_user=nrpe nrpe_group=nrpe allowed_hosts=198.211.117.251 dont_blame_nrpe=1 debug=0 command_timeout=60 connection_timeout=300 include_dir=/etc/nrpe.d/ command[usedspace_bash]=/usr/lib64/nagios/plugins/usedspace.sh
Where 198.211.117.251 is our monitoring server from previous articles. Change these to your own values.
Make sure to restart Nagios NRPE service:
service nrpe restart
Define new command in /etc/nagios/objects/commands.cfg
define command{ command_name usedspace_bash command_line $USER1$/check_nrpe -H $HOSTADDRESS$ -c usedspace_bash }
As you can see, it uses NRPE to make TCP connections to port 5666 and run command 'usedspace_bash', which we defined in /etc/nagios/nrpe.cfg on that remote host.
Add this check to your Nagios configuration file for client VPS.
For our example, we will monitor a server called CentOSDroplet and edit /etc/nagios/servers/CentOSDroplet.cfg
define service { use generic-service host_name CentOSDroplet service_description Custom Disk Checker In Bash check_command usedspace_bash }
Restart Nagios:
service nagios restart
Verify that the new check is working:
And you are all done!
Thanks for learning with the DigitalOcean Community. Check out our offerings for compute, storage, networking, and managed databases.
This textbox defaults to using Markdown to format your answer.
You can type !ref in this text area to quickly search our full set of tutorials, documentation & marketplace offerings and insert the link!
Sign up for Infrastructure as a Newsletter.
Working on improving health and education, reducing inequality, and spurring economic growth? We'd like to help.
Get paid to write technical tutorials and select a tech-focused charity to receive a matching donation.
I had to move print $5 to print $4, this now prints out the correct number, however it still always returns and exit code of 3 regardless of the number that prints.
command$ df -h / | grep -v Filesystem | awk ‘{print $5}’ | sed ‘s/%//g’ output$ /
command$ df -h / | grep -v Filesystem | awk ‘{print $4}’ | sed ‘s/%//g’ output$ 11
command$ ./used_space_root.sh output$ UNKNOWN - 11% of root partition used.
@Zackary: Looks like there might be a problem with your bash script. It’s returning a wrong value. Make sure you’ve copied it correctly. What is the output of running the df command directly:
<pre> df -h / | grep -v Filesystem | awk ‘{print $5}’ | sed ‘s/%//g’ </pre>
Getting this as the status:
UNKNOWN - /% of disk space used.