-
Task
-
Resolution: Fixed
-
Normal
-
NDS Sprint 13
Nagios is an open source monitoring system. Typically, the Nagios server is installed in one location and the Nagios Remote Plugin Executor (NRPE) is installed on each node to be monitored. Nagios monitors public services (e.g., DNS, HTTP, SMTP) through standard plugins, and private resources (e.g., CPU, memory, disk) through NRPE.
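To illustrate the plugin model described above, here is a minimal sketch of a Nagios-style check written in Python. It relies on the standard plugin convention of one status line on stdout plus an exit code of 0 (OK), 1 (WARNING), 2 (CRITICAL), or 3 (UNKNOWN). The function names and the 80%/90% thresholds are assumptions for illustration and are not part of this ticket.

```python
import shutil

# Standard Nagios plugin exit codes.
OK, WARNING, CRITICAL, UNKNOWN = 0, 1, 2, 3


def evaluate_disk(used_pct, warn=80.0, crit=90.0):
    """Map a disk-usage percentage to an (exit_code, status_line) pair.

    The warn/crit thresholds are illustrative defaults, not project values.
    """
    if used_pct >= crit:
        return CRITICAL, f"DISK CRITICAL - {used_pct:.1f}% used"
    if used_pct >= warn:
        return WARNING, f"DISK WARNING - {used_pct:.1f}% used"
    return OK, f"DISK OK - {used_pct:.1f}% used"


def check_disk(path="/", warn=80.0, crit=90.0):
    """Measure usage of the filesystem containing `path` and evaluate it."""
    try:
        usage = shutil.disk_usage(path)
    except OSError as exc:
        # Nagios treats exit code 3 as "the check itself could not run".
        return UNKNOWN, f"DISK UNKNOWN - {exc}"
    used_pct = 100.0 * usage.used / usage.total
    return evaluate_disk(used_pct, warn, crit)


def main():
    """Print the status line and return the exit code for the caller."""
    code, message = check_disk()
    print(message)
    return code
```

A script deployed as a plugin would end with `sys.exit(main())` so that NRPE can relay the exit code back to the Nagios server.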
For NDS Labs, we'll do the following:
- Create Nagios server Docker image following the instructions in https://www.digitalocean.com/community/tutorials/how-to-install-nagios-4-and-monitor-your-servers-on-ubuntu-14-04
- Create Nagios daemonset for NRPE following the instructions in https://www.digitalocean.com/community/tutorials/how-to-install-nagios-4-and-monitor-your-servers-on-ubuntu-14-04
- Provision VM to run Nagios server at remote site (SDSC)
- Configure Nagios tunneling (stunnel)
- Create a GitHub repository for Nagios configuration to maintain versioned, per-cluster monitoring configurations (starting with beta)
- Configure Nagios contacts
- Configure Nagios hosts for priority systems. This includes:
  - Ingress/Nginx
  - Web UI/API, including Kube API/etcd availability
  - Kube system (GFS, LMA tools, etc.)
  - OpenStack
  - Backups
Note: this may require changes to these systems to support monitoring.
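As a rough sketch of how the per-cluster host configuration could be kept in a versioned repository, the snippet below renders Nagios `define host` object definitions from a small Python table. The host names, the `192.0.2.x` documentation addresses, and the `generic-host` template name are all hypothetical placeholders; the real values would come from each cluster.

```python
def render_host(host_name, address, alias, template="generic-host"):
    """Render one Nagios host object definition as text.

    `template` names a host template to inherit from; "generic-host"
    is an assumed name, so substitute whatever the site config defines.
    """
    lines = [
        "define host {",
        f"    use        {template}",
        f"    host_name  {host_name}",
        f"    alias      {alias}",
        f"    address    {address}",
        "}",
    ]
    return "\n".join(lines)


# Hypothetical priority systems for one cluster (placeholder addresses).
PRIORITY_HOSTS = [
    ("ingress", "192.0.2.10", "Ingress/Nginx"),
    ("kube-api", "192.0.2.11", "Kube API/etcd"),
    ("backup", "192.0.2.12", "Backups"),
]


def render_cluster(hosts):
    """Join all host definitions into the body of a single .cfg file."""
    return "\n\n".join(render_host(name, addr, alias) for name, addr, alias in hosts)
```

Calling `print(render_cluster(PRIORITY_HOSTS))` produces text that could be committed as a per-cluster `.cfg` file, which keeps the host list reviewable in version control.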