Loading...

XML

Word

Printable

JSON

Type: Task
Resolution: Fixed
Priority: Major
Fix Version/s: Labs Workbench - Beta
Affects Version/s: None
Component/s: Infrastructure
Labels:
None

Sprint:
NDS Sprint 12

Resulting from the discussions surrounding ~~NDS-405~~, we have a slightly better idea of how we would like to approach logging, monitoring, and alerts (LMA).

We now know:

which infrastructure / services we will need to monitor - ingress, ui, api, gfs, kube-system, openstack, backups, etc
we will need to run Qualys on every container running on nodes with a public IP - loadbalancer, skydns, etc
that we should be running a healthz on each service to ensure things stay running smoothly

Now we just need to explore the tools themselves (nagios, healthz, kibana, prometheus) and set up a prototype.

This ticket is complete when we have laid out how we plan to approach logging, monitoring, and (most importantly) alerts and filed any resulting work that we deem necessary into new JIRA tickets.

is triggered by

NDS-405 Discuss requirements for external monitoring

Closed

relates to

NDS-282 Health checks/monitoring

Closed

mentioned in: Page Loading...

Wiki Page: Wiki Page Loading...

Assignee:: Craig Willis

Reporter:: Sara Lambert

Votes:: 0 Vote for this issue

Watchers:: 3 Start watching this issue

Created:: 24/Aug/16 5:21 PM

Updated:: 06/Oct/16 3:36 PM

Resolved:: 23/Sep/16 11:12 AM

Estimated:

Remaining:

Logged:

Details

Description

Gliffy Diagrams

Attachments

Issue Links

Activity

People

Dates

Time Tracking

Tasks