-
Requirement
-
Resolution: Unresolved
-
Normal
-
None
-
Labs Workbench - Beta
-
None
-
None
-
NDS Sprint 19, NDS Sprint 20, NDS Sprint 21
Discuss various aspects of supporting the Labs Workbench as a production service.
Topics of interest might include:
- What are the hours during which we are expected to support this product?
- Who is responsible for fixing Labs when it breaks? (i.e. when NAGIOS complains)
- Does it depend on which component is broken?
- What if this person is unavailable or cannot be contacted?
- Is there an on-call schedule?
Ideally, we should also:
- Come up with an actual maintenance schedule and process / plan for bringing things down for maintenance
- Set up a "Contact Details" wiki page including emails, phone numbers, and chat client preferences of the support team and important technical contacts
- Set up a nice at-a-glance dashboard for viewing logs / performance of the cluster
- This could easily just be an AngularJS page with iframes into kibana / grafana / kubedash / etc
This ticket is complete when we have discussed these topics, as well as any other related issues or concerns, and gathered them into a Confluence wiki page.