Loading...

XML

Word

Printable

JSON

Type: Task
Resolution: Fixed
Priority: Major
Fix Version/s: Labs Workbench - Beta
Affects Version/s: None
Component/s: Backend, Infrastructure, User Interface
Labels:
None

Sprint:
NDS Sprint 12

Resulting from the discussions surrounding ~~NDS-310~~, we have a slightly better idea of how we plan to approach performance testing.

We now know:

GFS is our primary point of failure, and should be stress tested fairly heavily (~~NDS-262~~)
~~we need to do quite a bit more exploration on how many concurrent users we plan to support~~ plan for 50 users initially, will scale as needed/possible
~~once~~ since we know our potential capacity, we should revisit the approach that willis8 used to profile the system for the IASSIST workshop in May

Now we just need to explore the process itself.

Some unanswered questions:

How do we learn our limits?
- Capacity planning / monitoring - plan for 50 users initially
- what is the workload?
- what resource limits per user?
What is the process for testing that we can handle the given workload
- Write a test plan for testable components (see comment below)
- Revisit IASSIST profiling techniques used by willis8
- See NDS Labs Test Plan
By what performance metric(s) do we judge pass/fail?
- Given the above metric(s), what constitutes a failure?
- Critical services crashing into "CrashLoopBackoff"? "Error"?
- Dead openstack node? i.e. blackholed?
~~What happens when we need to:~~
- ~~restart CoreOS?~~ (~~NDS-346~~)
- ~~add GFS bricks?~~ (NDS-529)
- ~~add kubernetes nodes?~~ (~~NDS-528~~)
- ~~remove a kubernetes node for maintenance~~ (~~NDS-528~~)

This ticket is complete when we have laid out how we plan to approach performance testing, including the metrics that we plan to use for pass/fail and service-specific, as well as an estimate of resource requirements to Kenton based on the capacity analysis.

is triggered by

NDS-310 Discuss performance testing requirements

Closed

is triggering

NDS-579 Prototype performance load test process

Resolved

relates to

NDS-529 Ability to add storage to the cluster

Open

NDS-262 Stress-testing on prototype GlusterFS

Resolved

NDS-528 Ability to add a node to the cluster

Resolved

NDS-346 Determine issues when CoreOS rolling-update is enabled

Closed

mentioned in: Page Loading...; Page Loading...; Page Loading...; Page Loading...

Wiki Page: Wiki Page Loading...; Wiki Page Loading...; Wiki Page Loading...

(1 relates to, 4 mentioned in, 3 Wiki Page)

Assignee:: Sara Lambert

Reporter:: Sara Lambert

Votes:: 0 Vote for this issue

Watchers:: 2 Start watching this issue

Created:: 24/Aug/16 5:31 PM

Updated:: 23/Sep/16 11:11 AM

Resolved:: 23/Sep/16 11:11 AM

Estimated:

Remaining:

Logged:

Details

Description

Gliffy Diagrams

Attachments

Issue Links

Activity

People

Dates

Time Tracking

Tasks