Overview

This page houses the results of the manual load testing performed on the NDS Labs Workbench (Beta).

Objective

  1. Generate load on the system for a given number of users
  2. Monitor the system's resource utilization using Grafana
    • This will give us a benchmark of the expected "load" on the cluster
  3. Take user feedback regarding general usability of the system under the desired load conditions
    • This will let us know if user performance has degraded due to any stress on the system
  4. Take note of how any node additions / removals affect resource constraints, and to what degree
    • If the system's resources become constrained, add a node to the cluster to alleviate the constraint
    • If the system has far more resources than it needs, remove a node from the cluster to simulate a downed node
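
The node add / remove rule in objective 4 can be sketched as a small decision function. This is only an illustration: the page does not define what counts as "constrained" or "over-saturated", so the two thresholds below, along with the names `ClusterUsage` and `recommend_node_action`, are assumptions rather than values used during testing.

```python
from dataclasses import dataclass

# Hypothetical thresholds: the page does not say where "constrained"
# or "over-saturated" begins, so these numbers are placeholders.
CONSTRAINED_PCT = 80.0
OVER_PROVISIONED_PCT = 20.0


@dataclass
class ClusterUsage:
    """Cluster-wide utilization as read from a dashboard, in percent."""
    memory_pct: float
    cpu_pct: float


def recommend_node_action(usage: ClusterUsage) -> str:
    """Return 'add node', 'remove node', or 'no change'."""
    # Judge the cluster by its most heavily used resource.
    busiest = max(usage.memory_pct, usage.cpu_pct)
    if busiest >= CONSTRAINED_PCT:
        return "add node"
    if busiest <= OVER_PROVISIONED_PCT:
        return "remove node"
    return "no change"
```

Under these assumed thresholds, a reading such as 6% memory and 4% CPU would fall in the "remove node" branch, which corresponds to simulating a downed node.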

Current Cluster Configuration

See inventory at: https://github.com/nds-org/ndslabs-deploy-tools/commit/d8d8ef30dac74b1fe84185c7abc6136516d60e7b

Resulting Actions

1 hour group testing
1 hour writing new issues

Phase 1: Labs Workbench + Management

Workbench Version

  • 1.0.5

Participants

  • Craig Willis
  • David Raila
  • Mike Lambert

Measurement Utilities

Results

Prognosis

So far, aside from a few minor issues, everything is running super smoothly.

Peak usage was measured at:

  • 6% cluster memory usage
  • 3-4% cluster CPU usage

Nearly every available service was started at some point during roughly 2 hours of testing, and only 2 or 3 services encountered the notorious "no data available" problem, including:

  • PyCharm
  • Jenkins

Overall, this is fantastic news for the stability of the platform. The testing has also brought to light several issues that will need to be addressed.

Resulting Actions

Higher priority:

  • NDS-464
  • NDS-640
  • NDS-621
  • NDS-648
  • NDS-173

Lower priority:

  • NDS-646
  • NDS-647
  • NDS-645
  • NDS-644
  • NDS-649

Phase 2: Bug Party

Workbench Version

  • 1.0.6

Participants

  • Craig Willis
  • David Raila
  • Mike Lambert
  • Michal
  • Jing
  • Sandeep
  • Qiyue
  • Marcus

Measurement Utilities

Results

  • Michal: No indication of which fields are required for registration
  • Mike: I have a better UI design for the catalog to propose :X
  • David: Recommend whitelisting our site for / disabling pop-ups - can we detect this and make a recommendation to users without correct settings?
  • Michal: Couldn't sign up for DSpace - address in use
    • This is a more general problem with any service that generates admin credentials... user should be directed to the Config page
  • Jing: Docker image name validation is incomplete
    • Underscore should be among accepted characters
  • Mike: Saw a failure adding Sufia, only one time... next time it added properly
  • Jing: Custom service failed to start
  • Qiyue: How do we use different versions... for example: Cloud9 Java7 vs Cloud9 Java8
  • Qiyue: What is the storage quota? 20GB
  • Jing: Redis is missing an info link
  • Marcus: NDS Confluence went down, as a result icons could not load
  • Jing: Error messages are confusing - need translation
  • Michal: Would it be better to have pre-populated instances?
    • This would be nice, but may be difficult to handle programmatically in a general way
  • Qiyue: Any plans to support Fortran?
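
Jing's report above about image name validation (underscores being rejected) can be illustrated with a simplified check. This is a sketch, not the Workbench's actual validation code: it follows the commonly documented Docker rule that each repository path component is lowercase alphanumerics joined by `.`, `_`, `__`, or one or more `-`, and it deliberately ignores registry hosts with ports and digests. The function name `is_valid_image_name` and the sample names are hypothetical.

```python
import re

# One repository path component: lowercase alphanumerics, optionally
# joined by '.', '_', '__', or one or more '-'.
_COMPONENT = r"[a-z0-9]+(?:(?:[._]|__|-+)[a-z0-9]+)*"
# Optional tag: starts with a word character, up to 128 characters total.
_TAG = r"[A-Za-z0-9_][A-Za-z0-9_.-]{0,127}"
_IMAGE_RE = re.compile(rf"{_COMPONENT}(?:/{_COMPONENT})*(?::{_TAG})?")


def is_valid_image_name(name: str) -> bool:
    """Simplified check; registry host:port and @digest are not handled."""
    return _IMAGE_RE.fullmatch(name) is not None


# Hypothetical names used only for illustration.
print(is_valid_image_name("example/cloud9_java"))  # underscore inside a component
print(is_valid_image_name("_leading"))             # separator at the start
```

Under this rule an underscore is accepted between alphanumeric runs, but not at the start or end of a component.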

Prognosis

Resulting Actions
