Note |
---|
This page is out of date and does not reflect the current Labs Workbench architecture. |
Note: Also see architecture/design notes in within the source repo code under docs
Table of Contents
Table of Contents
...
Concepts and Terminology
- Infrastructure: The compute and storage resources in a cloud or infrastructure service (AWS, etc) that an NDS Labs cluster runs on. The NDS Labs reference architecture is OpenStack.
- Site: A site is an administrative organization that provides resources to and operates one or more NDS Labs clusters.
- Cluster/NDS Labs cluster: The NDS Labs software platform that runs on the infrastructure.
- Project/Namespace: An isolated, named environment within the cluster that contains a set of services that are managed and operated independently of other projects. Projects typically implement the equivalent of a "website".
- Administrator: An authenticated person that manages and operates a part of the system.
NDS Labs System - Roles and Responsibilities
- Infrastructure Administrator:
- Provisions infrastructure to run a NDS Labs cluster
- On OpenStack, AWS, GCE, Rackspace, MaaS, ...
- Deploys the NDS Labs base cluster software
- Registers resources from infrastructure with NDS Labs cluster resource pool
- Provides API and credential to Cluster Administrator
- Provisions infrastructure to run a NDS Labs cluster
- Cluster Administrator:
- Manages and operates the NDS Labs cluster infrastructure
- Manages Projects in the cluster
- Provisions Projects on the cluster
- Manages resource assignments from the cluster pool to project pools
- Provides API and credentials per-project to Project Administrators
- Project Administrator:
- Provisions and deploys services in a project using resources granted to the project pool by the cluster administrator.
- Manages, monitors, and administrates services within independent projects.
- User/Project User: A client/user of the services within a project.
- Tool/Service Provider: A NDS NDSC partner that provides a tool or service in a set of containers that include NDS Labs service descriptors to enable the service to be integrated in a NDS Labs cluster.
...
NDS Labs Architecture
NDSLabs NDS Labs extends the Kubernetes base system with NDS-specific services and REST API's that support NDS Labs cluster services, project services, inter-cluster NDS Labs services. The implementations of NDS Labs services are implemented via cluster-specific Kubernetes pods and sidekick containers that are deployed in conjunction with service pods in the cluster and in project-specific services that "extend" cluster-specific and project-specific pods with integration to services such a monitoring, volume management, etc.
- NDS Labs Cluster Services:
- API Manager: Manages cluster-wide API naming and public API exposure from the cluster public IP firewall/load-balancing system.
- CCD - Service Catalog MaintenanceCatalog, Configuration, and Deployment (CCD) Service: Automatically updated catalog of NDS Labs services available for deployment in the cluster (for cluster admins), and for projects (for project admins). The service catalog manager is configured with NDS Labs-specific container repositories, and periodically pulls service descriptions from the containers.
- CADM - Cluster Admin Project Administration (CADM): Provide the cluster administrator with project provisioning including project admin credentials. Provides management of infrastructure resources to projects, including volumes and managing differentiated compute resources.
- Cluster Administrator Monitoring (CMON) Tools/Service: Provides services for cluster administrators to monitor cluster operations, including logging, performance analysis, and resource utilization. Monitor services include ELK, Prometheus, etc. in addition to Kubernetes-provided tools like cadvisor.
- NDS Labs Project Services:
- PADM - Project Manager Administration (PADM): Allows the project manager to deploy, monitor, and manage application services within their project.
- Per-project Monitoring
- (PMON): Provides project-specific monitoring of project resources, utilization, performance, and application/service specific monitoring and logging.
- PADM - Project Manager Administration (PADM): Allows the project manager to deploy, monitor, and manage application services within their project.
- ICS InterInter-clusterCluster/integrated-cluster servicesIntegrated-Cluster Services (ICS): Provide NDS Labs web services across multiple distributed clusters in the larger NDS Labs context to implement global NDS Labs services such as global resource search, distributed data access, and provide distributed application developers services to implement service discovery and distributed API access within their services.
- Distributed search: Locating named data and services in the NDS Labs global system.
- Resource discovery: Locates attribute-specified resources in the NDS Labs global system, such as specifically sized data-storage resources, or specific compute resources such as HPC resources, or accelerator-enabled compute resources, for example.
- Advanaced Data Management - : Allows composing cross-cluster data management applications
High Level Global Architecture
...
High Level Global Arch
...
Kubernetes Base Beneath NDS
...
The NDS Labs system is comprised of NDS Labs cloud services that run on clusters at various sites. Clusters provide the resources to one or more projects comprised of a related set of cloud-based services targeted to a specific community or application. In a service-oriented model the site is equivalent to an IaaS provider, the cluster is equivalent to a PaaS provider, and a project is equivalent to a uniquely configured and deployed platform on the PaaS system. Specific NDS Labs services are implemented within each layer that assist with convenient deployment and operation of the PaaS and platform layers. Global distributed data services such as search across all NDS Labs sites and projects will be provided in the Inter-cluster system (ICS) that will provide infrastructure building blocks for implementing wide area cross-cluster services.
Gliffy Diagram | ||||||
---|---|---|---|---|---|---|
|
Single Cluster Architecture Diagrams
Layer 0 - Single Cluster Infrastructure
The NDS Labs reference infrastructure is OpenStack. A cluster begins with a cluster of 6 OpenStack VMs. The cluster admin can add additional compute nodes as-needed based on dynamic demand.
Gliffy Diagram | ||||||
---|---|---|---|---|---|---|
|
Layer 1 - Kubernetes Container and Service Orchestration Layer
The initial 6-VM system is provisioned as a CoreOS cluster with 3 serving as etcd masters (not shown for simplicity), and 3 serving as the initial Kubernetes infrastructure, with the following architecture. additional compute resources can be added to the kubernetes cluster as demand requires.
Layer 2 - Single-Cluster Detail - NDS Labs Services and APIs
The NDS Labs architecture layer services on top of kubernetes that implement the NDS Labs services to manage and monitor the cluster, provision and manage resources for projects in the cluster, and provide project managers the ability to manage software service stacks within their project. The NDS Labs services leverage the facilities of the underlying kubernetes cluster orchestration system and the etcd system for managing configuration and state information of the services.
Gliffy Diagram | ||||||
---|---|---|---|---|---|---|
|
API/Service Catalog
Service | APIs | UsedBy |
Service/Component/Role Matrix with Descriptions
Stage of Development Color Key: | Completed | In Development | In Design | Future |
Service | Component | Planning Notes | Role/Use | ||||
Infrastructure Admin | Cluster Admin | Project Admin | Tool Developer | System Service | |||
---|---|---|---|---|---|---|---|
Developers Environment https://github.com/nds-org/ndslabs-developer-shell | Kubernetes Devenv Host-node network IPaddrs |
| NA | NA | Test project deploy | Test tools | NA |
Kubernetes Devenv w/ External firewall IPaddrs |
| NA | NA | Test project with proper public interface | Develop to proper external interface | NA | |
Container build support Makefiles |
| NA | NA | NA | NA | ||
OpenStack | Production Cluster Deploy |
| Deploy Cluster Infrastructure | NA | NA | NA | NA |
Volume Interface Service |
| Provide vol resources | Allocate vol resources to projects | Implicit use of auto-named vols | NA | Register/track resources | |
Catalog, Configure, Deploy (CCD) | Service catalog Admin Interface (CATADM) |
| NA | Admin Catalogs - register catalog URLS | NA | Publish service Needs service format | |
Update local service catalog from configured catalogs (CATSVC) |
| NA | NA | NA | NA | Pull catalogs maintain in etcd | |
Project Deploy CLI Deploy service stacks in project |
| NA | NA | Deploy named service stacks in project | NA | Uses kubernetes API | |
Project Deploy GUI/Server Web deploy tool on CLI (CCDSRV) |
| NA | NA | Web configurator and deploy | Use to test newly developed tools | NA | |
Project Service Monitor (PMON) |
| NA | NA | CCD gui | NA | NA | |
Cluster Administator/Ops (CADM) | Cluster Monitor (CMON) |
| NA | Monitor cluster health and performance | NA | NA | NA |
Inter-Cluster Services (ICS) | Search |
| NA | Registration | Register data resources | relevant for developing search interfaces/tools | External interface to cluster Distributed global service |
Registration Cluster registration/federation | Needs development | NA | Global registration | Project resource registration | ?? | Local and global distributed service |
...