Overview

The Container Analysis Environments Workshop was sponsored by NDS and DataExpLab to bring together groups that use container technology in research computing and in providing access to data analysis. The groups represented a wide variety of projects, including Blue Waters, LIGO, LSST/DES, CyVerse, TERRA-REF, SciServer, Whole Tale, yt.Hub, NDS Labs, TACC (Agave, BioContainers), SDSC (JupyterHub), XSEDE Gateways, CyberGIS, CRC, and Brown Dog.

Following a set of presentations, the group was asked to prioritize a list of topics for breakout groups and deep-dive discussions. The topics were discussed in the following order:

  1. Integration with HPC environments
    1. Singularity/Shifter
    2. Launching jobs with data from interactive environments
    3. Agave API
  2. Shared storage across containers
    1. How groups are supporting data access
    2. Performance/reliability/scalability
    3. Deep dive on Whole Tale data management architecture
    4. Security (e.g., HIPAA compliance)
  3. Archiving/management/preservation of images
    1. Centralized registry for Docker and Singularity images used in research environments
    2. Best practices for image preservation
    3. Allowing users to dynamically compose images
  4. Interactive analysis environments
    1. Figuring out "gotchas": how are people solving problems? What do they include? Jupyter vs. RStudio vs. X environments
    2. Profiling load/capping access
  5. Opportunities for interoperability/collaboration between systems
  6. Container orchestration (scheduling) systems

Several topics, including workflow systems and authentication, could not be discussed in detail due to a lack of expertise.

Major takeaways from the workshop include:


Actions: