Collection: A user defined group of data setsdatasets.
Comment:
High level information associated with a file or
data set dataset left by users.
Content Based Retrieval:
A means of indexing collections of data where instead of indexing by text or keywords items are indexed by signatures and users query the collection with example files in order to retrieve files with similar contents.Content Management System (CMS)::
A system used to store, manage, and curate collections of files and datasets.Data Extraction: A transformation that creates new data from the given data. An example would be the execution of analysis code on an image file's contents to determine if a face is in the iamgeimage. Clowder utilizes extractions to automatically generate metadata, signatures, and previews from a file's contents and provide users with means of finding, relating, and utilizing data that may be difficult otherwise.
Data SetDataset:
A A group of files that through some defined relationship or corresponding metadata are strongly tied together and not representable otherwise by the individual files.
...
Metadata:
Simply data about data
(e.g. tags or keywords).. Available on datasets and individual files.Preview: Derived data from a file or data set which is easier for a user to view, perhaps over the web where Special representation of a dataset or a file used by a previewer to visualize information about the dataset or file on the web. Often used to provide a smaller version of a dataset or file when bandwidth is a consideration.
Section: A subset of a files contents (e.g. a sub-image, a line from a document, a frame from a video, etc...). A sections is tied to a file.
Signature:
A typically numerical representation for some semantic aspect of a files contents. This can be thought of as a hash of the files contents. Various means of generating these signatures are typically available and focus on different aspects of a files data (e.g. color distributions in an image vs edge distributions). Signatures are used in content based retrieval to index and find similar data to a given example. Space: A group of collections, data sets, and files with defined user access rights.
...
Technical Metadata:
Anchor |
---|
| technical_metadata |
---|
| technical_metadata |
---|
|
Automatically generated
tags, signatures, or previews produced from metadata produced by the system via extractors.
User Metadata: Anchor |
---|
| user_metadata |
---|
| user_metadata |
---|
|
Tags or comments Metadata associated with file or data setdataset, entered by a human user.
...