...
Domain | Tool | Type | Description | Code | Py1/Py2 | JSON-LD | DCKR | Test File | TC | DPL | DL | Assignment | |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
General | 1 | Calibre (ebook-converter) | Converter | Convert e-books to a number of document formats. | N/A | Yes | Yes | Yes | Yes | N/A | |||
2 | CMU Sphinx (ncsa.audio.speech2text) | Extractor | Audio recognition to extract text for speech within audio. | Yes | Yes | | | ||||||
3 | Daffodil (daffodil) | Converter | Convert formats with a provided DFDL schema to XML. | N/A | Yes | Yes | N/A | ||||||
4 | DBPedia (ncsa.dbpedia) | Extractor | Find and define named entities within the given text. | Yes | | | | ||||||
5 | FFmpeg (ffmpeg) | Converter | Convert between a large number of video formats. | N/A | Yes | | N/A | Bing Zhang | |||||
6 | FLAC (flac) | Converter | Convert to and from the FLAC format from other audio formats. | N/A | Yes | Yes | N/A | Bing Zhang | |||||
7 | Ghostscript (ghostscript) | Converter | Convert between document formats. | N/A | Yes | Yes | N/A | Bing Zhang | |||||
8 | htmldoc (htmldoc) | Converter | Convert HTML to a number of document formats. | N/A | Yes | | N/A | ||||||
9 | ImageMagick (ImageMagick) | Converter | Convert between a large number of image formats. | N/A | Yes | Yes | N/A | ||||||
10 | ImageMagick (ncsa.image.metadata) | Extractor | Pull available EXIF image metadata from a given image. | Yes | Yes | No | | | |||||
11 | Kabeja (kabeja) | Converter | Convert between a handful of 3D and image formats. | N/A | Yes | Yes | N/A | ||||||
12 | OpenCV - Faces (ncsa.cv.faces) | Extractor | Find faces in an image and return their locations. | Yes | Yes | Yes | Yes | | |||||
13 | OpenCV - Eyes (ncsa.cv.eyes) | Extractor | Find eyes in an image and return their locations. | Yes | Yes | Yes | Yes | | |||||
14 | OpenCV - Closeups (ncsa.cv.closeups) | Extractor | Determine whether an image is a closeup of a person or not. | Yes | Yes | Yes | Yes | | |||||
15 | OpenCV - Profiles (ncsa.cv.profiles) | Extractor | Find human face profiles in an image and return their locations. | Yes | Yes | Yes | Yes | | |||||
16 | Langid (ncsa.nlp.simplelanguage) | Extractor | Identify the language of the given text. | Yes | Py2 | Yes | Yes | Yes | Yes | needs push | ? | ||
17 | LibreOffice (unoconv) | Converter | Convert to and from a variety of document formats. | N/A | Yes | Yes | N/A | Bing Zhang | |||||
18 | NLTK - Summary (ncsa.nlp.simplesummary) | Extractor | Summarize a body of text. | Yes | Py2 | Yes | Yes | Yes | Yes | needs push | ? | ||
19 | Siegfried (siegfried) | Extractor | Extract information about a given file relevant to identifying its type and validating its format. | Yes | Py2 | Yes | Yes | Yes | Yes | needs push | ? | ||
20 | Stanford CoreNLP (ncsa.nlp.SNLP) | Extractor | Natural language processing extractions such as parts of speech, named entities, language, etc. | Py1 Java equiv. | No | No | No | Yes | ? | ? | |||
21 | Tesseract (ncsa.image.ocr) | Extractor | Optical Character Recognition (OCR) to extract text from images containing text. | Yes | Yes | Yes | | ||||||
22 | Tika (ncsa.nlp.tika) | Extractor | Document extractions such as language identification, ... | Yes | | | | ||||||
23 | txt2html (txt2html) | Converter | Convert text documents to HTML. | N/A | Yes | | N/A | ||||||
24 | Versus - Color Distribution (ncsa.versus.image) | Extractor | Generate a distribution of color values within an image to be used for comparing how similar two images are. | Yes | Yes | | | Bing Zhang | |||||
25 | VLFeat (ncsa.image.caltech101) | Extractor | Classify images as to whether they contain objects from the Caltech101 dataset (e.g. people, airplanes, motorcycles, cougars, ...). | Yes | Py1 | Yes | Yes | Yes | Yes | Yes | | ||
26 | Zip (zip) | Converter | Unzip zip archives. | N/A | Yes | Yes | N/A | Bing Zhang |
...