Domain | Tool | Type | Description | Repositories | Contact | TC? | Deployed? |
---|---|---|---|---|---|---|---|
General | Calibre (ebook-converter) | Converter | Convert e-books to a number of document formats. | https://opensource.ncsa.illinois.edu/bitbucket/projects/POL/repos/converters-ebook-convert/browse | | | |
General | CMU Sphinx (ncsa.audio.speech2text) | Extractor | Speech recognition to extract text from speech within audio. | https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-core/browse/audio/speech2text | | | |
General | Daffodil (daffodil) | Converter | Convert formats with a provided DFDL schema to XML. | https://opensource.ncsa.illinois.edu/bitbucket/projects/POL/repos/converters-daffodil/browse | Kenton McHenry | | |
General | DBPedia (ncsa.dbpedia) | Extractor | Find and define named entities within the given text. | https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-dbpedia/browse | | | |
General | FFmpeg (ffmpeg) | Converter | Convert between a large number of video formats. | https://opensource.ncsa.illinois.edu/bitbucket/projects/POL/repos/converters-ffmpeg/browse | | | |
General | FLAC (flac) | Converter | Convert between FLAC and other audio formats. | https://opensource.ncsa.illinois.edu/bitbucket/projects/POL/repos/converters-flac/browse | | | |
General | Ghostscript (ghostscript) | Converter | Convert between document formats. | https://opensource.ncsa.illinois.edu/bitbucket/projects/POL/repos/converters-ghostscript/browse | | | |
General | htmldoc (htmldoc) | Converter | Convert HTML to a number of document formats. | https://opensource.ncsa.illinois.edu/bitbucket/projects/POL/repos/converters-htmldoc/browse | Kenton McHenry | | |
General | ImageMagick (ImageMagick) | Converter | Convert between a large number of image formats. | https://opensource.ncsa.illinois.edu/bitbucket/projects/POL/repos/converters-imagemagick/browse | Kenton McHenry | | |
General | ImageMagick (ncsa.image.metadata) | Extractor | Pull available EXIF image metadata from a given image. | https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-core/browse/image/metadata | | | |
General | Kabeja (kabeja) | Converter | Convert between a handful of 3D and image formats. | https://opensource.ncsa.illinois.edu/bitbucket/projects/POL/repos/converters-kabeja/browse | | | |
General | OpenCV - Faces (ncsa.cv.faces) | Extractor | Find faces in an image and return their locations. | https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-cv/browse/extractors-opencv/extractors-opencv-faces | Kenton McHenry | | |
General | OpenCV - Eyes (ncsa.cv.eyes) | Extractor | Find eyes in an image and return their locations. | https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-cv/browse/extractors-opencv/extractors-opencv-eyes | Kenton McHenry | | |
General | OpenCV - Closeups (ncsa.cv.closeups) | Extractor | Determine whether an image is a closeup of a person. | https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-cv/browse/extractors-opencv/extractors-opencv-closeups | Kenton McHenry | | |
General | OpenCV - Profiles (ncsa.cv.profiles) | Extractor | Find human face profiles in an image and return their locations. | https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-cv/browse/extractors-opencv/extractors-opencv-profiles | Kenton McHenry | | |
General | Langid (ncsa.nlp.simplelanguage) | Extractor | Identify the language of the given text. | https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-nlp/browse/SimpleLanguage | | | |
General | LibreOffice (unoconv) | Converter | Convert to and from a variety of document formats. | https://opensource.ncsa.illinois.edu/bitbucket/projects/POL/repos/converters-unoconv/browse | | | |
General | NLTK - Summary (ncsa.nlp.simplesummary) | Extractor | Summarize a body of text. | https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-nlp/browse/SimpleSummary | | | |
General | Siegfried (siegfried) | Extractor | Extract information about a given file relevant to identifying its type and validating its format. | https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-siegfried/browse | | | |
General | Stanford CoreNLP (ncsa.nlp.SNLP) | Extractor | Natural language processing extractions such as parts of speech, named entities, language, etc. | https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-nlp/browse/SNLP/SNLPExtractor | | | |
General | Tesseract (ncsa.image.ocr) | Extractor | Optical Character Recognition (OCR) to extract text from images containing text. | https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-cv/browse/extractors-tesseract | Kenton McHenry | | |
General | Tika (ncsa.nlp.tika) | Extractor | Document extractions such as language identification, ... | https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-nlp/browse/tika | | | |
General | txt2html (txt2html) | Converter | Convert text documents to HTML. | https://opensource.ncsa.illinois.edu/bitbucket/projects/POL/repos/converters-txt2html/browse | Kenton McHenry | | |
General | Versus - Color Distribution (ncsa.versus.image) | Extractor | Generate a distribution of color values within an image to be used for comparing how similar two images are. | https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-versus/browse | | | |
General | VLFeat (ncsa.image.caltech101) | Extractor | Classify images by whether they contain objects from the Caltech101 dataset (e.g. people, airplanes, motorcycles, cougars, ...). | https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-cv/browse/extractors-vlfeat | Kenton McHenry | | |
General | Zip (zip) | Converter | Unzip zip archives. | https://opensource.ncsa.illinois.edu/bitbucket/projects/POL/repos/converters-zip/browse | Kenton McHenry | | |