Scientific
General
Tool | Type | Description | Repositories | Contact | |||
---|---|---|---|---|---|---|---|
Siegfried (sigfried CMU Sphinx (ncsa.audio.speech2text) | ExtractorExtract | information about a given file relevant to identifying its type and validating its formatAudio recognition to extract text for speech within audio. | https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-siegfriedcore/browse/audio/speech2text | ||||
DBPedia (ncsa.dbpedia) | Extractor | Find and define named entities within the given text. | https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-dbpedia/browse | ||||
Tesseract (ncsa.image.ocr) | Extractor | Object Character Recognition (OCR) to extract text from images containing text. | https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-cv/browse/extractors-tesseract | Kenton McHenry | |||
OpenCV Faces (ncsa.cv.faces) | Extractor | Find faces in an image and return their locations. | https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-cv/browse/extractors-opencv/extractors-opencv-faces | Kenton McHenry | |||
OpenCV Eyes (ncsa.cv.eyes) | Extractor | Find eyes in an image and return their locations. | https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-cv/browse/extractors-opencv/extractors-opencv-eyes | Kenton McHenry | |||
OpenCV Closeups (ncsa.cv.closeups) | Extractor | Determine whether an image is a closeup of a person or not. | https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-cv/browse/extractors-opencv/extractors-opencv-closeups | Kenton McHenry | |||
OpenCV Profiles (ncsa.cv.profiles) | Extractor | Find human face profiles in an image and return their locations. | https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-cv/browse/extractors-opencv/extractors-opencv-profiles | Kenton McHenry | |||
VLFeat (ncsa.image.caltech101) | Extractor | Classify images as to whether they contain objects from the Caltech101 dataset (e.g. people, airplanes, motorcycles, cougars, ...). | https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-cv/browse/extractors-vlfeat | Kenton McHenry | CMU Sphinx (ncsa.audio.speech2text) | Extractor | Audio recognition to extract text for speech within audio. |
https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-core/browse/audio/speech2text | Langid (ncsa.nlp.simplelanguage) | Extractor | Identify the language of the given text. | https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-nlp/browse/SimpleLanguage | |||
NLTK Summary (ncsa.nlp.simplesummary) | Extractor | Summarize a body of text. | https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-nlp/browse/SimpleSummary | Versus Color Distribution (ncsa.versus.image | |||
Siegfried (sigfried) | Extractor | Generate a distribution of color values within an image to be used for comparing how similar two images areExtract information about a given file relevant to identifying its type and validating its format. | https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors- | versussiegfried/browse | Kenton McHenry|||
Stanford CoreNLP (ncsa.nlp.SNLP) | Extractor | Natural Language Process extractions such as parts of speech, named entities, langauge, etc. | https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-nlp/browse/SNLP/SNLPExtractor | ||||
Tika (ncsa.nlp.tika) | Extractor | Document extractions such as language identification, ... | https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-nlp/browse/tika | ||||
Versus Color Distribution (ncsa.versus.image) | Extractor | Generate a distribution of color values within an image to be used for comparing how similar two images are. | https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-versus/browse | ||||
ImageMagick (ncsa.image.metadata) | Extractor | Pull available EXIF image metadata from a given image. | https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-core/browse/image/metadata |
...