...
Tool | Type | Description | Repositories | Contact |
---|---|---|---|---|
CMU Sphinx (ncsa.audio.speech2text) | Extractor | Speech recognition to extract text from speech within audio. | https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-core/browse/audio/speech2text | |
DBPedia (ncsa.dbpedia) | Extractor | Find and define named entities within the given text. | https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-dbpedia/browse | |
ImageMagick (ncsa.image.metadata) | Extractor | Pull available EXIF image metadata from a given image. | https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-core/browse/image/metadata | Rob Kooper |
OpenCV Faces (ncsa.cv.faces) | Extractor | Find faces in an image and return their locations. | https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-cv/browse/extractors-opencv/extractors-opencv-faces | Kenton McHenry |
OpenCV Eyes (ncsa.cv.eyes) | Extractor | Find eyes in an image and return their locations. | https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-cv/browse/extractors-opencv/extractors-opencv-eyes | Kenton McHenry |
OpenCV Closeups (ncsa.cv.closeups) | Extractor | Determine whether an image is a closeup of a person or not. | https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-cv/browse/extractors-opencv/extractors-opencv-closeups | Kenton McHenry |
OpenCV Profiles (ncsa.cv.profiles) | Extractor | Find human face profiles in an image and return their locations. | https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-cv/browse/extractors-opencv/extractors-opencv-profiles | Kenton McHenry |
Langid (ncsa.nlp.simplelanguage) | Extractor | Identify the language of the given text. | https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-nlp/browse/SimpleLanguage | |
NLTK Summary (ncsa.nlp.simplesummary) | Extractor | Summarize a body of text. | https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-nlp/browse/SimpleSummary | |
Siegfried (sigfried) | Extractor | Extract information about a given file relevant to identifying its type and validating its format. | https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-siegfried/browse | |
Stanford CoreNLP (ncsa.nlp.SNLP) | Extractor | Natural language processing extractions such as parts of speech, named entities, language, etc. | https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-nlp/browse/SNLP/SNLPExtractor | |
Tesseract (ncsa.image.ocr) | Extractor | Optical Character Recognition (OCR) to extract text from images containing text. | https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-cv/browse/extractors-tesseract | Kenton McHenry |
Tika (ncsa.nlp.tika) | Extractor | Document extractions such as language identification, ... | https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-nlp/browse/tika | |
Versus Color Distribution (ncsa.versus.image) | Extractor | Generate a distribution of color values within an image to be used for comparing how similar two images are. | https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-versus/browse | |
VLFeat (ncsa.image.caltech101) | Extractor | Classify images as to whether they contain objects from the Caltech101 dataset (e.g. people, airplanes, motorcycles, cougars, ...). | https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-cv/browse/extractors-vlfeat | Kenton McHenry |
https://opensource.ncsa.illinois.edu/confluence/display/BD/Extractors