

Maxwell Burnette

Rob Kooper

| Tool | Type | Description | Repository | Contact |
|---|---|---|---|---|
| CMU Sphinx (ncsa.audio.speech2text) | Extractor | Speech recognition to extract text from speech within audio. | https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-core/browse/audio/speech2text | |
| DBPedia (ncsa.dbpedia) | Extractor | Find and define named entities within the given text. | https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-dbpedia/browse | |
| ImageMagick (ncsa.image.metadata) | Extractor | Pull available EXIF image metadata from a given image. | https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-core/browse/image/metadata | |
| OpenCV Faces (ncsa.cv.faces) | Extractor | Find faces in an image and return their locations. | https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-cv/browse/extractors-opencv/extractors-opencv-faces | Kenton McHenry |
| OpenCV Eyes (ncsa.cv.eyes) | Extractor | Find eyes in an image and return their locations. | https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-cv/browse/extractors-opencv/extractors-opencv-eyes | Kenton McHenry |
| OpenCV Closeups (ncsa.cv.closeups) | Extractor | Determine whether an image is a closeup of a person. | https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-cv/browse/extractors-opencv/extractors-opencv-closeups | Kenton McHenry |
| OpenCV Profiles (ncsa.cv.profiles) | Extractor | Find human face profiles in an image and return their locations. | https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-cv/browse/extractors-opencv/extractors-opencv-profiles | Kenton McHenry |
| VLFeat (ncsa.image.caltech101) | Extractor | Classify images by whether they contain objects from the Caltech101 dataset (e.g. people, airplanes, motorcycles, cougars, ...). | https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-cv/browse/extractors-vlfeat | Kenton McHenry |
| Langid (ncsa.nlp.simplelanguage) | Extractor | Identify the language of the given text. | https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-nlp/browse/SimpleLanguage | |
| NLTK Summary (ncsa.nlp.simplesummary) | Extractor | Summarize a body of text. | https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-nlp/browse/SimpleSummary | |
| Siegfried (sigfried) | Extractor | Extract information about a given file that is relevant to identifying its type and validating its format. | https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-siegfried/browse | |
| Stanford CoreNLP (ncsa.nlp.SNLP) | Extractor | Natural language processing extractions such as parts of speech, named entities, language, etc. | https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-nlp/browse/SNLP/SNLPExtractor | |
| Tesseract (ncsa.image.ocr) | Extractor | Optical Character Recognition (OCR) to extract text from images containing text. | https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-cv/browse/extractors-tesseract | Kenton McHenry |
| Tika (ncsa.nlp.tika) | Extractor | Document extractions such as language identification, ... | https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-nlp/browse/tika | |
| Versus Color Distribution (ncsa.versus.image) | Extractor | Generate a distribution of color values within an image, used to compare how similar two images are. | https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-versus/browse | |


https://opensource.ncsa.illinois.edu/confluence/display/BD/Extractors