Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

Domain 

Tool

TypeDescription                                     CodePy1/Py2JSON-LDDCKRTest FileTCDPLDLAssignment

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

General

1Calibre (ebook-converter)ConverterConvert e-books to a number of document formats.YesN/AN/AYesYes

Yes
Yes

No

YesNo
2

CMU Sphinx (ncsa.audio.speech2text)

ExtractorAudio recognition to extract text for speech within audio.Yes?   

 Yes

 

 

3Daffodil (daffodil)ConverterConvert formats with a provided DFDL schema to XML.Yes    

 Yes

 Yes

Yes
4DBPedia (ncsa.dbpedia)ExtractorFind and define named entities within the given text.

 Yes

Py1YesYesYes

 No

 No

 

5FFmpeg (ffmpeg)ConverterConvert between a large number of video formats.YesN/AN/AYesYes

 Yes

 Yes?No

Yes
No
6FLAC (flac)ConverterConvert to and from the FLAC format from other audio formats.YesN/AN/AYesYes

 Yes Yes

No

YesBing Zhang
7Ghostscript (ghostscript)ConverterConvert between document formats.YesN/AN/AYesYes

 Yes Yes

No

YesNoBing Zhang
8htmldoc (htmldoc)ConverterConvert HTML to a number of document formats.YesN/AN/AYesYes

 Yes

 Yes?No

Yes
No
9ImageMagick (ImageMagick)ConverterConvert between a large number of image formats.YesN/AN/AYesYes

 Yes

 No

No
10ImageMagick (ncsa.image.metadata)ExtractorPull available EXIF image metadata from a given image.

 Yes

Py1YesYesNo

 No

 No

 No

11Kabeja (kabeja)ConverterConvert between a handful of 3D and image formats.Yes    

 Yes

 Yes

 
12

OpenCV - Faces (ncsa.cv.faces)

ExtractorFind faces in an image and return their locations.

 Yes

 Yes  

 Yes

 Yes

 

13

OpenCV - Eyes (ncsa.cv.eyes)

ExtractorFind eyes in an image and return their locations.

 Yes

 Yes  

 Yes

Yes

 

14

OpenCV - Closeups (ncsa.cv.closeups)

ExtractorDetermine whether an image is a closeup of a person or not.

 Yes

 Yes  

 Yes

 Yes

 

15

OpenCV - Profiles (ncsa.cv.profiles)

ExtractorFind human face profiles in an image and return their locations.

 Yes

 Yes  

Yes

 Yes

 

16Langid (ncsa.nlp.simplelanguage)ExtractorIdentify the language of the given text.

 Yes

Py2YesYesYes

 Yes

 No

No

17LibreOffice (unoconv)ConverterConvert to and from a variety of document formats.YesN/AN/AYesYes

 Yes Yes

No

YesNoBing Zhang
18NLTK - Summary (ncsa.nlp.simplesummary)ExtractorSummarize a body of text.

 Yes

Py2Yes

Yes

Yes

 Yes

No

 No

19Siegfried (siegfried)ExtractorExtract information about a given file relevant to identifying its type and validating its format.

 Yes

Py2YesYesYes

 Yes

No

 No

20Stanford CoreNLP (ncsa.nlp.SNLP)ExtractorNatural Language Process extractions such as parts of speech, named entities, langauge, etc.

 Yes

Py1 Java equiv.NoNoNo

 No

 No

 No

21Tesseract (ncsa.image.ocr)ExtractorObject Character Recognition (OCR) to extract text from images containing text.

 Yes

Py1YesYesYes

 Yes

 Yes

Yes 

22Tika (ncsa.nlp.tika)ExtractorDocument extractions such as language identification, ...

 Yes

older Py1YesYesYesNo

 

No

 

Yes

 

23txt2html (txt2html)ConverterConvert text documents to HTML.YesN/AN/AYesYes

Yes
 Yes?

No

Yes
24Versus - Color Distribution (ncsa.versus.image)ExtractorGenerate a distribution of color values within an image to be used for comparing how similar two images are.

 Yes

Py1N/AYesYes

 Yes

 Yes?

 Yes

25

VLFeat (ncsa.image.caltech101)

ExtractorClassify images as to whether they contain objects from the Caltech101 dataset (e.g. people, airplanes, motorcycles, cougars, ...).

 Yes

Py1yesyesyes

 Yes

 yes?

 

26Zip (zip)ConverterUnzip zip archives.YesN/AN/AYesYes

 Yes

NoYes

NoBing Zhang

https://opensource.ncsa.illinois.edu/confluence/display/BD/Transformations (Under development, Being Refactored)

...