Page History

...

Domain		Tool	Type	Description	Code	Py1/Py2	JSON-LD	DCKR	Test File	TC	DPL	DL	Assignment
General	1	Calibre (ebook-converter)	Converter	Convert e-books to a number of document formats.	N/A			Yes	Yes	Yes	Yes	N/A	Bing Zhang
	2	CMU Sphinx (ncsa.audio.speech2text)	Extractor	Audio recognition to extract text for speech within audio.	Yes					Yes			Marcus Slavenas
	3	Daffodil (daffodil)	Converter	Convert formats with a provided DFDL schema to XML.	N/A					Yes	Yes	N/A	Kenton McHenry
	4	DBPedia (ncsa.dbpedia)	Extractor	Find and define named entities within the given text.	Yes								Luigi Marini
	5	FFmpeg (ffmpeg)	Converter	Convert between a large number of video formats.	N/A					Yes		N/A	Bing Zhang
	6	FLAC (flac)	Converter	Convert to and from the FLAC format from other audio formats.	N/A					Yes	Yes	N/A	Bing Zhang
	7	Ghostscript (ghostscript)	Converter	Convert between document formats.	N/A					Yes	Yes	N/A	Bing Zhang
	8	htmldoc (htmldoc)	Converter	Convert HTML to a number of document formats.	N/A					Yes		N/A	Bing Zhang
	9	ImageMagick (ImageMagick)	Converter	Convert between a large number of image formats.	N/A					Yes	Yes	N/A	Inna Zharnitsky
	10	ImageMagick (ncsa.image.metadata)	Extractor	Pull available EXIF image metadata from a given image.	Yes		Yes			No			Inna Zharnitsky
	11	Kabeja (kabeja)	Converter	Convert between a handful of 3D and image formats.	N/A					Yes	Yes	N/A	Kenton McHenry
	12	OpenCV - Faces (ncsa.cv.faces)	Extractor	Find faces in an image and return their locations.	Yes		Yes			Yes	Yes		Kenton McHenry
	13	OpenCV - Eyes (ncsa.cv.eyes)	Extractor	Find eyes in an image and return their locations.	Yes		Yes			Yes	Yes		Kenton McHenry
	14	OpenCV - Closeups (ncsa.cv.closeups)	Extractor	Determine whether an image is a closeup of a person or not.	Yes		Yes			Yes	Yes		Kenton McHenry
	15	OpenCV - Profiles (ncsa.cv.profiles)	Extractor	Find human face profiles in an image and return their locations.	Yes		Yes			Yes	Yes		Kenton McHenry
	16	Langid (ncsa.nlp.simplelanguage)	Extractor	Identify the language of the given text.	Yes	Py2	Yes	Yes	Yes	Yes	needs push	?	Gregory Jansen
	17	LibreOffice (unoconv)	Converter	Convert to and from a variety of document formats.	N/A					Yes	Yes	N/A	Bing Zhang
	18	NLTK - Summary (ncsa.nlp.simplesummary)	Extractor	Summarize a body of text.	Yes	Py2	Yes	Yes	Yes	Yes	needs push	?	Gregory Jansen
	19	Siegfried (siegfried)	Extractor	Extract information about a given file relevant to identifying its type and validating its format.	Yes	Py2	Yes	Yes	Yes	Yes	needs push	?	Gregory Jansen
	20	Stanford CoreNLP (ncsa.nlp.SNLP)	Extractor	Natural Language Process extractions such as parts of speech, named entities, langauge, etc.	Yes	Py1 Java equiv.	No	No	No	Yes	?	?	Gregory Jansen
	21	Tesseract (ncsa.image.ocr)	Extractor	Object Character Recognition (OCR) to extract text from images containing text.	Yes					Yes	Yes		Bing Zhang
	22	Tika (ncsa.nlp.tika)	Extractor	Document extractions such as language identification, ...	Yes								Gregory Jansen
	23	txt2html (txt2html)	Converter	Convert text documents to HTML.	N/A					Yes		N/A	Bing Zhang
	24	Versus - Color Distribution (ncsa.versus.image)	Extractor	Generate a distribution of color values within an image to be used for comparing how similar two images are.	Yes					Yes			Bing Zhang
	25	VLFeat (ncsa.image.caltech101)	Extractor	Classify images as to whether they contain objects from the Caltech101 dataset (e.g. people, airplanes, motorcycles, cougars, ...).	Yes	Py1	yes	yes	yes	Yes	yes		Yan Zhao
	26	Zip (zip)	Converter	Unzip zip archives.	N/A					Yes	Yes	N/A	Bing Zhang

...

Page tree

Versions Compared

Old Version 52

New Version 53

Key