Page History

5GDAL (ncsa.geo.tiffExtractor)Extractor https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-geo/browse

N/A

Yes

6GDAL (ncsa.image.geotiff)Extractor https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-geotiff/browse

N/A

Create a Geoserver layer for WMS service. It can be used in previewer

Yes

Py1

Yes

No

Yes

No

5

GeoTiff Extractor by using GDAL (ncsa.geotiff.preview)

Extractor

Create a Geoserver layer for WMS service. It can be used in previewer

Yes

Py1

Yes

No

Yes

No

https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-cv/browse/river

6

GeoTiff Metadata Extractor by using GDAL (Another implementation) (ncsa.geotiff.metadata)

Extractor

Extract geospatial metadata from Geotiff. It should be combined with #5 Geotiff extractor.

Yes

Py1

Yes

No

Jong Lee
7	Historical River Extractor (ncsa.cv.river)	Extractor	Extract the river networks from the ancient hand-drawing maps and compare them with current river networks

Yes

Py1

Yes

https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-bd-cz/browse/ndviextractor

No	Sandeep Puthanveetil Satheesan
8	Normalized Difference Vegetation Index (ncsa.arcgis.landsat7mosaic)	Extractor

Create a NDVI layer from Landsat data. Calculating vegetation indices such as NDVI and Surface Temperature from Landsat 7 and 8 satellite data.

Yes

Py1

Yes

N/A

(Windows,

ArcGIS)

Yes

No

Yes

(Windows VM)

No

https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-bd-cz/browse/chi-analysis


9	River Chi Index (ncsa.chi_analysis)	Extractor	Identify the river dynamics in a river basin and evaluate human activities' influences through Chi index in the streams.

Yes

Py1

Yes

Yes	Sandeep Puthanveetil Satheesan

10

River Sinuosity

GDAL Converter

ExtractorStudy the maturity and equilibrium conditions of a stream through the

Converter	Converts between TIFF and BIL (zip) and FLT (zip)	Yes	N/A	N/A	Yes	Yes	Yes	Yes	Yes	Sandeep Puthanveetil Satheesan
11	River Sinuosity	Extractor	Study the maturity and equilibrium conditions of a stream through the sinuosity index.

Yes

11Soil Moisture ChangeExtractor

Esther Lee (estherl2010@gmail.com)

Qina supposed to be delivering code - Dr K thinks it should make it in Beta
12	Soil Moisture Change	Extractor	Determine role of hydraulic redistribution in AZ (riparian site / upland site) by studying soil moisture change throughout different seasons.	No

12

Per Dr. K - will need discussed and can not be in Beta
13	Species Classifier	Extractor	SAM based Species Classification from Hyperspectral data, Hyperspectral Indices, NDVI, SAVI, MSAVI, etc.

Yes

Debsunder Dutta (debsunderdutta@gmail.com) leaving in July 2016

https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-bd-cz/browse/terex_floodplain

13

Dr K does not think this will make it in Beta
14	Floodplain Extractor by using TerEx (

TerEx (

ncsa.arcgis.floodplain)

Extractor

Identify the flat polygons and the heights inside a river valley.

N/A

Yes (but procedure?)

Py1

Yes

No (windows)

Yes

No

Yes

(Windows)

no

15

14

Topographic Depressions

Extractor

Identify topographic depressions (TDs) and their distribution on landscape (Number, location, area, volume of TDs).

Yes - ready to review

Phong Le

1516Valley Safety ZonesExtractorEstimate submerging areas and water depths under extreme floods and map the safety zones in a river valley.

16	Tree Delineation

ConverterTree-wise voxelization of waveform data for lidar metrics that describes canopy structure (max intensity, height, etc...). Individual tree delineation, tree leaf area density to describe vertical leaf distribution. https://opensource.ncsa.illinois.edu/bitbucket/projects/BD/repos/ir-lidar/browse

Yes

N/A

Yes

N/A

Kunxuan Wang

(ncsa.arcgis.treedelin)

Extractor

Create a shape-file with polygons of tree canopy polygon from LiDAR data

Yes

Py1

Yes

N/A

(Windows,

ArcGIS)

Yes

(Windows VM)

No

Sandeep Puthanveetil Satheesan

17

Vegetation IndicesExtractorCalculating vegetation indices such as NDVI and Surface Temperature from Landsat 7 and 8 satellite data.Is this the same as Normalized Difference Vegetation Index extractor above?

Debsunder Dutta (debsunderdutta@gmail.com) leaving in July 2016

Tree-wise voxelization		Tree-wise voxelization of waveform data for lidar metrics that describes canopy structure (max intensity, height, etc...). Individual tree delineation, tree leaf area density to describe vertical leaf distribution.	Yes (But extractor has to be developed)	No	No	No	No	No	No	No	Sandeep Puthanveetil Satheesan
18	Valley Safety Zones	Extractor	Estimate submerging areas and water depths under extreme floods and map the safety zones in a river valley.	Yes								Sandeep Puthanveetil Satheesan emailed March and April. Sandeep will follow up with Qina.
Ecology	1	netcdf (ncdump)	Converter	Convert from binary netcdf to text.	N/A	N/A	N/A	Yes	Yes	Yes	Yes	Yes	Yan Zhao
	2	PEcAn (PEcAn#Ameriflux)	Converter	Convert Ameriflux data to PEcAn's netcdf CF format.	Yes	N/A	N/A	Yes	Yes	Yes	Yes	Yes	Yan Zhao
	3	PEcAn (PEcAn#DALEC

Ecology

1netcdf (ncdump)ConverterConvert from binary netcdf to text.https://opensource.ncsa.illinois.edu/bitbucket/projects/POL/repos/converters-ncdump/browseN/AN/A

N/A

Yes

N/A

2PEcAn (PEcAn#Ameriflux)ConverterConvert Ameriflux data to PEcAn's netcdf CF format.https://opensource.ncsa.illinois.edu/bitbucket/projects/POL/repos/converters-pecan/browse

Yes

N/A

Yes

N/A

3PEcAn (PEcAn#DALEC)ConverterConvert PEcAn's netcdf CF format to the format required by the DALEC model.https://opensource.ncsa.illinois.edu/bitbucket/projects/POL/repos/converters-pecan/browse

Yes

N/A

Yes

N/A

4PEcAn (PEcAn#ED2)ConverterConvert PEcAn's netcdf CF format to the format required by the ED model.https://opensource.ncsa.illinois.edu/bitbucket/projects/POL/repos/converters-pecan/browse

Yes

N/A

Yes

N/A

5PEcAn (PEcAn#LINKAGEShttps://opensource.ncsa.illinois.edu/bitbucket/projects/POL/repos/converters-pecan/browse Yes

)

Converter

Convert PEcAn's netcdf CF format to the format required by

the LINKAGES model.

the DALEC model.

Yes

N/A

Yes

N/A

Yes

6

4	PEcAn (

PEcAn#Sipnet

PEcAn#ED2)

Converter

Convert PEcAn's netcdf CF format to the format required

by the Sipnet model.https://opensource.ncsa.illinois.edu/bitbucket/projects/POL/repos/converters-pecan/browse

by the ED model.

Yes

N/A

Yes

N/A7

Yes

5

PEcAn (

PEcAn#AmerifluxBNL

PEcAn#LINKAGES)

PEcAn (PEcAn#FLUXNET2015)

Converter

PEcAn (PEcAn#FACE)

PEcAn (PEcAn#PALEON)

PEcAn (PEcAn#NLDAS)

PEcAn (PEcAn#CURNCEP)

PEcAn (PEcAn#GLDAS)

PEcAn (PEcAn#GFDL)

PEcAn (PEcAn#BIOCRO)

PEcAn (PEcAn#CLM)

PEcAn (PEcAn#GDAY)

PEcAn (PEcAn#JULES)

PEcAn (PEcAn#LPJ-GUESS)

PEcAn (PEcAn#MAAT)

PEcAn (PEcAn#MAESPA)

PEcAn (PEcAn#PRELES)

Convert PEcAn's netcdf CF format to the format required by the LINKAGES model.	Yes	N/A	N/A	Yes	Yes	Yes	Yes	Yes	Yan Zhao
6	PEcAn (PEcAn#Sipnet)	Converter	Convert PEcAn's netcdf CF format to the format required by the Sipnet model.

Yes

N/A

Yes

N/A

8PlantCV (terra.plantcv)ExtractorExtract plant height, area, and color distribution from photographs.https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-plantcv/browse

Yes

Yan LiuCivil & Environmental Engineering

1

Body of Water Detector (ncsa.image.ponddetect)

ExtractorLand coverage, extract locations of bodies of water from satellite data.https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-maps/browse/feature_detection

Yes

7	PEcAn (PEcAn#AmerifluxBNL) PEcAn (PEcAn#FLUXNET2015) PEcAn (PEcAn#FACE) PEcAn (PEcAn#PALEON) PEcAn (PEcAn#NLDAS) PEcAn (PEcAn#CURNCEP) PEcAn (PEcAn#GLDAS) PEcAn (PEcAn#GFDL) PEcAn (PEcAn#BIOCRO) PEcAn (PEcAn#CLM) PEcAn (PEcAn#GDAY) PEcAn (PEcAn#JULES) PEcAn (PEcAn#LPJ-GUESS) PEcAn (PEcAn#MAAT) PEcAn (PEcAn#MAESPA) PEcAn (PEcAn#PRELES)			Yes	N/A	N/A	No	Yes	Yes	Yes	No	Yan Zhao
8	PlantCV (terra.plantcv)	Extractor	Extract plant height, area, and color distribution from photographs.	Yes	Py1	yes	yes	yes	yes	no (bamboo test available, not being deployed)	no	Yan Zhao
9	BIL (hyperspectral)	Converter	Convert bil.zip (contains raw & .hdr) to .terra.nc	Yes (The original code has been updated. Need to figure out if we want to update this)	N/A	N/A	yes	no, only test files available are very large and would take a long time	yes	no (bamboo test available, not being deployed)	no	Yan Zhao
Civil & Environmental Engineering	1	Body of Water Detector (ncsa.image.ponddetect)	Extractor	Land coverage, extract locations of bodies of water from satellite data.	Yes	No	No	No	Yes	No	No	No	??who??
	2	GI Identification	Extractor		Yes	Py1	Yes	Yes	Yes	Yes	Yes	No	Bing Zhang
	3	Human Preference Score (ncsa.image.humanpref)	Extractor	Assign a model derived human preference score to a given image of an urban environment.	Yes	no	no	no	yes	no	yes	no	??who??
	4	Route Greenness (ncsa.greenindex)	Extractor	Derive the green index of a city route.	Yes	Py1	Yes	yes	yes	Yes	Yes	No	Bing Zhang
	5	Social Media GI Preferences	Extractor	Determine if text contains references to visual or functional green infrastructure.	Yes	Py1	Yes	Yes	Yes	No	no	No	Bing Zhang
	6	Stanford CoreNLP - Sentiment (ncsa.nlp.SNLPSentiment)	Extractor	Assign a sentiment score to a piece of text.	Yes	Py2 Java equiv.	Yes	Yes	Yes	Yes	No	No	Gregory Jansen
	7	TUV Triaxus	Extractor	Towed Undulating Vehicle Data Analyzing Tools	Yes	Py1	yes	yes	yes	yes	yes	No	Yan Zhao
Social Science & Humanities	1	Bertillon Card Cell Extractor (ncsa.image.dmp)	Extractor	Extract table cells from a Bertillon Card.	Yes	Py1	Yes	No	No	No	No	No	Sandeep Puthanveetil Satheesan
	2	Census Form Cell Extractor (census-section-segmentor)	Extractor	Extract table cells from a 1930s Census form.	Yes	No	No	No	Yes	No	No	No	Sandeep Puthanveetil Satheesan
	3	Handwritten Decimals Extractor (ncsa.image.sphog.debod)	Extractor	Extract handwritten decimal values from an image.	Yes	Py1	Yes	No	No	No	No	No	Sandeep Puthanveetil Satheesan
	4	Killed Photos (ncsa.image.killedphoto)	Extractor	Identify depression era photos "killed" by Farm Security Administration director Roy Stryker, indicated by a hole punch in the image.	Yes	Py2	Yes	Yes	Yes	Yes	no	no	Sandeep Puthanveetil Satheesan
	5	Mean Grey (ncsa.cv.meangrey)	Extractor	Mean grey values of black and white photos.	Yes	Py1	Yes	Yes	Yes	Yes	Yes	no	Bing Zhang
	6	Movie Slice (ncsa.movieslice)	Extractor	Generates movie slice visualization from video files	Yes	Py1/Py2 not used	No	No	No	No	No	No	Sandeep Puthanveetil Satheesan
	7	Person Detector (ncsa.video.person_detector)	Extractor	Extract locations of people in an image.	Yes	Py2	Yes	Yes (MATLAB external)	Yes	Yes	Yes	No	Sandeep Puthanveetil Satheesan
	8	Person Tracker (ncsa.video.person_tracking)	Extractor	Extract locations and paths of people moving in videos.	Yes	Py2	Yes	Yes (MATLAB external)	Yes	Yes	Yes	No

2GI IdentificationExtractor

Yes

3Human Preference Score (ncsa.image.humanpref)ExtractorAssign a model derived human preference score to a given image of an urban environment.https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-maps/browse/humanpref

4

Route Greenness (ncsa.xml.greenindexroute)

ExtractorDerive the green index of a city route.https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-maps/browse/greenroute

Yes

5Social Media GI PreferencesExtractor

6Stanford CoreNLP - Sentiment (ncsa.nlp.SNLPSentiment)ExtractorAssign a sentiment score to a piece of text.https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-nlp/browse/SNLP/SNLPSentimentExtractor

7TUV TriaxusExtractor

Social Science & Humanities
1Bertillon Card Cell Extractor (ncsa.image.dmp)ExtractorExtract table cells from a Bertillon Card.https://opensource.ncsa.illinois.edu/bitbucket/projects/DEBOD/repos/extractors-debod

2Census Form Cell Extractor (census-section-segmentor) ExtractorExtract table cells from a 1930s Census form.https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-cv/browse/census

Inna Zharnitsky

3Handwritten Decimals Extractor (ncsa.image.sphog.debod)ExtractorExtract handwritten decimal values from an image.https://opensource.ncsa.illinois.edu/bitbucket/projects/DEBOD/repos/extractors-handwrittendecimals

4Killed Photos (ncsa.image.iarp_remove_circle)ExtractorIdentify depression era photos "killed" by Farm Security Administration director Roy Stryker, indicated by a hole punch in the image.https://opensource.ncsa.illinois.edu/bitbucket/projects/IARP/repos/image_fetcher/browse/extractors/remove_circle

Marcus SlavenasMarcus Slavenas5Mean Grey (ncsa.cv.meangrey)ExtractorMean grey values of black and white photos.https://opensource.ncsa.illinois.edu/bitbucket/projects/IARP/repos/image_fetcher/browse/extractors/mean_grey

Marcus Slavenas

Marcus Slavenas6Movie Slice (ncsa.movieslice)Extractor https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-movieslice/browse

7Person Detector (person-detector)ExtractorExtract locations of people in an image.https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-person-detector

8Person Tracker (ncsa.person-tracker)ExtractorExtract locations and paths of people moving in videos.https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-person-tracking

	Sandeep Puthanveetil Satheesan
9	Video Analytics Toolbox (ncsa.cinemetrics_batch)	Extractor	Extract

shot descriptors from videos.https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-lsva-integrated/browse

shot descriptors from videos.

Yes

Py1/Py2 not used

No

https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-mri/browse/mri2mesh

Biology

Biology, Genomics, Medicine	1	FSL (mri2mesh)	Extractor

Marcus Slavenas

Creates a finite element mesh from a set of mr images

Yes

Py2

no

yes

no


2	Glomeruli (ncsa

.msc

.

diagnosis)ExtractorExtract glomeruli from Kidney biopsy images.https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-msc/browse/Kidney

Yes

msc.diagnosis)

Extractor

Extract glomeruli from Kidney biopsy images.

Yes

Py1

Yes

no

General

Domain		Tool	Type	Description

Repositories

	Code	Py1/Py2

TCDL

JSON-LD

DPL

DCKR	Test File

DeveloperContact

TC	DPL	DL	Assignment

General

1Calibre (ebook-converter)Converter

Convert e-books to a number of document formats.https://opensource.ncsa.illinois.edu/bitbucket/projects/POL/repos/converters-ebook-convert/browseN/A

Yes

N/A

2

CMU Sphinx (ncsa.audio.speech2text)

ExtractorAudio recognition to extract text for speech within audio.https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-core/browse/audio/speech2textN/A

Yes

General

3

1

Daffodil

Calibre (

daffodil

ebook-converter)

Converter

Convert

formats with a provided DFDL schema to XML.https://opensource.ncsa.illinois.edu/bitbucket/projects/POL/repos/converters-daffodil/browse

e-books to a number of document formats.

Yes

N/A

Yes

N/A Kenton McHenry4DBPedia (ncsa.dbpedia)ExtractorFind and define named entities within the given text.https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-dbpedia/browse

Yes

Luigi Marini

5FFmpeg (ffmpeg)ConverterConvert between a large number of video formats.https://opensource.ncsa.illinois.edu/bitbucket/projects/POL/repos/converters-ffmpeg/browseN/A

Yes

N/A

6FLAC (flac)ConverterConvert to and from the FLAC format from other audio formats.https://opensource.ncsa.illinois.edu/bitbucket/projects/POL/repos/converters-flac/browseN/A

Yes

N/A

7Ghostscript (ghostscript)ConverterConvert between document formats.https://opensource.ncsa.illinois.edu/bitbucket/projects/POL/repos/converters-ghostscript/browseN/A

Yes

N/A

8htmldoc (htmldoc)ConverterConvert HTML to a number of document formats.https://opensource.ncsa.illinois.edu/bitbucket/projects/POL/repos/converters-htmldoc/browseN/A

Yes

N/A Kenton McHenry9ImageMagick (ImageMagick)ConverterConvert between a large number of image formats.https://opensource.ncsa.illinois.edu/bitbucket/projects/POL/repos/converters-imagemagick/browseN/A

Yes

N/A Kenton McHenry10ImageMagick (ncsa.image.metadata)ExtractorPull available EXIF image metadata from a given image.https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-core/browse/image/metadata

Maxwell Burnette

11Kabeja (kabeja)ConverterConvert between a handful of 3D and image formats.https://opensource.ncsa.illinois.edu/bitbucket/projects/POL/repos/converters-kabeja/browseN/A

Yes

N/A 12

OpenCV - Faces (ncsa.cv.faces)

ExtractorFind faces in an image and return their locations.https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-cv/browse/extractors-opencv/extractors-opencv-faces

Yes

Kenton McHenry13

OpenCV - Eyes (ncsa.cv.eyes)

ExtractorFind eyes in an image and return their locations.https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-cv/browse/extractors-opencv/extractors-opencv-eyes

Yes

No

Yes

2

CMU Sphinx (ncsa.audio.speech2text)

Extractor

Audio recognition to extract text for speech within audio.

Yes

N/A

Yes

No

??who??

3

Daffodil (daffodil)

Converter

Convert formats with a provided DFDL schema to XML.

Yes

4

DBPedia (ncsa.dbpedia)

Extractor

Find and define named entities within the given text.

Yes

Py1

Yes

Luigi Marini

5

FFmpeg (ffmpeg)

Converter

Convert between a large number of video formats.

Yes

N/A

Yes

No

Yes

6

FLAC (flac)

Converter

Convert to and from the FLAC format from other audio formats.

Yes

N/A

Yes

No

Yes

7

Ghostscript (ghostscript)

Converter

Convert between document formats.

Yes

N/A

Yes

No

Yes

8

htmldoc (htmldoc)

Converter

Convert HTML to a number of document formats.

Yes

N/A

Yes

No

Yes

9

ImageMagick (ImageMagick)

Converter

Convert between a large number of image formats.

Yes

N/A

Yes

No

Yes

10

ImageMagick (ncsa.image.metadata)

Extractor

Pull available EXIF image metadata from a given image.

Yes

Py2

Yes

11

Kabeja (kabeja)

Converter

Convert between a handful of 3D and image formats.

Yes

N/A

Yes

12

OpenCV - Faces (ncsa.cv.faces)

Extractor

Find faces in an image and return their locations.

Yes

Py1

Yes

https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-cv/browse/extractors-opencv/extractors-opencv-closeups

13

OpenCV - Eyes (ncsa.cv.eyes)

Extractor

Find eyes in an image and return their locations.

Yes

Py1

Yes

	Kenton McHenry
14	OpenCV - Closeups (ncsa.cv.closeups)	Extractor	Determine whether an image is a closeup of a person or not.

Yes

Yes	Py1	Yes	Yes	Yes	Yes	Yes	Yes	Kenton McHenry
15	OpenCV - Profiles (ncsa.cv.profiles)	Extractor	Find human face profiles in an image and return their locations.

https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-cv/browse/extractors-opencv/extractors-opencv-profiles

Yes

Py1

Yes

Yes	Kenton McHenry
16	Langid (ncsa.nlp.simplelanguage)	Extractor	Identify the language of the given

text.https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-nlp/browse/SimpleLanguage

text.

Yes

Py2

Yes

No

	Gregory Jansen
17	LibreOffice (unoconv)	Converter	Convert to and from a variety of document formats.

https://opensource.ncsa.illinois.edu/bitbucket/projects/POL/repos/converters-unoconv/browse

Yes

N/A

Yes

N/A

No

Yes

	Bing Zhang
18	NLTK - Summary (ncsa.nlp.simplesummary)	Extractor	Summarize a body of text.

https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-nlp/browse/SimpleSummary

Yes

Yes	Py2	Yes	Yes	Yes	Yes	No	No	Gregory Jansen
19	Siegfried (siegfried)	Extractor	Extract information about a given file relevant to identifying

its type and validating its format.

Yes

Py2

Yes

No

https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-siegfried/browse

Yes

	Gregory Jansen
20	Stanford CoreNLP (ncsa.nlp.SNLP)	Extractor	Natural Language Process extractions such as parts of speech

, named entities, langauge, etc.https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-nlp/browse/SNLP/SNLPExtractor

, named entities, langauge, etc.

Yes

Py2 Java equiv.

Yes

No

Gregory Jansen


21	Tesseract (ncsa.image.ocr)	Extractor	Object Character Recognition (OCR) to extract text

from images containing text.https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-cv/browse/extractors-tesseract

Yes

https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-nlp/browse/tika

from images containing text.	Yes	Py1	Yes	Yes	Yes	Yes	Yes	Yes	Bing Zhang
22	Tika (ncsa.nlp.tika)	Extractor	Document extractions such as language identification, ...

Yes

older Py1

Yes

No

Yes

Gregory Jansen

https://opensource.ncsa.illinois.edu/bitbucket/projects/POL/repos/converters-txt2html/browse


23	txt2html (txt2html)	Converter	Convert text documents to HTML.	Yes

N/A

Yes

N/A Kenton McHenry

Yes	Bing Zhang
24	Versus - Color Distribution (ncsa.versus.image)	Extractor	Generate a distribution of color values within an image to be used for comparing how similar two

images are.https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-versus/browse

images are.

Yes

Py1

Yes

https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-cv/browse/extractors-vlfeat

Bing Zhang
25	VLFeat (ncsa.image.caltech101)	Extractor	Classify images as to whether they contain objects from the Caltech101 dataset (e.g. people, airplanes, motorcycles, cougars, ...).

Yes

Py2

yes

Yes

yes

no