
Code: Is there code? Is it submitted or existing? If so, add the URL to the parent wiki page.
Py1/Py2: Is it using PyClowder 1, PyClowder 2, or neither? The expectation for Beta is that most will be in Py1; going forward, new tools will use Py2.
JSON-LD: Is proper JSON-LD generated in the metadata and registration document (make sure to use full URIs)?
DCKR: Dockerized?
Test File: Sample file available?

TC: Is the tool in the tools catalog?
DPL: Is the tool deployed (dockerized if not Windows, managed by the elasticity service, and passing tests every other hour)?
DL: Is the tool downloadable via Sandeep's modifications to move compute to the data?

Complete & Deployed
Needs work
Blocked
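
The JSON-LD column above asks for full URIs. As a rough illustration only, the sketch below shows one way such a check could look in Python. The metadata document, its field names, and the schema.org vocabulary are assumptions made for this example; they are not taken from any extractor listed on this page, and the helper functions are hypothetical.

```python
import json
from urllib.parse import urlparse


def is_full_uri(value):
    """Return True if value is an absolute http(s) URI."""
    parts = urlparse(value)
    return parts.scheme in ("http", "https") and bool(parts.netloc)


def context_uses_full_uris(doc):
    """Check that every term in a dict-style @context maps to a full URI."""
    context = doc.get("@context", {})
    if not isinstance(context, dict) or not context:
        return False
    return all(isinstance(v, str) and is_full_uri(v) for v in context.values())


# Hypothetical metadata document; the vocabulary and values are illustrative only.
metadata = {
    "@context": {
        "name": "http://schema.org/name",
        "description": "http://schema.org/description",
    },
    "name": "example-extractor",
    "description": "Illustrative record, not a real registration document.",
}

# A context that relies on a compact prefix instead of a full URI fails the check.
bad_metadata = {"@context": {"name": "schema:name"}, "name": "example-extractor"}

print(json.dumps(metadata, indent=2))
print(context_uses_full_uris(metadata))
print(context_uses_full_uris(bad_metadata))
```

A check like this only looks at the @context mapping; it does not validate the document against the JSON-LD specification.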

Scientific

Domain | Tool | Type | Description | Repositories | Code | Py1/Py2 | JSON-LD | DCKR | Test File | TC | DPL | DL | Assignment | Contact
Hydrology




 

 








1Advection Diffusion Solve a general advection-dispersion equation.
   

 

No

(Don't have a general tool. Seems like this was not done as per the description.)

  
 
   

 

Dong Kook Woo

 

Sandeep Puthanveetil Satheesan

Per Dr K this will be post beta

2Chemical Mean AgeExtractorDetermine the mean age of chemical constituents with inputs of chemical dynamics.
 

 No

(This is a simulation tool - don't have code.)

 

 

 

 

 

 

 

 

 

Dong Kook Woo

Sandeep Puthanveetil Satheesan

Per Dr K this will be post beta

3Document Tables Extractor (ncsa.nlp.wordtables)ExtractorExtract tables from documents.
https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-nlp/browse/WordTablesExtractor

 

 

 

 

 

 

 

  4

Yes 

No

No

No

No 

No 

No 

No 


4
ESRI Shapefile Extractor by using GDAL (ncsa.geoshp.preview)
Extractor
 https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-geo/browse

 N/A

 

 

 

 

Yes

 

 
Create a GeoServer layer for the WMS service. It can be used in the previewer.

Yes

Py1

Yes

Yes

 Yes

 No

Yes

 No

5
GeoTiff Extractor by using GDAL (ncsa.geotiff.preview)
Extractor
Create a GeoServer layer for the WMS service. It can be used in the previewer.

Yes

Py1

Yes

Yes

 Yes

 No

Yes

 No

5GDAL (ncsa.geo.tiffExtractor)Extractor https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-geo/browse

 N/A

  

 

 

Yes

 

 Smruti Padhy

6
GeoTiff Metadata Extractor by using GDAL (Another implementation) (ncsa.geotiff.metadata)
Extractor

Mostafa Elag
 https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-geotiff/browse

 N/A

 

 

 

 

 

 

 
Extract geospatial metadata from Geotiff. It should be combined with #5 Geotiff extractor.

 Yes

Py1

Yes

Yes

 Yes

 Yes

Yes

 No

7
Historical River Extractor (ncsa.cv.river)
Extractor
Extract the river networks from historical hand-drawn maps and compare them with current river networks.
https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-cv/browse/river

 Yes

Py1

 Yes

Yes

 Yes

 

 Yes

 

 Yes

 

 

 

 

 

No

8Normalized Difference Vegetation Index (ncsa.arcgis.landsat7mosaic)Extractor
 https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-bd-cz/browse/ndviextractor

 

 

 

 

 

 

 

 

Create an NDVI layer from Landsat data. Calculate vegetation indices such as NDVI and surface temperature from Landsat 7 and 8 satellite data.

 Yes

Py1

 Yes

N/A

(Windows,

ArcGIS)

 Yes

 No

 Yes

(Windows VM)

 No

Smruti Padhy

9
River Chi Index (ncsa.chi_analysis)
Extractor
Identify the river dynamics in a river basin and evaluate the influence of human activities through the Chi index in the streams.
https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-bd-cz/browse/chi-analysis

 Yes

Py1

 Yes

Yes

 Yes

 

Yes

Yes

 

Yes

Qina Yan

10
GDAL Converter
Converter
Converts between TIFF and BIL (zip) and FLT (zip)
Yes
N/A
N/A
Yes
Yes
Yes
Yes
Yes
11
River Sinuosity
Extractor
Study the maturity and equilibrium conditions of a stream through the sinuosity index.

 

Yes

 

 

 

 

 

 

 

Qina Yan

11

Esther Lee (estherl2010@gmail.com)

Qina is supposed to be delivering the code. Dr K thinks it should make it into Beta.

12Soil Moisture ChangeExtractorDetermine role of hydraulic redistribution in AZ (riparian site / upland site) by studying soil moisture change throughout different seasons.

 No

 

 

 

 

 

 

 

 
12

Per Dr. K: this will need to be discussed and cannot be in Beta.

13Species ClassifierExtractorSAM based Species Classification from Hyperspectral data, Hyperspectral Indices, NDVI, SAVI, MSAVI, etc.

 

Yes 

 

 

 

 

 

 

 

Debsunder Dutta (debsunderdutta@gmail.com) leaving in July 2016

13

Dr K does not think this will make it in Beta

14Floodplain Extractor by using TerEx (ncsa.arcgis.floodplain)ExtractorIdentify the flat polygons and the heights inside a river valley.
 https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-bd-cz/browse/terex_floodplain

 N/A

 

 

 

 

 

 

 

Yes (but procedure?)

Py1

Yes 

No (windows)

Yes 

 No

Yes

(Windows)

 no

15
Topographic Depressions
Extractor
Identify topographic depressions (TDs) and their distribution on the landscape (number, location, area, and volume of TDs).

 

Yes - ready to review

 

 

 

 

 

 

 

Phong Le

16

Tree Delineation (ncsa.arcgis.treedelin)

Extractor

Create a shapefile with polygons of tree canopies from LiDAR data.

 Yes

Py1

 Yes

N/A

(Windows,

ArcGIS)

Yes

 Yes

 Yes

(Windows VM)

 

No
17
Tree-wise voxelization
Tree-wise voxelization of waveform data for lidar metrics that describe canopy structure (max intensity, height, etc.). Individual tree delineation and tree leaf area density to describe vertical leaf distribution.

Yes

(But extractor has to be developed)

No

No

No

No 

No 

No 

No 

https://opensource.ncsa.illinois.edu/bitbucket/projects/BD/repos/ir-lidar/browse

 Yes

 

 

N/A

 

 Yes

N/A

18
Valley Safety Zones
Extractor
Estimate submerged areas and water depths under extreme floods and map the safety zones in a river valley.
 

 Yes  

 

 

 

 

 

 

 

Qina Yan

17
Vegetation Indices
Extractor
Calculating vegetation indices such as NDVI and Surface Temperature from Landsat 7 and 8 satellite data. Is this the same as the Normalized Difference Vegetation Index extractor above?

 

 

 

 

 

 

 

 

Debsunder Dutta (debsunderdutta@gmail.com) leaving in July 2016

Sandeep Puthanveetil Satheesan

N/

emailed March and April.

Sandeep will follow up with Qina.

Ecology

 






1netcdf (ncdump)ConverterConvert from binary netcdf to text.
https://opensource.ncsa.illinois.edu/bitbucket/projects/POL/repos/converters-ncdump/browse
N/AN/A

 N/A

 
YesYes

Yes

Yes

N/A
Yes
2PEcAn (PEcAn#Ameriflux)ConverterConvert Ameriflux data to PEcAn's netcdf CF format.
https://opensource.ncsa.illinois.edu/bitbucket/projects/POL/repos/converters-pecan/browse

Yes

N/A

 N/A


Yes
 
Yes

Yes

Yes

N/A
Yes
3PEcAn (PEcAn#DALEC)ConverterConvert PEcAn's netcdf CF format to the format required by the DALEC model.
https://opensource.ncsa.illinois.edu/bitbucket/projects/POL/repos/converters-pecan/browse

Yes

N/A

  N/A

 
YesYes

 Yes

Yes

N/A
Yes
4PEcAn (PEcAn#ED2)ConverterConvert PEcAn's netcdf CF format to the format required by the ED model.

 Yes

https://opensource.ncsa.illinois.edu/bitbucket/projects/POL/repos/converters-pecan/browse

 Yes

N/A

  N/A

 

 Yes

Yes

N/A
N/A

  N/A

YesYes

 Yes

Yes

Yes
5PEcAn (PEcAn#LINKAGES)ConverterConvert PEcAn's netcdf CF format to the format required by the LINKAGES model.
https://opensource.ncsa.illinois.edu/bitbucket/projects/POL/repos/converters-pecan/browse

 Yes

N/A

 N/A

 
YesYes

 Yes

Yes

N/A
Yes
6PEcAn (PEcAn#Sipnet)ConverterConvert PEcAn's netcdf CF format to the format required by the Sipnet model.
https://opensource.ncsa.illinois.edu/bitbucket/projects/POL/repos/converters-pecan/browse

 Yes

N/A

  N/A

 
YesYes

 Yes

 Yes

N/A
Yes
Rob Kooper

Kenton McHenry

7

PEcAn (PEcAn#AmerifluxBNL)

PEcAn (PEcAn#FLUXNET2015)

PEcAn (PEcAn#FACE)

PEcAn (PEcAn#PALEON)

PEcAn (PEcAn#NLDAS)

PEcAn (PEcAn#CURNCEP)

PEcAn (PEcAn#GLDAS)

PEcAn (PEcAn#GFDL)

PEcAn (PEcAn#BIOCRO)

PEcAn (PEcAn#CLM)

PEcAn (PEcAn#GDAY)

PEcAn (PEcAn#JULES)

PEcAn (PEcAn#LPJ-GUESS)

PEcAn (PEcAn#MAAT)

PEcAn (PEcAn#MAESPA)

PEcAn (PEcAn#PRELES)

 

  

 Yes

N/A

 N/A


No
 
Yes

 Yes

Yes

N/A
No
8
PlantCV (terra.plantcv)
Extractor
Extract plant height, area, and color distribution from photographs.
https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-plantcv/browse

 Yes

 

 

 

 

 

 

Yan Liu

Civil & Environmental Engineering

 

1

Body of Water Detector (ncsa.image.ponddetect)

Extractor

Land coverage, extract locations of bodies of water from satellite data.

https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-maps/browse/feature_detection

Yes

 

 

 

 

 

 

 Yes

Py1

 yes

yes

yes 

yes 

no

(bamboo test available, not being deployed)

 no

9
BIL (hyperspectral)
Converter
Convert bil.zip (contains raw & *.hdr) to *.terra.nc

Yes

(The original code has been updated. Need to figure out if we want to update this)

N/A

N/A

yes

no, only test files available are very large and would take a long time

yes

no

(bamboo test available, not being deployed)

no
Civil & Environmental Engineering



 

1

Body of Water Detector (ncsa.image.ponddetect)

Extractor

Land coverage, extract locations of bodies of water from satellite data.

Yes

No

 No

No

 Yes

 No

No 

 No

??who??

Ankit Rai

Marcus Slavenas
2GI IdentificationExtractor
 
 

Yes

 
Py1

 Yes

 
Yes

 Yes

 Yes

 

 Yes

No

3Human Preference Score (ncsa.image.humanpref)ExtractorAssign a model derived human preference score to a given image of an urban environment.
https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-maps/browse/humanpref

 

 

 

 

 

 

 

 Yes

no

 no

no

 yes

 no

 yes

 no

??who??

Ankit Rai

Marcus Slavenas

4

Route Greenness (ncsa.greenindex)

Extractor

Derive the green index of a city route.
https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-maps/browse/greenroute

 

 

 

 

 

 Yes

 

 Yes

Py1

 Yes

yes

 yes

 Yes

 Yes

 No

5Social Media GI PreferencesExtractor
  

 

 

 

 

 

 

 

 
Determine if text contains references to visual or functional green infrastructure.

 Yes

Py1

 Yes

Yes

Yes 

No 

 no

 No

Ankit Rai

Marcus Slavenas

6Stanford CoreNLP - Sentiment (ncsa.nlp.SNLPSentiment)ExtractorAssign a sentiment score to a piece of text.
https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-nlp/browse/SNLP/SNLPSentimentExtractor

 

 

 

 

 

 

 

 

 Yes

Py2 Java equiv.

Yes

YesYes

 

Yes

 

No

 

No

 

7TUV TriaxusExtractor
  

 

 

 

 

 

 

 

 

Towed Undulating Vehicle Data Analyzing Tools

 Yes

Py1

 yes

yes

yes

 yes

 yes

No 

Ankit Rai

Marcus Slavenas

Social Science & Humanities













1Bertillon Card Cell Extractor (ncsa.image.dmp)ExtractorExtract table cells from a Bertillon Card.
https://opensource.ncsa.illinois.edu/bitbucket/projects/DEBOD/repos/extractors-debod

 

 

 

 

 

 

 Yes

Py1

Yes

No

 No

No 

No

 No

 

2Census Form Cell Extractor (census-section-segmentor) ExtractorExtract table cells from a 1930s Census form.
https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-cv/browse/census

 

 

 

 

 

 

 

 

 Yes

No

 No

No

 Yes

No 

 No

 No

Inna Zharnitsky

3Handwritten Decimals Extractor (ncsa.image.sphog.debod)ExtractorExtract handwritten decimal values from an image.
https://opensource.ncsa.illinois.edu/bitbucket/projects/DEBOD/repos/extractors-handwrittendecimals

 

 

 

 

 

 

 Yes

Py1

 Yes

No

 No

 No

No 

No

 

4

Killed Photos

(ncsa.image.killedphoto)

Extractor

Identify Depression-era photos "killed" by Farm Security Administration director Roy Stryker, indicated by a hole punch in the image.
https://opensource.ncsa.illinois.edu/bitbucket/projects/IARP/repos/image_fetcher/browse/extractors/remove_circle

 

 

 

 

 

 

 

Marcus Slavenas

 Yes

Py2

Yes

Yes

Yes

Yes

no 

 no

5Mean Grey (ncsa.cv.meangrey)ExtractorMean grey values of black and white photos.
https://opensource.ncsa.illinois.edu/bitbucket/projects/IARP/repos/image_fetcher/browse/extractors/mean_grey 

 

 

Marcus Slavenas

 

 

 

Marcus Slavenas
Yes

 Py1

 Yes

YesYes

Yes 

Yes

 no

6Movie Slice (ncsa.movieslice)Extractor
 https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-movieslice/browse

 

 

 

 

 

 

 

Generates movie slice visualization from video files

 Yes

Py1/Py2 not used

No

No

 No

 No

No 

No 

7
Person Detector (ncsa.video.person_detector)
Extractor
Extract locations of people in an image.
https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-person-detector

 

 

 

 

 

 

 

 Yes

Py2

Yes

Yes

(MATLAB external)

 Yes

 Yes

Yes 

No 

Sandeep Puthanveetil Satheesan

8
Person Tracker (ncsa.video.person_tracking)
Extractor
Extract locations and paths of people moving in videos.
https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-person-tracking

 

 

 

 

 

 

 

 Yes

Py2

Yes

Yes

(MATLAB external)

 Yes

Yes 

 Yes

No 

Sandeep Puthanveetil Satheesan

9Video Analytics Toolbox (ncsa.cinemetrics_batch)ExtractorExtract shot descriptors from videos.
https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-lsva-integrated/browse

 

 

 

 

 

 

 Yes

Py1/Py2 not used

No

No

 No

No 

 No

No 

 

Biology, Genomics, Medicine

1
FSL (mri2mesh)
Extractor
 https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-mri/browse/mri2mesh

 

 

 

 

 

 

 

Marcus Slavenas
Creates a finite element mesh from a set of MR images.

 Yes

Py2

 no

yes

 yes

 no

 no

 no

Marcus Slavenas

2Glomeruli (ncsa.msc.diagnosis)ExtractorExtract glomeruli from Kidney biopsy images.
https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-msc/browse/Kidney

 Yes

 

Yes

Yes

Yes

 

Yes

 

 Yes

Py1

Yes

Yes

Yes

Yes

no

no

General

Domain | Tool | Type | Description | Repositories | Code | Py1/Py2 | JSON-LD | DCKR | Test File | TC | DPL | DL | Assignment | Contact

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

General

1Calibre (ebook-converter)ConverterConvert e-books to a number of document formats.
https://opensource.ncsa.illinois.edu/bitbucket/projects/POL/repos/converters-ebook-convert/browse
YesN/A
 
N/A
 
Yes
 
Yes

Yes

N/A

No

 
Yes
 
2

CMU Sphinx (ncsa.audio.speech2text)

ExtractorAudio recognition to extract text for speech within audio.
https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-core/browse/audio/speech2text
YesN/A
 
Yes
 
Yes
 
Yes

 Yes

 

No 

 

 

No 

??who??

Marcus Slavenas

3Daffodil (daffodil)ConverterConvert formats with a provided DFDL schema to XML.
https://opensource.ncsa.illinois.edu/bitbucket/projects/POL/repos/converters-daffodil/browse
Yes 
N/A 
   

 Yes

 Yes

N/A
Yes
4DBPedia (ncsa.dbpedia)ExtractorFind and define named entities within the given text.
https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-dbpedia/browse

 Yes

 
Py1
 
Yes
 
Yes
 
Yes
 

Yes

 

 Yes

 

5FFmpeg (ffmpeg)ConverterConvert between a large number of video formats.
https://opensource.ncsa.illinois.edu/bitbucket/projects/POL/repos/converters-ffmpeg/browse
YesN/AN/A
 
Yes
 
Yes
 

 Yes

 

N/A 

No

Yes
Marcus Slavenas
6FLAC (flac)ConverterConvert to and from the FLAC format from other audio formats.
https://opensource.ncsa.illinois.edu/bitbucket/projects/POL/repos/converters-flac/browse
N/A

 Yes

 Yes

N/A 
YesN/AN/AYesYes

 Yes

No

Yes
Bing Zhang
7Ghostscript (ghostscript)ConverterConvert between document formats.
https://opensource.ncsa.illinois.edu/bitbucket/projects/POL/repos/converters-ghostscript/browse
YesN/AN/A
 
Yes
 
Yes
 

 Yes

 Yes

No

N/A
Yes
 
Bing Zhang
8
htmldoc (htmldoc)
Converter
Convert HTML to a number of document formats.
Yes
https://opensource.ncsa.illinois.edu/bitbucket/projects/POL/repos/converters-htmldoc/browse
N/AN/A
 
Yes
 
Yes
 

 Yes

 

No

N/A 
Yes
Kenton McHenry

9
ImageMagick (ImageMagick)
Converter
Convert between a large number of image formats.
Yes
https://opensource.ncsa.illinois.edu/bitbucket/projects/POL/repos/converters-imagemagick/browse
N/A
N/A
Yes
Yes

 Yes

 No

Yes

Kenton McHenry


10ImageMagick (ncsa.image.metadata)ExtractorPull available EXIF image metadata from a given image.
https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-core/browse/image/metadata

 

   

 

 

 

 

 Yes

Py2YesYesYes

Yes

 Yes

 Yes


11Kabeja (kabeja)ConverterConvert between a handful of 3D and image formats.
https://opensource.ncsa.illinois.edu/bitbucket/projects/POL/repos/converters-kabeja/browse
YesN/AN/A
   

 Yes

 Yes

N/A

12

OpenCV - Faces (ncsa.cv.faces)

Extractor

Find faces in an image and return their locations.

https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-cv/browse/extractors-opencv/extractors-opencv-faces

 Yes

Py1

Yes

Yes

Yes

 Yes

 Yes

 Yes

13

OpenCV - Eyes (ncsa.cv.eyes)

Extractor

Find eyes in an image and return their locations.

https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-cv/browse/extractors-opencv/extractors-opencv-eyes

 Yes

Py1

Yes

Yes

Yes

 Yes

Yes

 Yes

14

OpenCV - Closeups (ncsa.cv.closeups)

Extractor

Determine whether an image is a closeup of a person or not.

 Yes

Py1YesYesYes
https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-cv/browse/extractors-opencv/extractors-opencv-closeups

 Yes

   

 Yes

 Yes

 

 Yes

 
15

OpenCV - Profiles (ncsa.cv.profiles)

Extractor

Find human face profiles in an image and return their locations.

 Yes

Py1YesYesYes
https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-cv/browse/extractors-opencv/extractors-opencv-profiles

 Yes

   

Yes

 Yes

 

 Yes

 
16Langid (ncsa.nlp.simplelanguage)ExtractorIdentify the language of the given text.
https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-nlp/browse/SimpleLanguage

 

   

 

 

 

 Yes

Py2YesYesYes

 Yes

 No

No

 

17LibreOffice (unoconv)ConverterConvert to and from a variety of document formats.Yes
https://opensource.ncsa.illinois.edu/bitbucket/projects/POL/repos/converters-unoconv/browse
N/AN/A
  
Yes
 
Yes

 Yes

 Yes

N/A

No

Yes
 
Bing Zhang
18
NLTK - Summary (ncsa.nlp.simplesummary)
Extractor
Summarize a body of text.
https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-nlp/browse/SimpleSummary

 

   

 Yes

 


 Yes

Py2Yes

Yes

Yes

 Yes

No

 No

 

 
19
Siegfried (siegfried)
Extractor
Extract information about a given file relevant to identifying its type and validating its format.
https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-siegfried/browse

 

   

 

Yes

 


 Yes

Py2YesYesYes

 Yes

No

 No

Gregory Jansen

https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-nlp/browse/SNLP/SNLPExtractor

 

   

 

 

 

 

20
Stanford CoreNLP (ncsa.nlp.SNLP)
Extractor
Natural language processing extractions such as parts of speech, named entities, language, etc.

 Yes

Py2 Java equiv.YesYesYes

 Yes

 No

 No

21
Tesseract (ncsa.image.ocr)
Extractor
Optical Character Recognition (OCR) to extract text from images containing text.
https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-cv/browse/extractors-tesseract

 Yes

  

 Yes

Py1YesYesYes
 

 Yes

 Yes

 

Yes 

 
Kenton McHenry
22Tika (ncsa.nlp.tika)ExtractorDocument extractions such as language identification, ...
https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-nlp/browse/tika

 Yes

   

 

 

 

 

 Yes

older Py1YesYesYesYes

 

No

 

Yes

 

Kenton McHenry

23txt2html (txt2html)ConverterConvert text documents to HTML.Yes
https://opensource.ncsa.illinois.edu/bitbucket/projects/POL/repos/converters-txt2html/browse
N/AN/A
 
Yes
 
Yes
 

Yes

Yes

 

N/A Kenton McHenry
Yes
24
Versus - Color Distribution (ncsa.versus.image)
Extractor
Generate a distribution of color values within an image to be used for comparing how similar two images are.
https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-versus/browse

 Yes

 
Py1Yes
 
Yes
 
Yes

 Yes

  

Yes

 

Yes

Kenton McHenry

25

VLFeat (ncsa.image.caltech101)

ExtractorClassify images as to whether they contain objects from the Caltech101 dataset (e.g. people, airplanes, motorcycles, cougars, ...).
https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-cv/browse/extractors-vlfeat

 Yes

   

 Yes

 

 

 

 Yes

Py2yesyesyes

 Yes

 yes

no

Kenton McHenry

26Zip (zip)ConverterUnzip zip archives.Yes
https://opensource.ncsa.illinois.edu/bitbucket/projects/POL/repos/converters-zip/browse
N/AN/A
 
Yes
 
Yes
 

 Yes

 Yes

No

Yes

N/A Kenton McHenry
Bing Zhang

https://opensource.ncsa.illinois.edu/confluence/display/BD/Transformations (Under development, Being Refactored)

GeoTiff Extractor by using