Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

Code: Is there code?  Is it submitted or existing? If so add URL to parent wiki page!!!
Py1/Py2: Is it using PyClowder1, PyClowder 2, or neither.  Expectation for Beta is most will be in Py1, moving forward with new ones we will use Py2.
JSON-LD: Is proper JSON-LD generated in the MetaData and registration document (make sure to use full URI).
DCKR: Dockerized?
Test File: Sample file available?

TC: Is the tool in the tools catalog?
DPL: Is the tool deployeddockerized (if not Windows), managed by the elasticity service, and passing tests every other hour?
DL: Is the tool downloadable via Sandeep's modifications to move compute to the data?

Complete & Deployed
Needs work
Blocked

Scientific

Domain  ToolType

Description                                     

Repositories

CodePy1/Py2JSON-LDDCKRTest FileTCDPLDLAssignment
ContactHydrology
Hydrology




 

 








1Advection Diffusion Solve a general advection-dispersion equation.
   

No

(Don't have a general tool. Seems like this was not done as per the description.)

 
 
    

 

Dong Kook Woo

 

Sandeep Puthanveetil Satheesan

Per Dr K this will be post beta

2Chemical Mean AgeExtractorDetermine the mean age of chemical constituents with inputs of chemical dynamics.
 

 No

(This is a simulation tool - don't have code.)

 

 

 

 

 

 

 

 

 

Dong Kook Woo

Sandeep Puthanveetil Satheesan

Per Dr K this will be post beta

3Document Tables Extractor (ncsa.nlp.wordtables)ExtractorExtract tables from documents.
https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-nlp/browse/WordTablesExtractor

 

 

 

 

 

 

 

  4

Yes 

No

No

No

No 

No 

No 

No 


4ESRI Shapefile Extractor by using GDAL (ncsa.
geo
geoshp.
shpExtractor

Mostafa Elag

preview)Extractor
 https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-geo/browse

 N/A

 

 

 

 

Yes

 

 5GDAL (ncsa.geo.tiffExtractor)Extractor https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-geo/browse

 N/A

  

 

 

Yes

 

 6GDAL (ncsa.image.geotiff)Extractor https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-geotiff/browse

 N/A

 

 

 

 

 

 

 
Create a Geoserver layer for WMS service. It can be used in previewer

Yes

Py1

Yes

Yes

 Yes

 No

Yes

 No

5GeoTiff Extractor by using GDAL (ncsa.geotiff.preview)ExtractorCreate a Geoserver layer for WMS service. It can be used in previewer

Yes

Py1YesYes

 Yes

 No

Yes

 No

6GeoTiff Metadata Extractor by using GDAL (Another implementation) (ncsa.geotiff.metadata)ExtractorExtract geospatial metadata from Geotiff. It should be combined with #5 Geotiff extractor.

 Yes

Py1

Yes

Yes

 Yes

 Yes

Yes

 No

7Historical River Extractor (ncsa.cv.river)ExtractorExtract the river networks from the ancient hand-drawing maps and compare them with current river networks
https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-cv/browse/river

 Yes

Py1

 Yes

Yes

 Yes

 

 Yes

 

 Yes

 

 

 

 

 

No

8Normalized Difference Vegetation Index (ncsa.arcgis.landsat7mosaic)Extractor
 https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-bd-cz/browse/ndviextractor

 

 

 

 

 

 

 

 

Create a NDVI layer from Landsat data. Calculating vegetation indices such as NDVI and Surface Temperature from Landsat 7 and 8 satellite data.

 Yes

Py1

 Yes

N/A

(Windows,

ArcGIS)

 Yes

 No

 Yes

(Windows VM)

 No

Smruti Padhy

9River Chi Index (ncsa.chi_analysis)ExtractorIdentify the river dynamics in a river basin and evaluate human activities' influences through Chi index in the streams.
https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-bd-cz/browse/chi-analysis

 Yes

Py1

 Yes

Yes

 Yes

 

Yes

Yes

 

Yes

10
River Sinuosity
GDAL Converter
ExtractorStudy the maturity and equilibrium conditions of a stream through the  
ConverterConverts between TIFF and BIL (zip) and FLT (zip)YesN/AN/AYesYesYesYesYes
11River SinuosityExtractorStudy the maturity and equilibrium conditions of a stream through the sinuosity index.

 

Yes

 

 

 

 

 

 

 

Qina Yan

11Soil Moisture ChangeExtractor

Esther Lee (estherl2010@gmail.com)

Qina supposed to be delivering code - Dr K thinks it should make it in Beta

12Soil Moisture ChangeExtractorDetermine role of hydraulic redistribution in AZ (riparian site / upland site) by studying soil moisture change throughout different seasons.

 No

 

 

 

 

 

 

 

 
12

Per Dr. K - will need discussed and can not be in Beta

13Species ClassifierExtractorSAM based Species Classification from Hyperspectral data, Hyperspectral Indices, NDVI, SAVI, MSAVI, etc.

 

Yes 

 

 

 

 

 

 

 

Debsunder Dutta (debsunderdutta@gmail.com) leaving in July 2016

13

Dr K does not think this will make it in Beta

14Floodplain Extractor by using TerEx (
TerEx (
ncsa.arcgis.floodplain)ExtractorIdentify the flat polygons and the heights inside a river valley.
 https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-bd-cz/browse/terex_floodplain

 N/A

 

 

 

 

 

 

 

Yes (but procedure?)

Py1

Yes 

No (windows)

Yes 

 No

Yes

(Windows)

 no

15
14 
Topographic DepressionsExtractorIdentify topographic depressions (TDs) and their distribution on landscape (Number, location, area, volume of TDs).

 

Yes - ready to review

 

 

 

 

 

 

 

Phong Le

1516Valley Safety ZonesExtractorEstimate submerging areas and water depths under extreme floods and map the safety zones in a river valley.
 

 

 

 

 

 

 

 

 

Qina Yan

16

Tree Delineation

ConverterTree-wise voxelization of waveform data for lidar metrics that describes canopy structure (max intensity, height, etc...).  Individual tree delineation, tree leaf area density to describe vertical leaf distribution. https://opensource.ncsa.illinois.edu/bitbucket/projects/BD/repos/ir-lidar/browse

 Yes

 

 

N/A

 

 Yes

N/A

(ncsa.arcgis.treedelin)

ExtractorCreate a shape-file with polygons of tree canopy polygon from LiDAR data

 Yes

Py1

 Yes

N/A

(Windows,

ArcGIS)

Yes

 Yes

 Yes

(Windows VM)

 

No
17
Vegetation IndicesExtractorCalculating vegetation indices such as NDVI and Surface Temperature from Landsat 7 and 8 satellite data.Is this the same as Normalized Difference Vegetation Index extractor above?

 

 

 

 

 

 

 

 

Debsunder Dutta (debsunderdutta@gmail.com) leaving in July 2016

Sandeep Puthanveetil Satheesan

Tree-wise voxelization Tree-wise voxelization of waveform data for lidar metrics that describes canopy structure (max intensity, height, etc...).  Individual tree delineation, tree leaf area density to describe vertical leaf distribution.

Yes

(But extractor has to be developed)

No

No

No

No 

No 

No 

No 

18Valley Safety ZonesExtractorEstimate submerging areas and water depths under extreme floods and map the safety zones in a river valley.

 Yes  

 

 

 

 

 

 

 

Sandeep Puthanveetil Satheesan

emailed March and April.

Sandeep will follow up with Qina.

Ecology

 






1netcdf (ncdump)ConverterConvert from binary netcdf to text.N/AN/A

 N/A

YesYes

Yes

Yes

Yes
2PEcAn (PEcAn#Ameriflux)ConverterConvert Ameriflux data to PEcAn's netcdf CF format.

Yes

N/A

 N/A


YesYes

Yes

Yes

Yes
3PEcAn (PEcAn#DALEC
Ecology

 

1netcdf (ncdump)ConverterConvert from binary netcdf to text.https://opensource.ncsa.illinois.edu/bitbucket/projects/POL/repos/converters-ncdump/browseN/AN/A

 N/A

 

Yes

Yes

N/A2PEcAn (PEcAn#Ameriflux)ConverterConvert Ameriflux data to PEcAn's netcdf CF format.https://opensource.ncsa.illinois.edu/bitbucket/projects/POL/repos/converters-pecan/browse

Yes

N/A

 N/A

 

Yes

Yes

N/A3PEcAn (PEcAn#DALEC)ConverterConvert PEcAn's netcdf CF format to the format required by the DALEC model.https://opensource.ncsa.illinois.edu/bitbucket/projects/POL/repos/converters-pecan/browse

Yes

N/A

  N/A

 

 Yes

Yes

N/A4PEcAn (PEcAn#ED2)ConverterConvert PEcAn's netcdf CF format to the format required by the ED model.https://opensource.ncsa.illinois.edu/bitbucket/projects/POL/repos/converters-pecan/browse

 Yes

N/A

  N/A

 

 Yes

Yes

N/A5PEcAn (PEcAn#LINKAGEShttps://opensource.ncsa.illinois.edu/bitbucket/projects/POL/repos/converters-pecan/browse Yes
)ConverterConvert PEcAn's netcdf CF format to the format required by
the LINKAGES model.
the DALEC model.

Yes


N/A

  N/A

Yes
 
Yes

 Yes

Yes

N/A
Yes
6
4PEcAn (
PEcAn#Sipnet
PEcAn#ED2)ConverterConvert PEcAn's netcdf CF format to the format required
by the Sipnet model.https://opensource.ncsa.illinois.edu/bitbucket/projects/POL/repos/converters-pecan/browse
by the ED model.

 Yes

N/A

  N/A

Yes
 
Yes

 Yes

 Yes

Yes

N/A7
Yes
5
PEcAn (
PEcAn#AmerifluxBNL
PEcAn#LINKAGES)
PEcAn (PEcAn#FLUXNET2015)   
Converter

PEcAn (PEcAn#FACE)

PEcAn (PEcAn#PALEON)

PEcAn (PEcAn#NLDAS)

PEcAn (PEcAn#CURNCEP)

PEcAn (PEcAn#GLDAS)

PEcAn (PEcAn#GFDL)

PEcAn (PEcAn#BIOCRO)

PEcAn (PEcAn#CLM)

PEcAn (PEcAn#GDAY)

PEcAn (PEcAn#JULES)

PEcAn (PEcAn#LPJ-GUESS)

PEcAn (PEcAn#MAAT)

PEcAn (PEcAn#MAESPA)

PEcAn (PEcAn#PRELES)

Convert PEcAn's netcdf CF format to the format required by the LINKAGES model.

 Yes

N/A

 N/A

YesYes

 Yes

Yes

Yes
6PEcAn (PEcAn#Sipnet)ConverterConvert PEcAn's netcdf CF format to the format required by the Sipnet model.

 Yes

N/A

  N/A

 
YesYes

 Yes

 Yes

Yes
N/A
8PlantCV (terra.plantcv)ExtractorExtract plant height, area, and color distribution from photographs.https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-plantcv/browse

 Yes

 

 

 

 

 

 

 Yan LiuCivil & Environmental Engineering

 

1

Body of Water Detector (ncsa.image.ponddetect)

ExtractorLand coverage, extract locations of bodies of water from satellite data.https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-maps/browse/feature_detection

Yes

 

 

 

 

 

 

7

PEcAn (PEcAn#AmerifluxBNL)

PEcAn (PEcAn#FLUXNET2015)

PEcAn (PEcAn#FACE)

PEcAn (PEcAn#PALEON)

PEcAn (PEcAn#NLDAS)

PEcAn (PEcAn#CURNCEP)

PEcAn (PEcAn#GLDAS)

PEcAn (PEcAn#GFDL)

PEcAn (PEcAn#BIOCRO)

PEcAn (PEcAn#CLM)

PEcAn (PEcAn#GDAY)

PEcAn (PEcAn#JULES)

PEcAn (PEcAn#LPJ-GUESS)

PEcAn (PEcAn#MAAT)

PEcAn (PEcAn#MAESPA)

PEcAn (PEcAn#PRELES)

  

 Yes

N/A

 N/A


NoYes

 Yes

Yes

No
8PlantCV (terra.plantcv)ExtractorExtract plant height, area, and color distribution from photographs.

 Yes

Py1

 yes

yes

yes 

yes 

no

(bamboo test available, not being deployed)

 no

9BIL (hyperspectral)ConverterConvert bil.zip (contains raw & *.hdr) to *.terra.nc

Yes

(The original code has been updated. Need to figure out if we want to update this)

N/AN/Ayesno, only test files available are very large and would take a long timeyes

no

(bamboo test available, not being deployed)

no
Civil & Environmental Engineering



 

1

Body of Water Detector (ncsa.image.ponddetect)

ExtractorLand coverage, extract locations of bodies of water from satellite data.

Yes

No

 No

No

 Yes

 No

No 

 No

??who??
2GI IdentificationExtractor 

Yes

Py1

 Yes

Yes

 Yes

 Yes

 Yes

No

3Human Preference Score (ncsa.image.humanpref)ExtractorAssign a model derived human preference score to a given image of an urban environment.

 Yes

no

 no

no

 yes

 no

 yes

 no

??who??

4

Route Greenness (ncsa.greenindex)

ExtractorDerive the green index of a city route.

 Yes

Py1

 Yes

yes

 yes

 Yes

 Yes

 No

5Social Media GI PreferencesExtractorDetermine if text contains references to visual or functional green infrastructure.

 Yes

Py1

 Yes

Yes

Yes 

No 

 no

 No

6Stanford CoreNLP - Sentiment (ncsa.nlp.SNLPSentiment)ExtractorAssign a sentiment score to a piece of text.

 Yes

Py2 Java equiv.

Yes

YesYes

 

Yes

 

No

 

No

 

7TUV TriaxusExtractor

Towed Undulating Vehicle Data Analyzing Tools

 Yes

Py1

 yes

yes

yes

 yes

 yes

No 

Social Science & Humanities













1Bertillon Card Cell Extractor (ncsa.image.dmp)ExtractorExtract table cells from a Bertillon Card.

 Yes

Py1

Yes

No

 No

No 

No

 No

2Census Form Cell Extractor (census-section-segmentor) ExtractorExtract table cells from a 1930s Census form.

 Yes

No

 No

No

 Yes

No 

 No

 No

3Handwritten Decimals Extractor (ncsa.image.sphog.debod)ExtractorExtract handwritten decimal values from an image.

 Yes

Py1

 Yes

No

 No

 No

No 

No

4

Killed Photos

(ncsa.image.killedphoto)

ExtractorIdentify depression era photos "killed" by Farm Security Administration director Roy Stryker, indicated by a hole punch in the image.

 Yes

Py2

Yes

Yes

Yes

Yes

no 

 no

5Mean Grey (ncsa.cv.meangrey)ExtractorMean grey values of black and white photos.Yes

 Py1

 Yes

YesYes

Yes 

Yes

 no

6Movie Slice (ncsa.movieslice)ExtractorGenerates movie slice visualization from video files

 Yes

Py1/Py2 not used

No

No

 No

 No

No 

No 

7Person Detector (ncsa.video.person_detector)ExtractorExtract locations of people in an image.

 Yes

Py2

Yes

Yes

(MATLAB external)

 Yes

 Yes

Yes 

No 

8Person Tracker (ncsa.video.person_tracking)ExtractorExtract locations and paths of people moving in videos.

 Yes

Py2

Yes

Yes

(MATLAB external)

 Yes

Yes 

 Yes

No 

2GI IdentificationExtractor  

Yes

 

 Yes

 

 Yes

 Yes

 

3Human Preference Score (ncsa.image.humanpref)ExtractorAssign a model derived human preference score to a given image of an urban environment.https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-maps/browse/humanpref

 

 

 

 

 

 

 

4

Route Greenness (ncsa.xml.greenindexroute)

ExtractorDerive the green index of a city route.https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-maps/browse/greenroute

 

 

 

 

 

 Yes

 

5Social Media GI PreferencesExtractor  

 

 

 

 

 

 

 

 6Stanford CoreNLP - Sentiment (ncsa.nlp.SNLPSentiment)ExtractorAssign a sentiment score to a piece of text.https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-nlp/browse/SNLP/SNLPSentimentExtractor

 

 

 

 

 

 

 

 7TUV TriaxusExtractor  

 

 

 

 

 

 

 

 Social Science & Humanities
1Bertillon Card Cell Extractor (ncsa.image.dmp)ExtractorExtract table cells from a Bertillon Card.https://opensource.ncsa.illinois.edu/bitbucket/projects/DEBOD/repos/extractors-debod

 

 

 

 

 

 

 

2Census Form Cell Extractor (census-section-segmentor) ExtractorExtract table cells from a 1930s Census form.https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-cv/browse/census

 

 

 

 

 

 

 

 3Handwritten Decimals Extractor (ncsa.image.sphog.debod)ExtractorExtract handwritten decimal values from an image.https://opensource.ncsa.illinois.edu/bitbucket/projects/DEBOD/repos/extractors-handwrittendecimals

 

 

 

 

 

 

 

4Killed Photos (ncsa.image.iarp_remove_circle)ExtractorIdentify depression era photos "killed" by Farm Security Administration director Roy Stryker, indicated by a hole punch in the image.https://opensource.ncsa.illinois.edu/bitbucket/projects/IARP/repos/image_fetcher/browse/extractors/remove_circle

 

 

 

 

 

 

 

Marcus SlavenasMarcus Slavenas5Mean Grey (ncsa.cv.meangrey)ExtractorMean grey values of black and white photos.https://opensource.ncsa.illinois.edu/bitbucket/projects/IARP/repos/image_fetcher/browse/extractors/mean_grey 

 

 

Marcus Slavenas

 

 

 

Marcus Slavenas6Movie Slice (ncsa.movieslice)Extractor https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-movieslice/browse

 

 

 

 

 

 

 

7Person Detector (person-detector)ExtractorExtract locations of people in an image.https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-person-detector

 

 

 

 

 

 

 

8Person Tracker (ncsa.person-tracker)ExtractorExtract locations and paths of people moving in videos.https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-person-tracking

 

 

 

 

 

 

 

Sandeep Puthanveetil Satheesan

9Video Analytics Toolbox (ncsa.cinemetrics_batch)ExtractorExtract
shot descriptors from videos.https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-lsva-integrated/browse

 

 

 

 

 

 

 

shot descriptors from videos.

 Yes

Py1/Py2 not used

No

No

 No

No 

 No

No 

Sandeep Puthanveetil Satheesan

Biology
Biology, Genomics, Medicine1FSL (mri2mesh)Extractor
 https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-mri/browse/mri2mesh

 

 

 

 

 

 

 

Marcus Slavenas
Creates a finite element mesh from a set of mr images

 Yes

Py2

 no

yes

 yes

 no

 no

 no

Marcus Slavenas

2Glomeruli (ncsa
.msc
.
diagnosis)ExtractorExtract glomeruli from Kidney biopsy images.https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-msc/browse/Kidney

 Yes

msc.diagnosis)ExtractorExtract glomeruli from Kidney biopsy images.

 Yes

Py1
 

Yes

Yes

Yes

 

Yes

Yes

no

 

no

General

Domain 

Tool

TypeDescription                                     
Repositories
CodePy1/Py2
TCDL
JSON-LD
DPL
DCKRTest File
DeveloperContact

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

TCDPLDLAssignment
 

 

 

 

 

 

 

 

General

1Calibre (ebook-converter)Converter

Convert e-books to a number of document formats.https://opensource.ncsa.illinois.edu/bitbucket/projects/POL/repos/converters-ebook-convert/browseN/A

 

Yes

 

Yes

N/A

 

 

 

 

2

CMU Sphinx (ncsa.audio.speech2text)

ExtractorAudio recognition to extract text for speech within audio.https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-core/browse/audio/speech2textN/A 

 

 

 

 

 

 

 

 

 Yes

 

 

 

 

 

Marcus Slavenas

General

3
1
Daffodil
Calibre (
daffodil
ebook-converter)ConverterConvert
formats with a provided DFDL schema to XML.https://opensource.ncsa.illinois.edu/bitbucket/projects/POL/repos/converters-daffodil/browse
e-books to a number of document formats.YesN/AN/A
 
Yes
 Yes
Yes
 

 Yes

N/A  Kenton McHenry4DBPedia (ncsa.dbpedia)ExtractorFind and define named entities within the given text.https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-dbpedia/browse

 Yes

 

 

 

 

 

  5FFmpeg (ffmpeg)ConverterConvert between a large number of video formats.https://opensource.ncsa.illinois.edu/bitbucket/projects/POL/repos/converters-ffmpeg/browseN/A 

 Yes

 

 

N/A  6FLAC (flac)ConverterConvert to and from the FLAC format from other audio formats.https://opensource.ncsa.illinois.edu/bitbucket/projects/POL/repos/converters-flac/browseN/A 

 Yes

 

 Yes

N/A  7Ghostscript (ghostscript)ConverterConvert between document formats.https://opensource.ncsa.illinois.edu/bitbucket/projects/POL/repos/converters-ghostscript/browseN/A 

 Yes

 

 Yes

N/A  8htmldoc (htmldoc)ConverterConvert HTML to a number of document formats.https://opensource.ncsa.illinois.edu/bitbucket/projects/POL/repos/converters-htmldoc/browseN/A 

 Yes

 

 

N/A  Kenton McHenry9ImageMagick (ImageMagick)ConverterConvert between a large number of image formats.https://opensource.ncsa.illinois.edu/bitbucket/projects/POL/repos/converters-imagemagick/browseN/A 

 Yes

 

 Yes

N/A  Kenton McHenry10ImageMagick (ncsa.image.metadata)ExtractorPull available EXIF image metadata from a given image.https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-core/browse/image/metadata

 

 

 

 

 

 

  11Kabeja (kabeja)ConverterConvert between a handful of 3D and image formats.https://opensource.ncsa.illinois.edu/bitbucket/projects/POL/repos/converters-kabeja/browseN/A 

 Yes

 

 Yes

N/A   12

OpenCV - Faces (ncsa.cv.faces)

ExtractorFind faces in an image and return their locations.https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-cv/browse/extractors-opencv/extractors-opencv-faces

 Yes

 

 Yes

 

 Yes

 

  Kenton McHenry13

OpenCV - Eyes (ncsa.cv.eyes)

ExtractorFind eyes in an image and return their locations.https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-cv/browse/extractors-opencv/extractors-opencv-eyes

 Yes

 

 Yes

 

Yes

 

 

Yes

No

Yes
2

CMU Sphinx (ncsa.audio.speech2text)

ExtractorAudio recognition to extract text for speech within audio.YesN/AYesYesYes

 Yes

No 

No 

??who??

3Daffodil (daffodil)ConverterConvert formats with a provided DFDL schema to XML.Yes    

 Yes

 Yes

Yes
4DBPedia (ncsa.dbpedia)ExtractorFind and define named entities within the given text.

 Yes

Py1YesYesYes

Yes

 Yes

 

5FFmpeg (ffmpeg)ConverterConvert between a large number of video formats.YesN/AN/AYesYes

 Yes

No

Yes
6FLAC (flac)ConverterConvert to and from the FLAC format from other audio formats.YesN/AN/AYesYes

 Yes

No

YesBing Zhang
7Ghostscript (ghostscript)ConverterConvert between document formats.YesN/AN/AYesYes

 Yes

No

YesBing Zhang
8htmldoc (htmldoc)ConverterConvert HTML to a number of document formats.YesN/AN/AYesYes

 Yes

No

Yes
9ImageMagick (ImageMagick)ConverterConvert between a large number of image formats.YesN/AN/AYesYes

 Yes

 No

Yes


10ImageMagick (ncsa.image.metadata)ExtractorPull available EXIF image metadata from a given image.

 Yes

Py2YesYesYes

Yes

 Yes

 Yes


11Kabeja (kabeja)ConverterConvert between a handful of 3D and image formats.YesN/AN/AYesYes

 Yes

 Yes

Yes
12

OpenCV - Faces (ncsa.cv.faces)

ExtractorFind faces in an image and return their locations.

 Yes

Py1YesYesYes

 Yes

 Yes

 Yes

13

OpenCV - Eyes (ncsa.cv.eyes)

ExtractorFind eyes in an image and return their locations.

 Yes

Py1YesYesYes

 Yes

Yes

 Yes

 

14

OpenCV - Closeups (ncsa.cv.closeups)

ExtractorDetermine whether an image is a closeup of a person or not.
https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-cv/browse/extractors-opencv/extractors-opencv-closeups

 Yes

 

 Yes

 

 Yes

 

  

 Yes

Py1YesYesYes

 Yes

 Yes

 Yes

15

OpenCV - Profiles (ncsa.cv.profiles)

ExtractorFind human face profiles in an image and return their locations.
https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-cv/browse/extractors-opencv/extractors-opencv-profiles

 Yes

 Yes

Py1
 
Yes
 
Yes
 Yes
Yes
 

Yes

 

 Yes

 

 Yes

16Langid (ncsa.nlp.simplelanguage)ExtractorIdentify the language of the given
text.https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-nlp/browse/SimpleLanguage

 

 

 

 

 

 

 
text.

 Yes

Py2YesYesYes

 Yes

 No

No

 

17LibreOffice (unoconv)ConverterConvert to and from a variety of document formats.
https://opensource.ncsa.illinois.edu/bitbucket/projects/POL/repos/converters-unoconv/browse
YesN/AN/A
  
Yes

 Yes

Yes

 Yes

N/A 

No

Yes
 
Bing Zhang
18NLTK - Summary (ncsa.nlp.simplesummary)ExtractorSummarize a body of text.
https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-nlp/browse/SimpleSummary

 

 

 Yes

 

 

 

  

 Yes

Py2Yes

Yes

Yes

 Yes

No

 No

19Siegfried (siegfried)ExtractorExtract information about a given file relevant to identifying
its type and validating its format.
its type and validating its format.

 Yes

Py2YesYesYes

 Yes

No

 No

https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-siegfried/browse

 

 

 

 

Yes

 

  

20Stanford CoreNLP (ncsa.nlp.SNLP)ExtractorNatural Language Process extractions such as parts of speech
, named entities, langauge, etc.https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-nlp/browse/SNLP/SNLPExtractor

 

 

 

 

 

 

  
, named entities, langauge, etc.

 Yes

Py2 Java equiv.YesYesYes

 Yes

 No

 No

 

21Tesseract (ncsa.image.ocr)ExtractorObject Character Recognition (OCR) to extract text
from images containing text.https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-cv/browse/extractors-tesseract

 Yes

 

 Yes

 

 Yes

 

  Kenton McHenry
from images containing text.

 Yes

Py1YesYesYes

 Yes

 Yes

Yes 

22Tika (ncsa.nlp.tika)ExtractorDocument extractions such as language identification, ...
https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-nlp/browse/tika

 Yes

 

 

 

 

 

  

 Yes

older Py1YesYesYesYes

 

No

 

Yes

 

Kenton McHenry

23txt2html (txt2html)ConverterConvert text documents to HTML.Yes
https://opensource.ncsa.illinois.edu/bitbucket/projects/POL/repos/converters-txt2html/browse
N/AN/A
 
YesYes
 

Yes

 

Yes

N/A  Kenton McHenry
Yes
24Versus - Color Distribution (ncsa.versus.image)ExtractorGenerate a distribution of color values within an image to be used for comparing how similar two
images are.https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-versus/browse
images are.

 Yes

 
Py1
 Yes
Yes
 
Yes
 
Yes
 

 Yes

 

Yes

 Yes

Kenton McHenry

25

VLFeat (ncsa.image.caltech101)

ExtractorClassify images as to whether they contain objects from the Caltech101 dataset (e.g. people, airplanes, motorcycles, cougars, ...).
https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-cv/browse/extractors-vlfeat

 Yes

 

 Yes

 

 

 

  

 Yes

Py2yesyesyes

 Yes

 yes

no

Kenton McHenry

26Zip (zip)ConverterUnzip zip archives.Yes
https://opensource.ncsa.illinois.edu/bitbucket/projects/POL/repos/converters-zip/browse
N/AN/AYes
 
Yes

 Yes

 

Kenton McHenry

No

Yes
N/A  
Bing Zhang

https://opensource.ncsa.illinois.edu/confluence/display/BD/Transformations (Under development, Being Refactored)

GeoTiff Extractor by using