Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

Scientific

ID (Extractor Name from config file,

same as queue name)

Programming

Language

SoftwareOSCan be Dockerized?Can be upload to Docker Hub ?Assigned ToLink to repoWho wrote or worked on the codeDEPLOYED
Domain ToolType

Description                                     

RepositoriesContact
Hydrology




 

 
  








Advection Diffusion 
 
Solve a general advection-dispersion equation. 
 ncsa.image.ocrPythonTesseractLinux  Ruiocr 

ncsa.cv.faces

PythonOpenCVLinux  
Chemical Mean AgeExtractorDetermine the mean age of chemical constituents with inputs of chemical dynamics. 
Document Tables Extractor (ncsa.nlp.wordtables)ExtractorExtract tables from documents.
Rui
https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-
cv
nlp/browse/
opencv
WordTablesExtractor
Liana
 
GDAL (ncsa.
cv
geo.
eyesPythonOpenCVLinux
shpExtractor)Extractor
 
 
Rui
https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-
cv
geo/browse
/opencvLiana
GDAL (ncsa.
cv
geo.
closeups
tiffExtractor)
Python
Extractor
OpenCVLinux
 
 Rui
https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-
cv
geo/browse
/opencvLiana
GDAL (ncsa.
cv
image.
profilesPythonOpenCV
geotiff)Extractor
Linux
 
 Rui
https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-
cv
geotiff/browse
/opencvLiana

ncsa.cellprofiler.fluorescentcomet

Pythonpymedici (question)WindowsNo 
Historical River Extractor (ncsa.cv.river)ExtractorExtract the river networks from the ancient hand-drawing maps and compare them with current river networks
 
https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-cv/browse/
cellprofiler
river
Liana
Normalized Difference Vegetation Index (ncsa.
cellprofiler
arcgis.
flyPython WindowsNo
landsat7mosaic)Extractor
 
 https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-bd-
cv
cz/browse/
cellprofiler

ncsa.cellprofiler.human

Python WindowsNo  
ndviextractor
Liana
River Chi IndexExtractorIdentify the river dynamics in a river basin and evaluate human activities' influences through Chi index in the streams.
https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-bd-
cv
cz/browse/
cellprofilerLiana

ncsa.cellprofiler.silvercomet

Python WindowsNo  https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-cv/browse/cellprofilerLiana

ncsa.cellprofiler.speckle

Python WindowsNo  https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-cv/browse/cellprofilerLiana

ncsa.cellprofiler.trackobject

Python WindowsNo  
chi-analysis
River SinuosityExtractorStudy the maturity and equilibrium conditions of a stream through the sinuosity index.  
Soil Moisture ChangeExtractorDetermine role of hydraulic redistribution in AZ (riparian site / upland site) by studying soil moisture change throughout different seasons. 
Species ClassifierExtractorSAM based Species Classification from Hyperspectral data, Hyperspectral Indices, NDVI, SAVI, MSAVI, etc. 

Debsunder Dutta (debsunderdutta@gmail.com) leaving in July 2016

Sandeep Puthanveetil Satheesan

TerEx (ncsa.arcgis.floodplain)ExtractorIdentify the flat polygons and the heights inside a river valley.  
https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-
cv
bd-cz/browse/
cellprofilerLiana

ncsa.cellprofiler.tumor

Python WindowsNo  
terex_floodplain
Topographic DepressionsExtractorIdentify topographic depressions (TDs) and their distribution on landscape (Number, location, area, volume of TDs).
https://
opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-cv/browse/cellprofilerLiana
github.com/HydroComplexity/TDI
Tree DelineationConverterTree-wise voxelization of waveform data for lidar metrics that describes canopy structure (max intensity, height, etc...).  Individual tree delineation, tree leaf area density to describe vertical leaf distribution.

ncsa.cellprofiler.yeast

Python WindowsNo  
https://opensource.ncsa.illinois.edu/bitbucket/projects/
CATS
BD/repos/
extractors
ir-
cv
lidar/browse
/cellprofiler
Liana

ncsa.image.sphog

Python Matlab, mnist-sphog Linux  https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-cv/browse/handwritten/HandwrittenNumbers 
  • ncsa.image.caltech101
       ncsa.bisque.histogram (notes: disabled)Python Linux     ncsa.bisque.metadata (notes: disabled)Python Linux     census-section-segmentorJava Linux  
Valley Safety ZonesExtractorEstimate submerging areas and water depths under extreme floods and map the safety zones in a river valley.
 
Vegetation IndicesExtractorCalculating vegetation indices such as NDVI and Surface Temperature from Landsat 7 and 8 satellite data. 

Debsunder Dutta (debsunderdutta@gmail.com) leaving in July 2016

Sandeep Puthanveetil Satheesan

Ecology

 






netcdf (ncdump)ConverterConvert from binary netcdf to text.https
https
://opensource.ncsa.illinois.edu/bitbucket/projects/
CATS
POL/repos/
extractors
converters-
cv
ncdump/browse
/censusLiana, Innancsa.cv.river PythonOpenCV (python), convert (from imagemagick), and GdalLinux  
PEcAn (PEcAn#Ameriflux)ConverterConvert Ameriflux data to PEcAn's netcdf CF format.
Smruti Padhy
https://opensource.ncsa.illinois.edu/bitbucket/projects/
CATS
POL/repos/
extractors
converters-
cv
pecan/browse
/riverncsa.geo.shpExtractorPythongdalLinux  
Liana
PEcAn (PEcAn#DALEC)ConverterConvert PEcAn's netcdf CF format to the format required by the DALEC model.
https://opensource.ncsa.illinois.edu/bitbucket/projects/
CATS
POL/repos/
extractors
converters-
geo
pecan/browse
Jong Lee
PEcAn (PEcAn#ED2)ConverterConvert PEcAn's netcdf CF format to the format required by the ED model.
ncsa.geo.tiffExtractorPythongdalLinux  Jong Lee
https://opensource.ncsa.illinois.edu/bitbucket/projects/
CATS
POL/repos/
extractors
converters-
geo
pecan/browse
Jong Leencsa.image.geotiffPython

GDAL, Cython, numpy,
pygeoprocessing

Linux  Rui
PEcAn (PEcAn#LINKAGES)ConverterConvert PEcAn's netcdf CF format to the format required by the LINKAGES model.https
https
://opensource.ncsa.illinois.edu/bitbucket/projects/
CATS
POL/repos/
extractors
converters-
geotiff

ncsa.image.ponddetect

PythonMatlabLinux  
pecan/browse
Rui, Mostafa Elag
PEcAn (PEcAn#Sipnet)ConverterConvert PEcAn's netcdf CF format to the format required by the Sipnet model.
https://opensource.ncsa.illinois.edu/bitbucket/projects/
CATS
POL/repos/
extractors
converters-
maps
pecan/browse
/feature_detectionMarcus, Ankit

PEcAn (PEcAn#AmerifluxBNL)

PEcAn (PEcAn#FLUXNET2015)

PEcAn (PEcAn#FACE)

PEcAn (PEcAn#PALEON)

PEcAn (PEcAn#NLDAS)

PEcAn (PEcAn#CURNCEP)

PEcAn (PEcAn#GLDAS)

PEcAn (PEcAn#GFDL)

PEcAn (PEcAn#BIOCRO)

PEcAn (PEcAn#CLM)

PEcAn (PEcAn#GDAY)

PEcAn (PEcAn#JULES)

PEcAn (PEcAn#LPJ-GUESS)

PEcAn (PEcAn#MAAT)

PEcAn (PEcAn#MAESPA)

PEcAn (PEcAn#PRELES)

   
PlantCV (terra.plantcv)ExtractorExtract plant height, area, and color distribution from photographs.
ncsa.image.humanprefPythonMatlabLinux  https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-maps/browse/humanprefMarcus, Ankit

ncsa.xml.greenindexroute, ncsa.csv.greenindexroute

PythonOpenCVLinux  Marcus Slavenas
https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-
maps
plantcv/browse
/greenrouteMarcus

ncsa.image.knn_numerals

PythonOpenCVLinux   Marcus

ncsa.audio.speech2text

JavaCMU Sphinx, ffmpeg, soxLinux  
Yan Liu
Civil & Environmental Engineering



 

Body of Water Detector (ncsa.image.ponddetect)

ExtractorLand coverage, extract locations of bodies of water from satellite data.https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-
core
maps/browse/
audio/speech2text
feature_detection
ncsa.audio.preview
GI IdentificationExtractor
Python
  
  
Human Preference Score (ncsa.image.humanpref)ExtractorAssign a model derived human preference score to a given image of an urban environment.
Inna
https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-
core
maps/browse/
audio/preview ncsa.nlp.simplelanguagePythonnumpy   Inna
humanpref

Route Greenness (ncsa.xml.greenindexroute)

ExtractorDerive the green index of a city route.
https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-
nlp
maps/browse/
SimpleLanguageLiana
greenroute
Social Media GI PreferencesExtractor  
Stanford CoreNLP - Sentiment (ncsa.nlp.
simplesummary
SNLPSentiment)
Python

Natural Language Toolkit (NLTK) for Python, NLTK Data or at least:

 nltk.corpus,nltk.stem.porter and nltk.tokenize.punkt.

   
ExtractorAssign a sentiment score to a piece of text.https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-nlp/browse/SNLP/
SimpleSummary
SNLPSentimentExtractor
Lianancsa.nlp.SNLPSentimentJava Stanford CoreNLP tool, java, maven
TUV TriaxusExtractor  
 
Social Science & Humanities













Bertillon Card Cell Extractor (ncsa.image.dmp)ExtractorExtract table cells from a Bertillon Card.https://opensource.ncsa.illinois.edu/bitbucket/projects/
CATS
DEBOD/repos/extractors-
nlp/browse/SNLP/SNLPSentimentExtractorLiana, Marcus(?)
debod
Census Form Cell Extractor (census-section-segmentor) ExtractorExtract table cells from a 1930s Census form.
ncsa.nlp.wordtablesPython requestspikawin32com    
https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-
nlp
cv/browse/
WordTablesExtractor
census
Liana
Handwritten Decimals Extractor (ncsa.image.sphog.debod)ExtractorExtract handwritten decimal values from an image.
siegfriedPython     
https://opensource.ncsa.illinois.edu/bitbucket/projects/
CATS
DEBOD/repos/extractors-
siegfried/browseGregory Jansenncsa.versus.imageJavaVersusLinux  
handwrittendecimals
Killed Photos (ncsa.image.iarp_remove_circle)ExtractorIdentify depression era photos "killed" by Farm Security Administration director Roy Stryker, indicated by a hole punch in the image.https://opensource.ncsa.illinois.edu/bitbucket/projects/
CATS
IARP/repos/
extractors-versus/browseKenton, Smrutincsa.image.preview (note: check if really deployed. there is an extractor in Hosted VMs list with a similar name.)Python     
image_fetcher/browse/extractors/remove_circleMarcus Slavenas
Mean Grey (ncsa.cv.meangrey)ExtractorMean grey values of black and white photos.https://opensource.ncsa.illinois.edu/bitbucket/projects/IARP/repos/image_fetcher/browse/extractors/mean_greyMarcus Slavenas
Movie Slice (ncsa.movieslice)Extractor https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-
core
movieslice/browse
/image/preview
Rob,
ncsa.pdf.preview (note: check if really deployed. there is an extractor in Hosted VMs list with a similar name.)Python     
Person Detector (person-detector)ExtractorExtract locations of people in an image.https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-
core/browse/pdf/previewRobncsa.video.preview (note: check if really deployed. there is an extractor in Hosted VMs list with a similar name.)Python     
person-detector
Person Tracker (ncsa.person-tracker)ExtractorExtract locations and paths of people moving in videos.
https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-
core/browse/video/previewRob
person-tracking
Video Analytics Toolbox (ncsa.cinemetrics_batch)ExtractorExtract shot descriptors from videos.https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-lsva-integrated/browse
Biology, Genomics, MedicineFSL (mri2mesh)Extractor https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-mri/browse/mri2meshMarcus Slavenas
Glomeruli (ncsa.msc.diagnosis)ExtractorExtract glomeruli from Kidney biopsy images.https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-msc/browse/Kidney

General

              
Domain

Tool

TypeDescription                                     RepositoriesContact

 

 

 

 

 

 

 

 

 

 

 

 

 

NOT DEPLOYED

 

 

 

 

 

 

 

 

 

ncsa.image.digitpyPython

 

 

 

opencv

 

  

General

Calibre (ebook-converter)ConverterConvert e-books to a number of document formats.
 
https://opensource.ncsa.illinois.edu/bitbucket/projects/
CATS
POL/repos/
extractors
converters-ebook-
cv
convert/browse
/handwritten/SimpleDigitPython
 

CMU Sphinx (ncsa.

cv

audio.

pdfimages

speech2text)

 pdfimages, from poppler-utils   
ExtractorAudio recognition to extract text for speech within audio.
 
https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-
cv
core/browse/audio/
poppler
speech2text
 ncsa.cv.caltech101PythonMatlab and VLFeat 64-bit Mac OS    
Daffodil (daffodil)ConverterConvert formats with a provided DFDL schema to XML.https://opensource.ncsa.illinois.edu/bitbucket/projects/
CATS
POL/repos/
extractors
converters-
cv
daffodil/browse
/vlfeat 
Kenton McHenry
DBPedia (ncsa.dbpedia)ExtractorFind and define named entities within the given text.
dbpediaPython Natural Language Toolkit (NLTK) and rdflib.   Luigi Marini
https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-dbpedia/browse
digestPython     
FFmpeg (ffmpeg)ConverterConvert between a large number of video formats.https:/
https:/
/opensource.ncsa.illinois.edu/bitbucket/projects/
CATS
POL/repos/
extractors
converters-
digest
ffmpeg/browse
 ncsa.hpcPython     
FLAC (flac)ConverterConvert to and from the FLAC format from other audio formats.https://opensource.ncsa.illinois.edu/bitbucket/projects/
CATS
POL/repos/
extractors
converters-
hpcLSVAJava     
flac/browse
Ghostscript (ghostscript)ConverterConvert between document formats.
https://opensource.ncsa.illinois.edu/bitbucket/projects/
CATS
POL/repos/
extractors
converters-
lsva
ghostscript/browse
Liana, ConstantinosLSVA integrated     
htmldoc (htmldoc)ConverterConvert HTML to a number of document formats.https://opensource.ncsa.illinois.edu/bitbucket/projects/
CATS
POL/repos/
extractors
converters-
lsva-integrated
htmldoc/browse
Kenton McHenry
ImageMagick (ImageMagick)ConverterConvert between a large number of image formats.
ncsa.movieslicePython     
https://opensource.ncsa.illinois.edu/bitbucket/projects/
CATS
POL/repos/
extractors
converters-
movieslice
imagemagick/browse
Sandeepmri2meshPythonpymedici, subprocess, logging, os, numpy, shutil, zipfile    
Kenton McHenry
ImageMagick (ncsa.image.metadata)ExtractorPull available EXIF image metadata from a given image.https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-
mri
core/browse/
mri2meshMarcusmsc-ChemCBCExtractorPythonrequests, pika, openpyxl, xlrd, pymongoLinux  Yan Zhao
image/metadata
Kabeja (kabeja)ConverterConvert between a handful of 3D and image formats.https://opensource.ncsa.illinois.edu/bitbucket/projects/
CATS
POL/repos/
extractors
converters-
msc
kabeja/browse
/ChemCBCExtractor
Yan
 
msc-IsletExtractorPythonrequests, pika, openpyxl, xlrd, pymongoLinux  

OpenCV - Faces (ncsa.cv.faces)

ExtractorFind faces in an image and return their locations.https://
https://
opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-
msc
cv/browse/
IsletExtractorYan
extractors-opencv/extractors-opencv-facesKenton McHenry

OpenCV - Eyes (ncsa.cv.eyes)

ExtractorFind eyes in an image and return their locations.https:
msc-MonitorExtractorPythonrequests, pika, openpyxl, xlrd, pymongoLinux  https:
//opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-
msc
cv/browse/
MonitorExtractorYan
extractors-opencv/extractors-opencv-eyesKenton McHenry

OpenCV - Closeups (ncsa.cv.closeups)

ExtractorDetermine whether an image is a closeup of a person or not.https:
ncsa.msc.dailymonitorPythonrequests, pika, openpyxl, xlrd, pymongo   not usedhttps:
//opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-
msc
cv/browse
/OldMonitorExtractorAshwinimsc-PhenotypeExtractorPython

requests, pika, openpyxl, xlrd, pymongo

Linux  
/extractors-opencv/extractors-opencv-closeupsKenton McHenry

OpenCV - Profiles (ncsa.cv.profiles)

ExtractorFind human face profiles in an image and return their locations.https
https
://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-
msc
cv/browse
/PhenotypeExtractorYan
/extractors-opencv/extractors-opencv-profilesKenton McHenry
Langid (ncsa.nlp.
SNLP
simplelanguage)
Java Stanford CoreNLP tool, java, maven    
ExtractorIdentify the language of the given text.https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-nlp/browse/
SNLP/SNLPExtractorLiana
SimpleLanguage
LibreOffice (unoconv)ConverterConvert to and from a variety of document formats.
ncsa.nlp.tikaPython Tika project page, pymedici   Kenton McHenry
https://opensource.ncsa.illinois.edu/bitbucket/projects/
CATS
POL/repos/
extractors
converters-
nlp
unoconv/browse
/tika
Liana
Bing Zhang
person-detectorPython MATLAB, FFMPEG, requests and pika    
NLTK - Summary (ncsa.nlp.simplesummary)ExtractorSummarize a body of text.https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-
person-detector
nlp/browse/
python
SimpleSummary
Sandeepncsa.person-trackerPythonpython, MATLAB, FFMPEG requests and pika    
Siegfried (siegfried)ExtractorExtract information about a given file relevant to identifying its type and validating its format.https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-
person-tracking
siegfried/browse
/pythonterra.plantcvPython

pika
requests
wheel

    
Sandeep
Stanford CoreNLP (ncsa.nlp.SNLP)ExtractorNatural Language Process extractions such as parts of speech, named entities, langauge, etc.
https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-
plantcv
nlp/browse
Yanmedici_PTM_thumbnailsJava
/SNLP/SNLPExtractor 
   
Tesseract (ncsa.image.ocr)ExtractorObject Character Recognition (OCR) to extract text from images containing text.
 
https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-
ptm
cv/browse/
PTMThumbnailExtractorConstantinosmedici_PTM_metadataJava     
extractors-tesseractKenton McHenry
Tika (ncsa.nlp.tika)ExtractorDocument extractions such as language identification, ...
https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-
ptm
nlp/browse/
PTMMetadataExtractor
tika
Constantinos

Name not clear

PtmMetadata(?)

Java     
txt2html (txt2html)ConverterConvert text documents to HTML.https://opensource.ncsa.illinois.edu/bitbucket/projects/
CATS
POL/repos/
extractors
converters-
ptm
txt2html/browse
/PTMMetadatamedici_ptm_mapsJava     
Constantinos
Kenton McHenry
Versus - Color Distribution (ncsa.versus.image)ExtractorGenerate a distribution of color values within an image to be used for comparing how similar two images are.
https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-
ptm
versus/browse
/PTMMapsExtractorConstantinosmedici_ptm_3dJava     

VLFeat (ncsa.image.caltech101)

ExtractorClassify images as to whether they contain objects from the Caltech101 dataset (e.g. people, airplanes, motorcycles, cougars, ...).
https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-
ptm
cv/browse/
PTM3DExtractorConstantinosmedici_images_ptmJava     
extractors-vlfeatKenton McHenry
Zip (zip)ConverterUnzip zip archives.
https://opensource.ncsa.illinois.edu/bitbucket/projects/
CATS
POL/repos/
extractors
converters-
ptm
zip/browse
/ImagesPTMExtractor
Constantinos

extractors-rabbitmq

(look like examples)

      

https://opensource.ncsa.illinois.edu/

...

confluence/

...

...

...

https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-books/browse/SheBookPreviewExtractor/src/BookPreviewExtractor

 

https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-books/browse/SheBookPreviewExtractor/src/bookpreviewextractor

...

display/

...

BD/Transformations (Under development, Being Refactored)