  1. Docker containers
  2. JSONLD
  3. Extractor info registration
  4. Use pyclowder (for Python extractors)
  5. Add status messages to all extractors and fix level granularity
    1. Make status constants (DONE, ERROR)
    2. ArcGIS multiprocessing extractor
  6. Register on on-demand execution queues
    1. Add the on-demand key binding to the configuration file: messageType = "*.file.text.plain", "extractors."+extractorName
  7. Standardize around Python logging
    1. Figure out what to log and what format to follow
  8. Add logstash to docker compose
  9. Add sample input/output to git repository
  10. Add icon for tools catalog to git repository
  11. Add entry to Tools catalog, with icon
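
A minimal sketch of how items 5 to 7 could look inside a Python extractor, using only the standard library. The names `ExtractionStatus`, `routing_keys`, `make_logger` and `status_message`, the example extractor name, and the log format are all assumptions made for illustration (the format is still open under item 7.1). None of this is pyclowder's own API.

```python
import logging
from enum import Enum


class ExtractionStatus(Enum):
    """Hypothetical shared status constants (item 5.1)."""
    DONE = "DONE"
    ERROR = "ERROR"


def routing_keys(extractor_name, message_types):
    """Return the configured bindings plus the on-demand key (item 6.1)."""
    return list(message_types) + ["extractors." + extractor_name]


def make_logger(extractor_name, level=logging.INFO):
    """Create one logger per extractor with a common format (item 7).

    The format string is an assumption, not an agreed standard.
    """
    logger = logging.getLogger(extractor_name)
    if not logger.handlers:  # avoid adding a second handler on repeated calls
        handler = logging.StreamHandler()
        handler.setFormatter(logging.Formatter(
            "%(asctime)s %(levelname)-7s %(name)s: %(message)s"))
        logger.addHandler(handler)
    logger.setLevel(level)
    return logger


def status_message(status, detail):
    """Format a status update the same way in every extractor."""
    return "{}: {}".format(status.value, detail)


# Hypothetical extractor name; a real one comes from the extractor's config file.
extractorName = "ncsa.example.text"
messageType = routing_keys(extractorName, ["*.file.text.plain"])

log = make_logger(extractorName)
log.info(status_message(ExtractionStatus.DONE,
                        "bound to " + ", ".join(messageType)))
```

With this layout the configuration file only needs to list the regular message types; the on-demand key is derived from the extractor name, so the two cannot drift apart.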

 

ID (extractor name from config file, same as queue name) | Programming Language | Software | OS | Can be Dockerized? | Can be uploaded to Docker Hub? | Assigned To | Repo | Author (who wrote or worked on the code)

DEPLOYED
ncsa.image.ocr | Python | Tesseract | Linux | | | Rui | https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-cv/browse/ocr |
ncsa.cv.faces | Python | OpenCV | Linux | | | Rui | https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-cv/browse/opencv | Liana
ncsa.cv.eyes | Python | OpenCV | Linux | | | Rui | https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-cv/browse/opencv | Liana
ncsa.cv.closeups | Python | OpenCV | Linux | | | Rui | https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-cv/browse/opencv | Liana
ncsa.cv.profiles | Python | OpenCV | Linux | | | Rui | https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-cv/browse/opencv | Liana
ncsa.cellprofiler.fluorescentcomet | Python | pymedici (?) | Windows | No | | | https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-cv/browse/cellprofiler | Liana
ncsa.cellprofiler.fly | Python | | Windows | No | | | https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-cv/browse/cellprofiler | Liana
ncsa.cellprofiler.human | Python | | Windows | No | | | https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-cv/browse/cellprofiler | Liana
ncsa.cellprofiler.silvercomet | Python | | Windows | No | | | https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-cv/browse/cellprofiler | Liana
ncsa.cellprofiler.speckle | Python | | Windows | No | | | https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-cv/browse/cellprofiler | Liana
ncsa.cellprofiler.trackobject | Python | | Windows | No | | | https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-cv/browse/cellprofiler | Liana
ncsa.cellprofiler.tumor | Python | | Windows | No | | | https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-cv/browse/cellprofiler | Liana
ncsa.cellprofiler.yeast | Python | | Windows | No | | | https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-cv/browse/cellprofiler | Liana
ncsa.image.sphog | Python | Matlab, mnist-sphog | Linux | | | | https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-cv/browse/handwritten/HandwrittenNumbers |
ncsa.image.caltech101 | | | | | | | |
ncsa.bisque.histogram (note: disabled) | Python | | Linux | | | | |
ncsa.bisque.metadata (note: disabled) | Python | | Linux | | | | |
census-section-segmentor | Java | | Linux | | | | https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-cv/browse/census | Liana, Inna
ncsa.cv.river | Python | OpenCV (Python), convert (from ImageMagick), and GDAL | Linux | | | | https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-cv/browse/river | Liana
ncsa.geo.shpExtractor | Python | gdal | Linux | | | | https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-geo/browse | Jong Lee
ncsa.geo.tiffExtractor | Python | gdal | Linux | | | | https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-geo/browse | Jong Lee
ncsa.image.geotiff | Python | GDAL, Cython, numpy, pygeoprocessing | Linux | | Marcus | Rui | https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-geotiff/browse | Rui, Mostafa Elag
ncsa.image.ponddetect | Python | Matlab | Linux | | | | https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-maps/browse/feature_detection | Marcus, Ankit
ncsa.image.humanpref | Python | Matlab | Linux | | | | https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-maps/browse/humanpref | Marcus, Ankit
ncsa.xml.greenindexroute, ncsa.csv.greenindexroute | Python | OpenCV | Linux | | | | https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-maps/browse/greenroute | Marcus
ncsa.image.knn_numerals | Python | OpenCV | Linux | | | | | Marcus
ncsa.audio.speech2text | Java | CMU Sphinx, ffmpeg, sox | Linux | | | | https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-core/browse/audio/speech2text | Marcus
ncsa.audio.preview | Python | | | | | Inna | https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-core/browse/audio/preview |
ncsa.nlp.simplelanguage | Python | numpy | | | | Inna | https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-nlp/browse/SimpleLanguage | Liana
ncsa.nlp.simplesummary | Python | Natural Language Toolkit (NLTK) for Python, NLTK Data or at least: nltk.corpus, nltk.stem.porter and nltk.tokenize.punkt | | | | | https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-nlp/browse/SimpleSummary | Liana
ncsa.nlp.SNLPSentiment | Java | Stanford CoreNLP tool, java, maven | | | | | https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-nlp/browse/SNLP/SNLPSentimentExtractor | Liana, Marcus(?)
ncsa.nlp.wordtables | Python | requests, pika, win32com | | | | | https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-nlp/browse/WordTablesExtractor | Liana
siegfried | Python | | | | | | https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-siegfried/browse | Gregory Jansen
ncsa.versus.image | Java | Versus | Linux | | | | https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-versus/browse | Kenton, Smruti
ncsa.image.preview (note: check if really deployed; there is an extractor in the Hosted VMs list with a similar name) | Python | | | | | | https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-core/browse/image/preview | Rob, Sandeep
ncsa.pdf.preview (note: check if really deployed; there is an extractor in the Hosted VMs list with a similar name) | Python | | | | | | https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-core/browse/pdf/preview | Rob
ncsa.video.preview (note: check if really deployed; there is an extractor in the Hosted VMs list with a similar name) | Python | | | | | | https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-core/browse/video/preview | Rob

NOT DEPLOYED
ncsa.image.digitpy | Python | opencv | | | | | https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-cv/browse/handwritten/SimpleDigitPython |
ncsa.cv.pdfimages | | pdfimages, from poppler-utils | | | | | https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-cv/browse/poppler |
ncsa.cv.caltech101 | Python | Matlab and VLFeat | 64-bit Mac OS | | | | https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-cv/browse/vlfeat |
dbpedia | Python | Natural Language Toolkit (NLTK) and rdflib | | | | Luigi Marini | https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-dbpedia/browse | Luigi Marini
digest | Python | | | | | | https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-digest/browse |
ncsa.hpc | Python | | | | | | https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-hpc/browse | Sandeep Puthanveetil Satheesan
LSVA | Java | | | | | | https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-lsva/browse | Liana, Constantinos
LSVA integrated | | | | | | Sandeep Puthanveetil Satheesan | https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-lsva-integrated/browse | Sandeep Puthanveetil Satheesan
ncsa.movieslice | Python | | | | | | https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-movieslice/browse | Sandeep
mri2mesh | Python | pymedici, subprocess, logging, os, numpy, shutil, zipfile | | | | | https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-mri/browse/mri2mesh | Marcus
msc-ChemCBCExtractor | Python | requests, pika, openpyxl, xlrd, pymongo | Linux | | | Yan Zhao | https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-msc/browse/ChemCBCExtractor | Yan
msc-IsletExtractor | Python | requests, pika, openpyxl, xlrd, pymongo | Linux | | | Yan Zhao | https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-msc/browse/IsletExtractor | Yan
msc-MonitorExtractor | Python | requests, pika, openpyxl, xlrd, pymongo | Linux | | | Yan Zhao | https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-msc/browse/MonitorExtractor | Yan
ncsa.msc.dailymonitor | Python | requests, pika, openpyxl, xlrd, pymongo | | not used | | | https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-msc/browse/OldMonitorExtractor | Ashwini
msc-PhenotypeExtractor | Python | requests, pika, openpyxl, xlrd, pymongo | Linux | | | Yan Zhao | https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-msc/browse/PhenotypeExtractor | Yan
ncsa.nlp.SNLP | Java | Stanford CoreNLP tool, java, maven | | | | | https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-nlp/browse/SNLP/SNLPExtractor | Liana
ncsa.nlp.tika | Python | Tika project page, pymedici | | | | | https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-nlp/browse/tika | Liana
person-detector | Python | MATLAB, FFMPEG, requests and pika | | | | | https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-person-detector/browse/python | Sandeep
ncsa.person-tracker | Python | python, MATLAB, FFMPEG, requests and pika | | | | | https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-person-tracking/browse/python | Sandeep
terra.plantcv | Python | pika, requests, wheel | | | | | https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-plantcv/browse | Yan
medici_PTM_thumbnails | Java | | | | | | https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-ptm/browse/PTMThumbnailExtractor | Constantinos
medici_PTM_metadata | Java | | | | | | https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-ptm/browse/PTMMetadataExtractor | Constantinos
Name not clear: PtmMetadata(?) | Java | | | | | | https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-ptm/browse/PTMMetadata | Constantinos
medici_ptm_maps | Java | | | | | | https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-ptm/browse/PTMMapsExtractor | Constantinos
medici_ptm_3d | Java | | | | | | https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-ptm/browse/PTM3DExtractor | Constantinos
medici_images_ptm | Java | | | | | | https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-ptm/browse/ImagesPTMExtractor | Constantinos
extractors-rabbitmq (look like examples) | | | | | | | https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-rabbitmq/browse |
Name not clear: extractors-seabird/ | Scala | | | | | | https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-seabird/browse | Luigi
medici_3d_x3d (one of extractors-3d) | Java | | | | | | https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-3d/browse/ObjJSONExtractor | Constantinos
medici_3d_obj_merger (one of extractors-3d) | Java | | | | | | https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-3d/browse/OBJMergerExtractor | Constantinos
medici_oni (one of extractors-3d) | Java | | | | | | https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-3d/browse/OniExtractor | Constantinos
medici_ply_obj (one of extractors-3d) | Java | | | | | | https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-3d/browse/PlyObjExtractor | Constantinos
medici_3d_metadata (one of extractors-3d) | Java | | | | | | https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-3d/browse/ThreeDMetadataExtractor | Constantinos
medici_x3d_html (one of extractors-3d) | Java | | | | | | https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-3d/browse/X3DhtmlExtractor | Constantinos
ncsa.arcgis.landsat7mosaic | Python | ArcGIS | Windows | No | | Smruti Padhy | https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-bd-cz/browse/ndviextractor | Smruti
ncsa.arcgis.floodplain | Python | ArcGIS | Windows | No | | Smruti Padhy | https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-bd-cz/browse/terex_floodplain/config.py | Smruti
medici_book | Java | | | | | | https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-books/browse/BookPreviewExtractor | Theerasit Issaranon
medici_image_pyramid | Java | | | | | | https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-books/browse/ImagePreviewPyramidExtractor-shebook | Theerasit Issaranon
shebook | Java | | | | | | https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-books/browse/SheBookPreviewExtractor/src/BookPreviewExtractor, https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-books/browse/SheBookPreviewExtractor/src/bookpreviewextractor | Theerasit Issaranon
lsva-cedd | Java | | | | | | https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-cedd/browse | Constantinos
ncsa.cinemetrics | Python | | | | | | https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-cinemetrics/browse | Constantinos
ncsa.image.metadata | Python | | | | | | https://opensource.ncsa.illinois.edu/bitbucket/projects/CATS/repos/extractors-core/browse/image/metadata | Max, Rob
ncsa.debod.segmentor | | | | | | | https://opensource.ncsa.illinois.edu/bitbucket/projects/DEBOD/repos/extractors-cellsegmentor/browse |
ncsa.image.dmp | | | | | | | https://opensource.ncsa.illinois.edu/bitbucket/projects/DEBOD/repos/extractors-debod/browse, https://opensource.ncsa.illinois.edu/bitbucket/projects/DEBOD/repos/extractors-dmp/browse |
ncsa.image.sphog.debod | | | | | | | https://opensource.ncsa.illinois.edu/bitbucket/projects/DEBOD/repos/extractors-handwrittendecimals/browse |
ncsa.image.iarp_remove_circle | | | | | | | https://opensource.ncsa.illinois.edu/bitbucket/projects/IARP/repos/image_fetcher/browse/extractors/remove_circle | Marcus
ncsa.cv.meangrey | | | | | | | https://opensource.ncsa.illinois.edu/bitbucket/projects/IARP/repos/image_fetcher/browse/extractors/mean_grey | Marcus