Clowder
  1. Clowder

extractors-core

Public
AuthorCommitMessageCommit dateIssues
Rob KooperRob Kooper
7044feb75afUpdate CHANGELOG.md
Sandeep Puthanveetil SatheesanSandeep Puthanveetil Satheesan
a5629642990Updated CHANGELOG.
Sandeep Puthanveetil SatheesanSandeep Puthanveetil Satheesan
5b66956509fMMerge branch 'develop' into release/v0.4.0
Sandeep Puthanveetil SatheesanSandeep Puthanveetil Satheesan
a225094afa3Added CHANGELOG
Rob KooperRob Kooper
e40aadde6ecuse larger chunk sizethis will speed up reading file, default is to read 1 byte a time. Example file of 90KB took 1.2 seconds for 1 byte and 0.003 seconds when using 10KB chunks.
Rob KooperRob Kooper
f1892ea2d91fix spelling of kooper
Rob KooperRob Kooper
a35f1e44593fix shell issue
Rob KooperRob Kooper
9ac72a8935aupdate core to use pyclowder2- update all packages to use pyclowder2, bumped version number to 2.0.0 - most extractors use the binary-preview-extractor - split docker.sh into release.sh and docker.sh. docker.sh will build docker containers, release.sh will do the actual push to docker hub.
Jim MyersJim Myers
f6a27f9ebebBugfix: all digests were being calculated incorrectly...due to an extra '/' in the retrieval URL (which retrieved a 'you must be logged in' page instead of the file contents). This affects all digests calculated since the bug was introduced (with updates to pyclowder2 or a change to how clowder responds to the extra / ?). Note that the extra / was also being sent in the metadata which shows in the GUI as a bad link (goes to the internal error page w...
Sandeep Puthanveetil SatheesanSandeep Puthanveetil Satheesan
6ce55ecd712Updated git URL and added docker image name in speech2text extractor info
Sandeep Puthanveetil SatheesanSandeep Puthanveetil Satheesan
762e0259e92MResolved merge conflicts.
Sandeep Puthanveetil SatheesanSandeep Puthanveetil Satheesan
45b84c923bcRemoved unwanted file
Sandeep Puthanveetil SatheesanSandeep Puthanveetil Satheesan
5f48725c31fUpdate base docker image in Dockerfile
Sandeep Puthanveetil SatheesanSandeep Puthanveetil Satheesan
c20e97704acRemoved carriage return control character
Marcus SlavenasRob KooperMarcus Slavenas
2092dcb9fb7change test json format, added extractor info, changed context in jsonld Added correct test json and jsonld Changed context url, changed clowder to clowder/clowder:1.1.1, added key to url, changed test output metadata added tests Add portions of config to defaults in code, shrink the conversion to act on any audio file (testing needed) Issue bd-1117, cleaned up all temp files a little code cleanup Added jsonld from BD-1111, added get rabb...BD-1111
Yan ZhaoYan Zhao
b144c1830c3remove filename and space
Yan ZhaoYan Zhao
602fe2f6376update test
Yan ZhaoYan Zhao
ed989081c75update test
inna zharnitskyinna zharnitsky
e2451cfa1a1Renamed the output json file
inna zharnitskyinna zharnitsky
70ad4984325Renamed output file, it contains the full json output of the extraction, for reference purposes
inna zharnitskyinna zharnitsky
19c1e98c9ccAdded test folder and test input/output files
Rob KooperRob Kooper
c977c182cb2use clowder organization
Rob KooperRob Kooper
4190a84a9d2use from clowder/pyclowder
Rob KooperRob Kooper
d47685b2617docker now only pushes to clowder
Sandeep Puthanveetil SatheesanSandeep Puthanveetil Satheesan
923cbdc5ac7Updated extractor_info.json to include docker image names
Sandeep Puthanveetil SatheesanSandeep Puthanveetil Satheesan
8d0ff414467Updated extractor_info.json to include docker image names
Max BurnetteMax Burnette
aeba85a7b67Simplify, return None if no id is found
Max BurnetteMax Burnette
8bda27b0564Remove deprecated reference to parent_dataset_id
Rob KooperRob Kooper
4555bcf7048copy extractor_info.json
Max BurnetteMax Burnette
0ca96c16142Fix docker image name for pyclowder 2
Max BurnetteMax Burnette
97ffe80e161pull from extractor base version 2This should include pyClowder2.
Max BurnetteMax Burnette
a0052cb1278Convert digest to require pyclowder2 instead of 1
Rob KooperRob Kooper
ac0a9a88662fix clowder url
Rob KooperRob Kooper
37ed2168752add clowder url
Rob KooperRob Kooper
d333795766e@context should be array
Rob KooperRob Kooper
611d95a2121more video debugging
Rob KooperRob Kooper
2782cc5024bsend error back to clowder
Rob KooperRob Kooper
8d7c8de07d2scale does not work
Rob KooperRob Kooper
d3ee0ce78dbmissing registration endpoints
Rob KooperRob Kooper
47c2ec94787don't install logstash
Rob KooperRob Kooper
1a4be8617a1MMerge pull request #8 in CATS/extractors-core from extractor-info to master* commit '2d007a0f6bb4b2ac4449c663c33cef844f011357': register extractor add extractor_info.json and registration
Rob KooperRob Kooper
8ab833de498use dash not bash
Rob KooperRob Kooper
54fadbf5a0dcreate digest container
Max BurnetteMax Burnette
65bff0ce0faAdded extractor_info and requirements file
Max BurnetteMax Burnette
4be08398c68MMerge branch 'master' into feature/CATS-565-move-sha-512-calculation-to-extractorCATS-565
Rob KooperRob Kooper
44fb5b362b3MMerge pull request #3 in CATS/extractors-core from fix-jsonld-in-image-metadata-extractor to master* commit 'f0ccc1dc947e679045115e8262f73ce9e219889e': Give image md extractor proper machine-readable context update @context of metadata to conform to jsonld standard
Rob KooperRob Kooper
b880c01f066missing mimetype for excel
Rob KooperRob Kooper
2d007a0f6bbregister extractor
Rob KooperRob Kooper
736e0378fe2MMerge remote-tracking branch 'origin/master' into extractor-info
Rob KooperRob Kooper
8495f2af803MMerge pull request #7 in CATS/extractors-core from feature/CATS-558-office-file-format-extractor-to to master* commit '4018694fc46b812f331ba00f4a150ab57f204775': fix registration office extractorCATS-558