  1. BrownDog
  2. BD-1724

Make Paper Form Classifier Test Collection


    • Type: Task
    • Resolution: Unresolved
    • Priority: Normal
    • Tools
    • Brown Dog - August Sprint 1, Brown Dog - Aug/Sept Sprint

      There is already a "tickytacky" extractor on Docker Hub that supports this workflow by identifying the significant vertical and horizontal lines in an image. A clustering algorithm run after extraction can then group images of like forms together. I am developing this workflow on a combined DRAS-TIC/Clowder setup.

      I have tested clustering locally with image files and am now working to implement the workflow on the DRAS-TIC CI-BER cluster. This involves loading a collection from NARA, running the extractor, and trying various clustering approaches, both in parallel (Spark over DRAS-TIC) and as a separate process (exporting metrics for the job).
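As an illustration of the "separate process" option, the sketch below groups images from exported metrics. Everything about it is an assumption made for the example: the CSV layout (`image_id,h_lines,v_lines` with semicolon-separated normalized positions), the function names, and the greedy grouping itself, which is a simple stand-in and not one of the clustering approaches under evaluation.

```python
import csv
import io


def load_metrics(csv_text):
    """Parse exported rows into {image_id: (horizontal, vertical)} position lists."""
    records = {}
    for row in csv.DictReader(io.StringIO(csv_text)):
        h = [float(v) for v in row["h_lines"].split(";") if v]
        v = [float(x) for x in row["v_lines"].split(";") if x]
        records[row["image_id"]] = (h, v)
    return records


def distance(a, b):
    """Infinite if the line counts differ, else the largest positional offset."""
    (ha, va), (hb, vb) = a, b
    if len(ha) != len(hb) or len(va) != len(vb):
        return float("inf")
    return max((abs(p - q) for p, q in zip(ha + va, hb + vb)), default=0.0)


def group_forms(records, tolerance=0.02):
    """Greedy grouping: join the first group whose first member is within tolerance."""
    groups = []
    for image_id, sig in sorted(records.items()):
        for group in groups:
            if distance(records[group[0]], sig) <= tolerance:
                group.append(image_id)
                break
        else:
            groups.append([image_id])
    return groups


# Hypothetical export: a.tif and b.tif share a layout, c.tif does not.
exported = """image_id,h_lines,v_lines
a.tif,0.10;0.50,0.25
b.tif,0.11;0.49,0.26
c.tif,0.30,0.25;0.75
"""

groups = group_forms(load_metrics(exported))
print(groups)
```

Because the grouping only reads the exported file, it can run anywhere after the extraction job finishes. The trade-off against the Spark option is that the whole metrics set has to fit on one machine.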

       

      This task is to build a test collection for this workflow, using NARA image collections. Specifically, we will use WWII records collections related to the restitution of seized artworks.

              gregjansen Gregory Jansen
              Votes: 0
              Watchers: 1


                  Estimated: 1d
                  Remaining: 1d
                  Logged: Not Specified