You are viewing an old version of this page. View the current version.

Compare with Current View Page History

« Previous Version 38 Next »

This document is under construction. It will describe the details about integrating already generated metadata with Clowder.

Background

At the time of writing this document, there are about 171,000 images from the Library of Congress Farm Security Administration/Office of War Information Photograph Collection that were processed for extracting various features like faces, eyes, facial profile, closeups, printed text, presence of Stryker hole, presence of border, mean and standard deviation of grayscale values, subject details, photographer details, and category details. These were computed on XSEDE Comet by using stripped down version of Clowder Extractors or in certain cases, by creating new standalone programs. Integrating this information with Clowder is important to use its features like RESTful API, authentication and authorization, available visualizations, etc.

Database Table Descriptions and Converted Sample JSON Documents

The following set of tables contain description about the database tables that contain extracted metadata. Following each database table description is an example JSON document that will be generated by the Extractor Integration Script for that particular table.

CategoryInfo

Sl. No.Database Column NameField DescriptionRemarks
1idLOC IndexString;
2categoryLOC Category number (other_number field in the image JSON document)String;
CategoryInfo JSON Document
{
	"id": "fsa1997018591",
	"category": "F 665"
}

CreatorInfo

NOTE: some creators are empty strings, so it might need some refinement. 

Sl. No.Database Column NameField DescriptionRemarks
1idLOC IndexString;
2nameCreator name in the format: <last name>, <first name>, <birth year> - <death year>. If the creator name is blank the value is NULL.String;
3year_monYear and month (abbreviated in certain cases) in which the photograph was taken in the format: <year> - <month | month1 - month2 | season >String; Some year - month values are like '[between 1940 and 1946]'; The format mentioned in the left cell may not be strictly followed. Need to look into this in detail when doing the transformation.

FacesInfo

Sl. No.Database Column NameField DescriptionRemarks
1idLOC IndexString;
2imghtImage heightFloat;
3imgwidImage widthFloat;
4dumb1

The letter F, it's only there to help browse raw data

String;
5num_facesNumber of faces foundInteger;
6face_segsBounding box location of faces

String; this is a text string that has the ith face, x, y, width, height of face segment. Each face segment is separated by a semicolon.

7dumb2The letter P, it's only there to help browse raw dataString;
8num_profilesNumber of profiles foundInteger;
9prof_segsBounding box location of profilesString;
10dumb3The letter Y, it's only there to help browse raw dataString;
11num_eyesNumber of eyes foundInteger;
12eye_segsBounding box location of eyesString;
13dumb4The letter C , it's only there to help browse raw dataString;
14num_fullclsNumber of face full closeupsInteger; 'FULL' is relative to image size
15num_midclsNumber of face mid closeupsInteger; 'MID' is relative to image size
16num_fullprofNumber of profile full closeupsInteger; 'FULL' is relative to image size
17num_midprofNumber of profile mid closeupsInteger; 'MID' is relative to image size

ImageProperties

Sl. No.Database Column NameField DescriptionRemarks
1idLOC IndexString;
2holePresence of Stryker holeBoolean;
3borderPresence of borderBoolean;
4meangrayMean of grayscale values (not including hole and border)Float;
5stdgrayStandard deviation of grayscale values (not including hole and border)Float;

ImageFilesList

Sl. No.Database Column NameField DescriptionRemarks
1fileidFile ID (Serial number)Integer;
2idLOC IndexString;
3cometfnFilename in CometString;
4locurlURL of the photograph in LOC websiteString;

OCRInfo

Sl. No.Database Column NameField DescriptionRemarks
1idLOC IndexString;
2ocr_pred

Overall prediction of whether or not text is present in image. 'nop' means OCR found nothing. Where if any one box predicted text then the final prediction is set to T.

String; What is the difference between nop and F?
3scoresPrediction scores. A string that consists of sets of 3 numbers (separated by semicolon) where, for each OCR text box found, a 0/1 classification value indicating no-text/text predicted, 2 floats indicating classification score for no-text/textString;
4box_sumA count of number of 1's found across text box score sets Integer;
5box_cnt

Number of text boxes. Note that box_sum / box_cnt is another possible score instead of the T/F above.

Integer; What is the difference between box_sum and box_cnt?

6box_txt

Set of strings separated by semicolon. One string for each text box found in OCR process

String;
7box_locs

A string that consists of sets of 4 numbers (separated by semicolon) one set for each text box, where the numbers are upper left x coordinate, upper left y coordinate, box width, box height.

String;

SubjectInfo

Sl. No.Database Column NameField DescriptionRemarks
1idLOC IndexString;
2subjectSubject informationString;

 

 

 

  • No labels