This article describes how to integrate previously generated metadata with Clowder.

Background

At the time of writing, about 171,000 images have been processed to extract features such as faces, eyes, facial profiles, close-ups, printed text, presence of a Stryker hole, presence of a border, mean and standard deviation of grayscale values, subject details, photographer details, and category details. This processing is done on Comet, using stripped-down versions of Clowder extractors or, in certain cases, new standalone programs. Integrating this information with Clowder is important because it makes Clowder's features available for the metadata, such as its RESTful API, authentication and authorization, and existing visualizations.

PyClowder2

TODO: Write a short note about PyClowder2, the latest version of the Python library for writing Clowder extractors.

Data Tables

The following data tables contain the extracted metadata. Each table is described below to explain its contents.

CategoryInfo

Sl. No.	Database Column Name	Field Description	Remarks
1	id	LOC Index	String;
2	category	LOC Category number (other_number field in the image JSON document)	String;

CreatorInfo

NOTE: Some creator names are empty strings, so this table may need some refinement.

Sl. No.	Database Column Name	Field Description	Remarks
1	id	LOC Index	String;
2	name	Creator name in the format: <last name>, <first name>, <birth year> - <death year>. If the creator name is blank the value is NULL.	String;
3	year_mon	Year and month (abbreviated in certain cases) in which the photograph was taken, in the format: <year> - <month | month1 - month2 | season>	String; some year - month values look like '[between 1940 and 1946]'
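
The year_mon formats described above could be split into structured fields before the values are attached to a file in Clowder. The following is a minimal sketch, not the project's actual code: the helper name parse_year_mon, the returned keys, and the exact spacing of the raw strings are assumptions based only on the Remarks column.

```python
import re

# Assumed layouts, taken from the table description; real values may differ.
_BETWEEN = re.compile(r"^\[between (\d{4}) and (\d{4})\]$")
_YEAR_PERIOD = re.compile(r"^(\d{4})\s*-\s*(.+)$")


def parse_year_mon(value):
    """Split a year_mon string into a small dict (hypothetical helper)."""
    if value is None:
        return None
    text = value.strip()

    # Layout like '[between 1940 and 1946]': a range of years, no month.
    match = _BETWEEN.match(text)
    if match:
        return {
            "start_year": int(match.group(1)),
            "end_year": int(match.group(2)),
            "period": None,
        }

    # Layout like '<year> - <month | month1 - month2 | season>'.
    match = _YEAR_PERIOD.match(text)
    if match:
        year = int(match.group(1))
        return {
            "start_year": year,
            "end_year": year,
            "period": match.group(2).strip(),
        }

    # Unrecognized layout: keep the raw text so nothing is lost.
    return {"start_year": None, "end_year": None, "period": text}
```

Values that match neither layout are passed through unchanged in the period field, so they can be reviewed later instead of being silently dropped.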

FacesInfo

Sl. No.	Database Column Name	Field Description	Remarks
1	id	LOC Index	String;
2	imght	Image height	Float;
3	imgwid	Image width	Float;
4	dumb1	The letter F, it's only there to help browse raw data	String;
5	num_faces	Number of faces found	Integer;
6	face_segs	Bounding box location of faces	String; a text string giving the face index (i-th face), x, y, width, and height of each face segment, with face segments separated by semicolons.
7	dumb2	The letter P, it's only there to help browse raw data	String;
8	num_profiles	Number of profiles found	Integer;
9	prof_segs	Bounding box location of profiles	String;
10	dumb3	The letter Y, it's only there to help browse raw data	String;
11	num_eyes	Number of eyes found	Integer;
12	eye_segs	Bounding box location of eyes	String;
13	dumb4	The letter C, it's only there to help browse raw data	String;
14	num_fullcls	Number of face full closeups	Integer; 'FULL' is relative to image size
15	num_midcls	Number of face mid closeups	Integer; 'MID' is relative to image size
16	num_fullprof	Number of profile full closeups	Integer; 'FULL' is relative to image size
17	num_midprof	Number of profile mid closeups	Integer; 'MID' is relative to image size
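
Because face_segs packs several bounding boxes into one string, it probably needs to be unpacked before it is useful as structured metadata. The sketch below shows one way to do that. It assumes that the five fields inside a segment are comma-separated, which the table does not state; only the semicolon between segments comes from the description. The function name parse_segments and the output keys are hypothetical. Whether prof_segs and eye_segs use the same layout is not confirmed here.

```python
def parse_segments(segs):
    """Parse a face_segs-style string into a list of bounding boxes.

    Assumed layout (not confirmed by the source):
        "<index>,<x>,<y>,<width>,<height>;<index>,<x>,..."
    """
    boxes = []
    if not segs:
        return boxes  # NULL or empty string: no segments

    for chunk in segs.split(";"):
        chunk = chunk.strip()
        if not chunk:
            continue  # tolerate a trailing semicolon

        parts = [part.strip() for part in chunk.split(",")]
        if len(parts) != 5:
            raise ValueError(
                f"expected 5 fields per segment, got {len(parts)}: {chunk!r}"
            )

        index, x, y, width, height = parts
        boxes.append(
            {
                "index": int(index),
                "x": float(x),
                "y": float(y),
                "width": float(width),
                "height": float(height),
            }
        )
    return boxes
```

A simple consistency check after parsing would be to compare len(parse_segments(face_segs)) with the num_faces column of the same row.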

ImageProperties

Sl. No.	Database Column Name	Field Description	Remarks
1	id	LOC Index	String;
2	hole	Presence of Stryker hole	Boolean;
3	border	Presence of border	Boolean;
4	meangray	Mean of grayscale values (not including hole and border)	Float;
5	stdgray	Standard deviation of grayscale values (not including hole and border)	Float;

ImageFilesList

Sl. No.	Database Column Name	Field Description	Remarks
1	fileid	File ID (Serial number)	Integer;
2	id	LOC Index	String;
3	cometfn	Filename in Comet	String;
4	locurl	URL of the photograph in LOC website	String;
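
All of the tables above carry the LOC Index in their id column, so the per-image metadata can be assembled by joining on it. The following sketch uses an in-memory SQLite database purely for illustration. The database engine, the column types, the function name build_records, and the sample rows are all assumptions or placeholders; only the table and column names come from the descriptions above.

```python
import sqlite3


def build_records(conn):
    """Join ImageFilesList with ImageProperties on the LOC index (id)."""
    conn.row_factory = sqlite3.Row
    rows = conn.execute(
        """
        SELECT f.fileid, f.id, f.cometfn, f.locurl,
               p.hole, p.border, p.meangray, p.stdgray
        FROM ImageFilesList AS f
        LEFT JOIN ImageProperties AS p ON p.id = f.id
        ORDER BY f.fileid
        """
    ).fetchall()
    # One plain dict per image; properties are None when no row matched.
    return [dict(row) for row in rows]


# Demo with made-up placeholder rows (not real data from the collection).
conn = sqlite3.connect(":memory:")
conn.executescript(
    """
    CREATE TABLE ImageFilesList (fileid INTEGER, id TEXT, cometfn TEXT, locurl TEXT);
    CREATE TABLE ImageProperties (id TEXT, hole INTEGER, border INTEGER,
                                  meangray REAL, stdgray REAL);
    INSERT INTO ImageFilesList VALUES (1, 'example-id-1', 'example1.jpg', 'https://example.org/1');
    INSERT INTO ImageFilesList VALUES (2, 'example-id-2', 'example2.jpg', 'https://example.org/2');
    INSERT INTO ImageProperties VALUES ('example-id-1', 1, 0, 101.5, 20.25);
    """
)
records = build_records(conn)
```

A LEFT JOIN is used so that images without an ImageProperties row still appear in the result, with the property fields left empty. Each resulting dict could then serve as the content of a metadata record attached to the corresponding file in Clowder.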
