Uploaded image for project: 'Kurator'
  1. Kurator
  2. KURATOR-186

Quality Control Taxon Authority Table

XMLWordPrintableJSON

      A database manager of a biodiversity data set wishes to perform quality control on a taxonomic authority file used by that data set against one or more external authorities, identifying records which cannot be linked to external authoriatitive records, and linking database records with guids from authorities when matches can be made.

      The database manager exports data from their taxonomic authority table, selects a specification for quality control of scientific names and authorships, selects one or more authorities to check the data against, selects an output format, uploads the data, and executes the workflow. A quality control report is provided. The database manager may filter this report to obtain lists of problematic records by various (value, provenance and assertion) criteria to produce reports that can be given to relevant domain experts as data quality control projects (trackable units of work) to enhance the authority file. The database manager may filter and manipulate this report to obtain lists of enhancements (in particular guids of matches) that can be fired as sql queries against the taxonomic authority table.

      Preconditions: CSV export of data including surrogate numeric primary key values from taxon name table, scientific name strings, and scientific name authorship strings, optionally with higher taxonomy values.

      Postconditions: CSV file containging initial input columns, assertions of the nature of matches and missmatches against authorities, names and authorships found in authorities, guids found in authorities, and provenance concerning the guids and enhancement assertions. SQL file containing update statements to update a guid field in a taxon table given the surrogate numberic primary key value.

      Alternative paths: Compare higher taxonomy with source authority, and report on discrepancies. Fill in missing elements in higher taxonomy according to the source authority where they are missing in the input. Postcondition of sql statements to update scientific names and authorships and guids in taxon name table.

              Unassigned Unassigned
              mole Paul J. Morris
              Votes:
              0 Vote for this issue
              Watchers:
              1 Start watching this issue

                Created:
                Updated: