Page History

The primary driver of this development is the BrownDog project.

Current sprint is JSON-LD-2.

As of June 8th the following need to be implemented

...

When adding metadata check if context already exists and create new if it doesnt
Ability to edit context
Ability edit metadata
Create GUI to manipulate this new metadata
Store extractor metadata (on top of context metadata)
store external services and libraries used
Create example extractor using this new representation
Add heartbeat for extractors that sends over extractor metadata and updates the heartbeat

Jira
server OpenSource JIRA
columns key,summary,assignee,status,resolution
maximumIssues 20
jqlQuery Sprint = 33
serverId b14d4ad9-eb00-3a94-88ac-a843fb6fa1ca

Example json-ld document:

https://opensource.ncsa.illinois.edu/stash/projects/CATS/repos/clowder/browse/doc/src/json/metadata.js?at=refs%2Fheads%2Ffeature%2Fjsonld

Available API endpoints:


POST	/api/contexts	@api.ContextLD.addContext()
GET	/api/contexts/:id	@api.ContextLD.getContextById(id: UUID)
GET	/api/contexts/:name/context.json	@api.ContextLD.getContextByName(name: String)
DELETE	/api/contexts/:id	@api.ContextLD.removeById(id: UUID)
POST	/api/datasets/:id/metadataJsonLD	@api.Datasets.addMetadataJsonLD(id: UUID)
GET	/api/datasets/:id/metadataJsonLD	@api.Datasets.getMetadataJsonLD(id: UUID)
POST	/api/files/:id/metadataJsonLD	@api.Files.addMetadataJsonLD(id: UUID)
GET	/api/files/:id/metadataJsonLD	@api.Files.getMetadataJsonLD(id: UUID)

Comments:

Extractor context definitions are similar to user specified metadata, they both provide what kind of metadata will be added
1. How much flexibility do extractors get when adding metadata?

Open questions:

Should ContextLD and Metadata services be combined into one?

...

Extractor info tentative model:

Code Block

case class ExtractorInfo (
  id: UUID,
  name: String,
  description: String,
  creator: User,
  version: String,
  lastSeen: Date,
  contexts: List[UUID],
  external_services: List[URL],
  libraries: List[String],
  bibtex: List[String]
)

Example body of post from extractor when registering itself. This is a POST to http://localhost:9000/api/extractors. You can list extractors with GET http://localhost:9000/api/extractors and get info about a specific one with http://localhost:9000/api/extractors/559c39557d840f25a725e4be. If the entry already exists it will be updated.

Code Block

language	js

{
  "name": "ncsa.dbpedia",
  "version": "0.1",
  "description": "Simple JSON-LD extractor to extract information from a text file using named-entity recognition and dbpedia.",
  "author": "Luigi Marini <lmarini@illinois.edu",
  "contributors": [],
  "contexts": [],
  "repository": {
    "repType": "git",
    "repUrl": "https://opensource.ncsa.illinois.edu/stash/scm/cats/extractors-dbpedia.git"
  },
  "external_services": [
    "http://live.dbpedia.org/sparql"
  ],
  "libraries": [
    "nltk"
  ],
  "bibtex": [
    "book{BirdKleinLoper09, author = {Steven Bird and Ewan Klein and Edward Loper}, title = {{Natural Language Processing with Python}}, publisher = {O'Reilly Media}, year = 2009}"
  ]
}

Research

Implementing Transitive Credit with JSON-LD
- http://arxiv.org/abs/1407.5117
  - https://www.researchgate.net/publication/279968805_Report_on_the_Second_Workshop_on_Sustainable_Software_for_Science_Practice_and_Experiences_%28WSSSPE2%29

Space shortcuts

Page tree

Versions Compared

Old Version 2

New Version Current

Key