DPLA Aggregation Overview Gretchen Gueguen, Data Services Coordinator gretchen@dp.la
1. Synchronization of metadata 2. Links back to content in context
DPLA s Harvest and Synchronization 1. Initial Metadata Harvest 2. Unique identifier assigned 3. Periodic re-harvest of entire feed 4. Synchronization with existing records Relies on: 1. A single feed of records 2. A single shared metadata schema...all of which is usually the result of good aggregation and normalization practices at the Hub The Harvest. Shared via Yale Center for British Art / ARTstor https://dp.la/item/35ecbd0d9dfa7c99323a648f2f6bcfb7
Preferred Harvest Methods OAI PMH ResourceSync File Transfer API The Harvest. Shared via the Minnesota Historical Society / Minnesota Digital Library https://dp.la/item/a744ac5dc594c625ca6079b45be2c8fa
Metadata Requirements
CONSISTENCY!!!
dpla:sourceresource DPLA Label Equivalent Element Requirement Alternative title dcterms:alternative Optional Collection dcterms:ispartof Recommended Contributor dcterms:contributor Optional Creator dcterms:creator Recommended Date dc:date Recommended Description dcterms:description Recommended Extent dcterms:extent Optional Format dc:format Recommended Genre edm:hastype Optional Identifier dcterms:identifier Optional Language dcterms:language Recommended
dpla:sourceresource DPLA Label Equivalent Element Requirement Place dcterms:spatial Recommended Publisher dc:publisher Recommended Relation dc:relation Optional Replaced by dcterms:isreplaced By Optional Replaces dcterms:replaces Optional Rights dc:rights Required Rights Holder dcterms:rightsholder Optional Subject dcterms:subject Optional Temporal Coverage dcterms:temporal Optional Title dcterms:title Required Type dcterms:type Recommended
ore:aggregation DPLA Label Equivalent Element Requirement Aggregated SR edm:aggregatedcho Required Data Provider edm:dataprovider Required Digital Resource Original Record dpla:originalrecord Required Has View edm:hasview Optional Intermediate Provider dpla:intermediateprovider Optional Is Shown at edm:isshownat Required Object edm:object Optional Preview edm:preview Required Provider edm:provider Required Standardized Rights Statement edm:rights Required
Name of the Contributing Institution Data Provider edm:dataprovider Should be in distinct/findable metadata field Helps to do some normalization
isshownat edm:isshownat URL back to object in originating repository (landing page) Should be in distinct/findable metadata field
Does not have to be unique, but that helps Title dcterms:title Would prefer it to be descriptive rather than an identifier Would prefer to not have brackets or ending periods
Not required for text, video, audio Thumbnail edm:preview Must be a URL to an image, not a landing page If no thumbnail is available, do not supply a generic icon Will be displayed on the front end at 150px on longest side*
Rights edm:rights dc:rights Either a <dc:rights> (uncontrolled, free text statement) OR a <edm:rights> (URI from rightsstatements.org or creativecommons.org is required Both may be present, but must not repeat or contradict each other
DPLA s Initial Ingest Process
1: Hub works on their data
2: I will do an initial gap analysis and give you feedback
3. Do a test harvest and map into our QA environment
4. First production ingest
5. Re-harvests on schedule
Helpful Documentation Introduction to the DPLA MAP http://dp.la/info/wp-content/uploads/2015/03/intro_to_dpla_metadata_model.pdf DPLA Metadata Application Profile http://dp.la/info/wp-content/uploads/2015/03/mapv4.pdf DPLA MAP 4.0 Metadata Crosswalk http://bit.ly/dpla-map4-crosswalk (caveat: this is a record of current mappings, not necessarily recommended mappings) DPLA Metadata Quality Guidelines http://bit.ly/dpla-metadata-qual