DOC PREVIEW
ODU CS 791 - Lecture Notes

This preview shows page 1-2-24-25 out of 25 pages.

Save
View full document
View full document
Premium Document
Do you want full access? Go Premium and unlock all 25 pages.
Access to all documents
Download any document
Ad free experience
View full document
Premium Document
Do you want full access? Go Premium and unlock all 25 pages.
Access to all documents
Download any document
Ad free experience
View full document
Premium Document
Do you want full access? Go Premium and unlock all 25 pages.
Access to all documents
Download any document
Ad free experience
View full document
Premium Document
Do you want full access? Go Premium and unlock all 25 pages.
Access to all documents
Download any document
Ad free experience
Premium Document
Do you want full access? Go Premium and unlock all 25 pages.
Access to all documents
Download any document
Ad free experience

Unformatted text preview:

The Multi-Faceted Use of the OAI-PMH in the LANL RepositoryChallenges faced by Digital LibrariesIntroductionProperties of LANL LibrarySlide 5Components in LANL Repository ArchitectureIngestion into the LANL RepositoryContd.Prototype Ingestion ProcessCharacteristics of OAI-PMH repositoriesXML tapes for storing DIDSXML tapes never updatedRepository IndexIdentifier ResolverExampleOAI-PMH FederatorFunctions of OAI-PMH federatorCharacteristics of the OAI-PMH federatorFederator ResponseMPEG-21 DIP EngineOpenURL gatewaySlide 22ConclusionReferencesThank youThe Multi-Faceted Use of the The Multi-Faceted Use of the OAI-PMH in the LANL OAI-PMH in the LANL RepositoryRepositoryWritten By: Written By: Henry , Xiaoming ,Patrick Henry , Xiaoming ,Patrick and Herbert.and Herbert.Presented By:Presented By:Shashi Kanth MyadamShashi Kanth MyadamChallenges faced by Digital Challenges faced by Digital LibrariesLibrariesRepresentation of Complex objects.Representation of Complex objects.Making the vast heterogeneous Complex Making the vast heterogeneous Complex objects accessible to downstream objects accessible to downstream applications.applications.Ingesting and storing the vast assets.Ingesting and storing the vast assets.Defining unique Identity to each asset.Defining unique Identity to each asset.Defining relationships between assets by Defining relationships between assets by complex object Models.complex object Models.IntroductionIntroductionLos Alamos National Laboratory (LANL) Los Alamos National Laboratory (LANL) research Library is one which has research Library is one which has resolved the challenges to a great extent.resolved the challenges to a great extent.Main goalsMain goals Hosting , archiving and accessing vast Hosting , archiving and accessing vast heterogeneous assets in a consistent and heterogeneous assets in a consistent and sustainable mannersustainable mannerMaking it accessible by downstream Making it accessible by downstream applicationsapplicationsProperties of LANL LibraryProperties of LANL LibraryUse of MPEG-21 Digital Item Declaration Use of MPEG-21 Digital Item Declaration Language (DIDL) to represent complex Language (DIDL) to represent complex objects.objects.Natively Distributed in nature.Natively Distributed in nature.XML tape to store complex objectsXML tape to store complex objectsMulti-faceted use of the OAI-PMH to Multi-faceted use of the OAI-PMH to access stored content in incremental access stored content in incremental batches.batches.Open URL to access data.Open URL to access data.Components in LANL Repository Components in LANL Repository ArchitectureArchitectureIngestion into the LANL Repository.Ingestion into the LANL Repository.OAI-PMH repositories.OAI-PMH repositories.XML tapes for storing DIDs.XML tapes for storing DIDs.Repository Index.Repository Index.Identifier Resolver.Identifier Resolver.MPEG-21 DIP Engine and table.MPEG-21 DIP Engine and table.OAI-PMH federator.OAI-PMH federator.OpenURL gateway.OpenURL gateway.Ingestion into the LANL RepositoryIngestion into the LANL RepositoryHow to Feed complex digital objects into LANL How to Feed complex digital objects into LANL RepositoryRepositoryIssue :Issue : If you have an article which has:If you have an article which has: Metadata describing the article.Metadata describing the article. Article itself in PDF and ASCI.Article itself in PDF and ASCI.References in XML format.References in XML format.MPEG-21 DIDL provides a standard for storing MPEG-21 DIDL provides a standard for storing such kind of complex digital objects.such kind of complex digital objects.Contd.Contd.Different Kind of ways to feed data into the Different Kind of ways to feed data into the repositoryrepository HTTP,FTPHTTP,FTPOAI-PMH Harvester OAI-PMH Harvester Web CrawlerWeb CrawlerPhysical MediaPhysical MediaPrototype Ingestion ProcessPrototype Ingestion ProcessConverts the asset to XML document Converts the asset to XML document called Digital Item Declaration (DID).called Digital Item Declaration (DID).DID abides to MPEG-21 DIDL DID abides to MPEG-21 DIDL specification.specification.DID also contains relationships which is DID also contains relationships which is generated by Ingestion process.generated by Ingestion process.Two Items added in Ingestion processTwo Items added in Ingestion processDID identifier – globally unique identifier.DID identifier – globally unique identifier.DID creation time.DID creation time.Characteristics of OAI-PMH Characteristics of OAI-PMH repositories repositories BaseURL(n)BaseURL(n)Contained records are DIDs onlyContained records are DIDs onlyDID identifier used to identify DIDsDID identifier used to identify DIDsdatestamp is DID creation datedatestamp is DID creation dateOAI-PMH granularity is at seconds-levelOAI-PMH granularity is at seconds-levelSets structure ( out of scope)Sets structure ( out of scope)XML tapes for storing DIDSXML tapes for storing DIDSThe characteristics of OAI-PMH repository has The characteristics of OAI-PMH repository has lead to XML tapes as storage for DIDSlead to XML tapes as storage for DIDSXML tapes are created as follows:XML tapes are created as follows:Asset converted to DIDAsset converted to DIDAll DIDs concatinated into a single well-formed All DIDs concatinated into a single well-formed and valid XML file.and valid XML file.XML file is indexed using the following Google's XML file is indexed using the following Google's approach:approach:XML files are gzippedXML files are gzippedGzipped files are indexed with keys as identifiers Gzipped files are indexed with keys as identifiers and datestamp.and datestamp.XML tapes never updatedXML tapes never updatedAssets are rarely updated.Assets are rarely updated.Even if Assets are updated the Even if Assets are updated the corresponding DIDs are never updated in corresponding DIDs are never updated in the XML tape.the XML tape.A new DID is created for the updated A new DID is created for the updated asset and is stored in another OAI-PMH asset and is stored in another OAI-PMH repository.repository.Repository IndexRepository IndexRepository Index contains:Repository Index contains:Repository BaseURL – unique and Repository BaseURL – unique and persistent URI.persistent URI.Repository Creation time.Repository Creation time.Metadata of the OAI-PMH repository. Metadata of the OAI-PMH repository. The repository index can gather the newly The repository index can gather the


View Full Document

ODU CS 791 - Lecture Notes

Documents in this Course
Load more
Download Lecture Notes
Our administrator received your request to download this document. We will send you the file to your email shortly.
Loading Unlocking...
Login

Join to view Lecture Notes and access 3M+ class-specific study document.

or
We will never post anything without your permission.
Don't have an account?
Sign Up

Join to view Lecture Notes 2 2 and access 3M+ class-specific study document.

or

By creating an account you agree to our Privacy Policy and Terms Of Use

Already a member?