Lifecycle Metadata for Digital ObjectsDramatis PersonaeMIX - Metadata for Images in XML SchemaProfiling the Dynamic Web PageSound Recording (Digitized)Preserving ETDsDigitized Moving Image: VHSDSpace SIP Profile for a Born Digital Audio Music FileSlide 9Born Digital Spoken Word Oral History AudioMETS SIP Profile for Spreadsheets Melissa KeenanLifecycle Metadata for Digital ObjectsThe Final CurtainDecember 4, 2006Dramatis Personae•Mundee•Harrison•Kaczmarczik•Sevcik•Bibb•Holt•Addison•Cofield•KeenanMIX - Metadata for Images in XML Schema•Currently under development•Schema for a set of technical data elements required to manage digital image collections•Useful for digitized text (page images)Profiling the Dynamic Web PageProfiling the Dynamic Web PageWhat Is This Dynamic Business?!?!What Is This Dynamic Business?!?!The Deep End: The Database of All The Deep End: The Database of All DatabasesDatabasesDynamic Web Pages and the Dynamic Web Pages and the Metadata Sets Who Love ThemMetadata Sets Who Love ThemWhy Dynamic Web Pages DieWhy Dynamic Web Pages DieHarvesters, Crawlers, and ExtractorsHarvesters, Crawlers, and ExtractorsPicking and Choosing MetadataPicking and Choosing MetadataDecisions, DecisionsDecisions, DecisionsSound Recording (Digitized)Sound Recording (Digitized)•Use case: Student recitals recorded as analog, digitized for streaming access•Challenge: Find schema that apply to musical performances and have usefulness for searching•Metadata standards: mpeg-7, DC/MODS•Use case: Student recitals recorded as analog, digitized for streaming access•Challenge: Find schema that apply to musical performances and have usefulness for searching•Metadata standards: mpeg-7, DC/MODSSusan Harwood KaczmarczikDecember 4, 2006Preserving ETDsMajor Issues--electronic theses and dissertations Fonts--embedded--unrecognized--hacked?Big list of Unicode: http://www.alanwood.net/unicode/fonts.htmlActive features--links, fields, encryptionSolutionsPDF/A--too simple & still in developmentMulti-page TIFF + "too big to fail"Administrative Degree candidacy elementsDigitized Moving Image: VHS*High Points*•Extension Schema: LOC AV PrototypedmdSec•MODSamdSec•techMD: VMD•rightsMD: RMD•sourceMD:VMD•digiProvMD: PMD*Problems Encountered*•Getting started•Overwhelming file sizes•Copyright•Confusing technical terminology related to videoDSpace SIP Profile for a Born Digital Audio Music FilePreservation Issues- Formats and GuidelinesControlled Vocabularies- Library of Congress Subject Headings- Getty Thesaurus of Geographic Names- MARC Value List for Relators and Roles- DCMI Type Vocabulary- ISO 639-2Extension Schemas- MODS- Creative Commons- AUDIOMD - LC-AV Audio Metadata Extension SchemaBorn Digital Still Images•Similar lifecycle to digitized. MD not always stored.•Primarily use NISO MIX format (includes EXIF, GPS).•Images are numerical representations - different image formats compress differently - some need special MD.•NISO MIX contains many fields that are seemingly unimportant but may be valuable as evidence.•NISO MIX also includes many fields completely unintelligible to the layman, referenced or not.•The previous two factors can spell trouble if the preservationist is not an expert! EXIF would help, but there is not a 1:1 ratio of information.•Metadata is meant to help understand transformation, not to “step backwards” to recreate images although this is possible with sufficient detail.Addison 4 DEC 06Born Digital Spoken WordOral History Audio .WAV.MP3.TXTAUDIOMD & PREMIS TEXTMD & PREMISTEI EncodingRules of Description - Name and Date formatsHASSETGetty Thesaurus of Geographic Names LCSHMETS SIP Profile for SpreadsheetsMelissa Keenan•Preservation issues:–Saving formulas–Proprietary format–Open Document Format for Office Applications (ISO/IEC 26300:2006)•Metadata:•EAD (use case is archival)•MathML (complex formulas)•Automatically generated by
View Full Document