The Open Archives Initiative and
the Sheet Music Consortium
Jon Dunn, Jenn Riley
IU Digital Library Program
October 10, 2003
October 10,
2003
DL Brown Bag:
OAI/Sheet Music 2
Presentation outline
 Jon:
 OAI introduction
 Sheet Music Consortium background
 Jenn:
 Data mapping issues
 Sheet music harvester demonstration
 Next steps
October 10,
2003
DL Brown Bag:
OAI/Sheet Music 3
OAI: Open Archives Initiative
 Original problem: searching across e-print
archives
 Distributed searching hard
 e.g. Z39.50
 Varying search semantics, capabilities
 Network, server problems
 Solution: metadata harvesting
 OAI-PMH: OAI Protocol for Metadata Harvesting
October 10,
2003
DL Brown Bag:
OAI/Sheet Music 4
Metadata Harvesting
 Extract metadata from various sources
 Build services on local copies of metadata
user
. . .
search for “Indiana”
local copy of
metadata
metadata
harvested
offline
metadata
harvested
offline
metadata
harvested
offline
metadata
harvested
offline
all searching, browsing,
etc. performed on
the metadata hereIndividual repositories can
still support direct user
interaction
Data providers
Service
provider
October 10,
2003
DL Brown Bag:
OAI/Sheet Music 5
OAI-PMH roles
 Data Providers
 Repositories of digital content and metadata
 Support harvesting of metadata via the OAI
protocol
 Service Providers
 Harvest metadata from data providers using the
OAI protocol
 Implement user interface to data

Usually for searching, but other services also possible
 Can be selective
October 10,
2003
DL Brown Bag:
OAI/Sheet Music 6
OAI Protocol for Metadata
Harvesting
 Originally developed in 1999 (Santa Fe Convention)
 Original focus on E-prints
 Has grown into general metadata harvesting protocol
 Version 1.0: January 2001
 Version 1.1: June 2001
 Conform to XML Schema 1.0
 Version 2.0: June 2002
 Transition period through December 2002
 Currently 120 registered OAI data providers
(up from 53 in March 2003)
October 10,
2003
DL Brown Bag:
OAI/Sheet Music 7
OAI-PMH tech details
 Carried over HTTP
 Requests: HTTP GET or POST
 Responses encoded in XML
 Format defined via XML schema
 Metadata in unqualified Dublin Core
(and potentially other formats)
October 10,
2003
DL Brown Bag:
OAI/Sheet Music 8
Dublin Core elements
 Coverage
 Description
 Type
 Relation
 Source
 Subject
 Title
 Contributor
 Creator
 Publisher
 Rights
 Date
 Format
 Identifier
 Language
October 10,
2003
DL Brown Bag:
OAI/Sheet Music 9
OAI-PMH verbs
Verb Function
Identify description of archive
ListMetadataFormats metadata formats supported by archive
ListSets sets defined by archive
ListIdentifiers OAI unique ids contained in archive
ListRecords listing of N records
GetRecord listing of a single record
October 10,
2003
DL Brown Bag:
OAI/Sheet Music 10
OAI resources
 Web site, mailing lists
 Repository explorer
 Data/service provider software
www.openarchives.org
October 10,
2003
DL Brown Bag:
OAI/Sheet Music 11
OAI data providers at IU
 OAI data provider for DLP collections
 Lilly: Hohenberger Photograph Collection,
DeVincent Sheet Music Collection
 IUN: U.S. Steel Photograph Collection
 eventually all
 Eprints: Digital Library of the Commons
 AISRI
 ReciprocalNet
October 10,
2003
DL Brown Bag:
OAI/Sheet Music 12
OAI data provider for DLP
 PHP OAI Data Provider
 Developed by University of Oldenburg
 PHP, mySQL database
 Perl scripts used to map USMARC,
other formats to DC
 MARC.pm Perl module
October 10,
2003
DL Brown Bag:
OAI/Sheet Music 13
Examples of OAI service
providers
 UIUC Digital Gateway to Cultural Heritage
Materials
 http://oai.grainger.uiuc.edu/
 UMich OAIster
 http://www.oaister.org/
 RLG Cultural Materials (licensed)
 http://www.rlg.org/culturalres/
 OLAC: Open Language Archives Community
 http://www.language-archives.org/
October 10,
2003
DL Brown Bag:
OAI/Sheet Music 14
Sheet Music Consortium
 Partners
 UCLA
 Johns Hopkins
 IU
 Goal: Integrate access to sheet music
collections
 Online and print collections
October 10,
2003
DL Brown Bag:
OAI/Sheet Music 15
Sheet music
 Definition
 Based on physical format: generally loose sheets
or folio, 1-10 pages
 Much is “popular music,” but not all
 Variety of research uses
 Currently hard to access
 Variety of metadata
 Much uncataloged
 Many valuable collections
 MLA list
 At IU: Lilly, Archives of Traditional Music
The Open Archives Initiative and the Sheet Music Consortium
The Open Archives Initiative and the Sheet Music Consortium
The Open Archives Initiative and the Sheet Music Consortium
The Open Archives Initiative and the Sheet Music Consortium
October 10,
2003
DL Brown Bag:
OAI/Sheet Music 20
Sheet Music Consortium
Harvester: Timeline
 March 2002: Initial planning meeting at IU
 Fall 2002: Initial system prototype
 Winter 2002/2003: Usability evaluation,
interface redesign
 Focus groups and usability testing at several sites
 Fall 2003 – Version 1 of system released
October 10,
2003
DL Brown Bag:
OAI/Sheet Music 21
Why did we have to map data?
 OAI requires unqualified Dublin Core
 Sheet Music Harvester version 1 only
collected Dublin Core
 Contributed data only needed to support
resource discovery
 Dublin Core field definitions need
interpretation
 For efficient searching, data from different
institutions must be consistent
October 10,
2003
DL Brown Bag:
OAI/Sheet Music 22
Some mapping issues
 Field formatting important, not just contents
 Choices heavily influenced by LC practice
 Can’t force institutions to comply with
guidelines
 Sheet music has many alternative titles
 Creator vs. contributor
 Plate numbers: they’re important, where to
put and how to label?
 Uncertain dates and date ranges
October 10,
2003
DL Brown Bag:
OAI/Sheet Music 23
Mapping guidelines
 Examples:
 Creator: Invert name. Use the authorized form of
name where possible. If needed (e.g. for an alias)
repeat the field for the alternative form.
 Date: Date of publication. The most recent date
to appear on the music, or, the actual date of
publication if not present but known. Include other
dates (e.g. date of composition) if known. Codes
“c” for copyright and “ca.” for circa in front of the
date is allowed for now. Use repeated DC fields
for each date if needed.
October 10,
2003
DL Brown Bag:
OAI/Sheet Music 24
Existing metadata formats
 MARC
 Encoded Archival Description (EAD)
 Dublin Core (DC)
 Local custom formats
October 10,
2003
DL Brown Bag:
OAI/Sheet Music 25
MARC (1)
 Library of Congress – mostly from
Music for the Nation: American Sheet Mus
 almost 50,000 records available via OAI
 already had data mapped “based on”
MARC to Dublin Core crosswalk
 not able to alter their mapping for
participation in sheet music project
October 10,
2003
DL Brown Bag:
OAI/Sheet Music 26
MARC (2)
 IU – Starr collection
 little authority control
 determined LC MARC2DC mapping inadequate
 mapping in progress using MARC.pm
 Duke – Weinmann collection
 rare materials emphasis
 also customized own mapping
 mapping in progress
October 10,
2003
DL Brown Bag:
OAI/Sheet Music 27
EAD
 Duke – Historic American Sheet Music
 Item level finding aid
 very robust and specific
 conversion was relatively simple because
data was converted to EAD from collection-
specific database
 included virtually all information in EAD
documents to DC records
October 10,
2003
DL Brown Bag:
OAI/Sheet Music 28
Dublin Core
 UCLA – Archive of Popular American Music
 4 types of DC records
 songs

sheet music
 covers et al

recordings
 mapping only required inheritance of songs and sheet music
data elements down to the covers level
 recordings data ignored for OAI data provider purposes
October 10,
2003
DL Brown Bag:
OAI/Sheet Music 29
Local custom formats (1)
 publication (location,
publisher, date)
 subject
 call num (box, item)
 title
 composer/lyricist/
arranger
 form of composition
 instrumentation
 first line
 first line of chorus
 performer
 dedicatee
 engraver/lithographer/
artist
 advertisement
 plate num
 duplication
 Johns Hopkins – Levy collection
 Simple SGML DTD
October 10,
2003
DL Brown Bag:
OAI/Sheet Music 30
Local custom formats (2)
 title
 composer
 lyricist
 place of
publication
 publisher
 copyright
 first line
 first line of chorus
 subject
 form of composition
 performance medium
 copies
 call #
 IU – DeVincent collection
 Simple MS Access database
 Conversion done with Perl
October 10,
2003
DL Brown Bag:
OAI/Sheet Music 31
Harvester demonstration
 <http://digital.library.ucla.edu/sheetmusic>
October 10,
2003
DL Brown Bag:
OAI/Sheet Music 32
Data inconsistencies
 Different depths of description
 Different levels of authority control
 No common subject vocabulary
between collections
 Despite mapping guidelines, differences
in DC interpretation
October 10,
2003
DL Brown Bag:
OAI/Sheet Music 33
Next steps?
 Authority control for names
 Date formats
 Data clean-up: what can be done at harvester end
and what must we ask data providers to do?
 What will more robust data format look like?
 How do we make it easier for more institutions to
participate?
October 10,
2003
DL Brown Bag:
OAI/Sheet Music 34
More information
 Presentation on DLP web site,
with links:
 www.dlib.indiana.edu/workshops/bbfall2003.htm
 Email:
 Jon Dunn: jwd@indiana.edu
 Jenn Riley: jenlrile@indiana.edu

More Related Content

PPTX
RDA for music
PPT
RDA for music cataloguers
PPTX
Snyder Kishimoto: RDA for Music: Popular Music, Jazz, and World Music Audio R...
PPTX
MLA Workshop: Cataloging Music Audiovisual Materials Using RDA
PPTX
RDA for Music: Scores
PPTX
RDA for Music: Classical Music Audio Recordings Workshop
PPTX
MLA Workshop: Introduction to LC’s Music Medium of Performance and Genre Voca...
PPTX
Music Cataloging Basics Workshop - Slides
RDA for music
RDA for music cataloguers
Snyder Kishimoto: RDA for Music: Popular Music, Jazz, and World Music Audio R...
MLA Workshop: Cataloging Music Audiovisual Materials Using RDA
RDA for Music: Scores
RDA for Music: Classical Music Audio Recordings Workshop
MLA Workshop: Introduction to LC’s Music Medium of Performance and Genre Voca...
Music Cataloging Basics Workshop - Slides

What's hot (12)

PPTX
Music Cataloging Basics (October 2018)
PPT
Resource Description & Access (RDA)
PPTX
RDA State of the Union
PPTX
Intro to rda
PPT
RDA (Resource Description & Access)
PPT
Organizing Trumpet Music
PPTX
Digipak research
PPTX
RDA Essentials
PPTX
Introducing RDA
PPTX
CMoreno_PSE_pg217_1_DM
PPTX
All About Access Points in RDA
DOC
Secondary 3
Music Cataloging Basics (October 2018)
Resource Description & Access (RDA)
RDA State of the Union
Intro to rda
RDA (Resource Description & Access)
Organizing Trumpet Music
Digipak research
RDA Essentials
Introducing RDA
CMoreno_PSE_pg217_1_DM
All About Access Points in RDA
Secondary 3
Ad

Viewers also liked (17)

PPTX
Launching metaware.buzz
PPT
Cushman Exposed! Exploiting Controlled Vocabularies to Enhance Browsing and S...
PPTX
Discovery elsewhere
PPT
Tagging and User-Contributed Metadata
PPT
Making Interoperability Easier: Creating Shareable Metadata
PPT
Digital Imaging of Photographs
PPT
Merging Metadata from Multiple Traditions: IN Harmony Sheet Music from Librar...
PPTX
Getting Comfortable with Metadata Reuse
PPT
Metadata for Brittle Books Page Turner
PPT
Challenges in the Nursery: Linking a Finding Aid with Online Content
PPTX
Designing the Garden: Getting Grounded in Linked Data
PPTX
The future of cataloguing? Future cataloguers!
PPT
Metadata for Audiovisual Materials and its Role in Digital Projects
PPT
Alphabet Soup: Choosing Among DC, QDC, MARC, MARCXML, and MODS
PPT
Digitizing and Delivering Audio and Video
PPT
IN Harmony Tools Redux
PPTX
Avalon 5.0 and Beyond
Launching metaware.buzz
Cushman Exposed! Exploiting Controlled Vocabularies to Enhance Browsing and S...
Discovery elsewhere
Tagging and User-Contributed Metadata
Making Interoperability Easier: Creating Shareable Metadata
Digital Imaging of Photographs
Merging Metadata from Multiple Traditions: IN Harmony Sheet Music from Librar...
Getting Comfortable with Metadata Reuse
Metadata for Brittle Books Page Turner
Challenges in the Nursery: Linking a Finding Aid with Online Content
Designing the Garden: Getting Grounded in Linked Data
The future of cataloguing? Future cataloguers!
Metadata for Audiovisual Materials and its Role in Digital Projects
Alphabet Soup: Choosing Among DC, QDC, MARC, MARCXML, and MODS
Digitizing and Delivering Audio and Video
IN Harmony Tools Redux
Avalon 5.0 and Beyond
Ad

Similar to The Open Archives Initiative and the Sheet Music Consortium (15)

PPT
Online Sheet Music
PPT
FRBR; or, How I learned to stop worrying and love the model
PPT
Open Archives Initiative for Sheet Music: Data Mapping
PPTX
Early Warnings
PPT
Linking books: rda-frbr-lod
PDF
Handout for FRBR; or, How I learned to stop worrying and love the model
PDF
rda_complete_examples_bibliographic_april2016_0.pdf
PPTX
LIS 653, Session 6: FRBR & Relationships
PDF
The Listening Experience Database
PDF
Swib 2013: Enhancing an OAI-PMH Service Using Linked Data: A Report from the ...
ODP
Publishing and interlinking music-related data on the Web
PPT
Ask a Librarian: The Role of Librarians in the Music Information Retrieval Co...
PPT
RDA: coming (not so) soon to a catalogue near you
PDF
Toward linked data: Library of Congress Medium of Performance Thesaurus
PDF
A-sides, B-sides, Chapters, and Special Features: Describing Content and Stru...
Online Sheet Music
FRBR; or, How I learned to stop worrying and love the model
Open Archives Initiative for Sheet Music: Data Mapping
Early Warnings
Linking books: rda-frbr-lod
Handout for FRBR; or, How I learned to stop worrying and love the model
rda_complete_examples_bibliographic_april2016_0.pdf
LIS 653, Session 6: FRBR & Relationships
The Listening Experience Database
Swib 2013: Enhancing an OAI-PMH Service Using Linked Data: A Report from the ...
Publishing and interlinking music-related data on the Web
Ask a Librarian: The Role of Librarians in the Music Information Retrieval Co...
RDA: coming (not so) soon to a catalogue near you
Toward linked data: Library of Congress Medium of Performance Thesaurus
A-sides, B-sides, Chapters, and Special Features: Describing Content and Stru...

More from Jenn Riley (14)

PPTX
Understanding Metadata: Looking Forward
PDF
Handout for Digital Imaging of Photographs
PPT
Variations2
PDF
Handout for Alphabet Soup: Choosing Among DC, QDC, MARC, MARCXML, and MODS
PDF
Handout for Merging Metadata from Multiple Traditions: IN Harmony Sheet Music...
PDF
Handout for Tagging and User-Contributed Metadata
PDF
Handout for IN Harmony Tools Redux
PDF
Handout for An introduction to PREMIS
PPT
An Introduction to PREMIS
PPT
The Standards Paradox: Case studies in Conforming to or Abandoning Metadata S...
PPT
The Digital Library Federation Aquifer Initiative
PDF
Handout for The Evolution of Library Descriptive Practices: Bibliographic Con...
PPT
The Evolution of Library Descriptive Practices: Bibliographic Control? Descri...
PPT
Building an Audio Preservation System at Indiana University Using Standards a...
Understanding Metadata: Looking Forward
Handout for Digital Imaging of Photographs
Variations2
Handout for Alphabet Soup: Choosing Among DC, QDC, MARC, MARCXML, and MODS
Handout for Merging Metadata from Multiple Traditions: IN Harmony Sheet Music...
Handout for Tagging and User-Contributed Metadata
Handout for IN Harmony Tools Redux
Handout for An introduction to PREMIS
An Introduction to PREMIS
The Standards Paradox: Case studies in Conforming to or Abandoning Metadata S...
The Digital Library Federation Aquifer Initiative
Handout for The Evolution of Library Descriptive Practices: Bibliographic Con...
The Evolution of Library Descriptive Practices: Bibliographic Control? Descri...
Building an Audio Preservation System at Indiana University Using Standards a...

Recently uploaded (20)

PDF
BSc-Zoology-02Sem-DrVijay-Comparative anatomy of vertebrates.pdf
PPT
hemostasis and its significance, physiology
PPTX
4. Diagnosis and treatment planning in RPD.pptx
PPTX
IT infrastructure and emerging technologies
PPSX
namma_kalvi_12th_botany_chapter_9_ppt.ppsx
PPTX
growth and developement.pptxweeeeerrgttyyy
PDF
Chevening Scholarship Application and Interview Preparation Guide
PDF
POM_Unit1_Notes.pdf Introduction to Management #mba #bba #bcom #bballb #class...
PDF
anganwadi services for the b.sc nursing and GNM
PDF
Physical pharmaceutics two in b pharmacy
PPTX
2025 High Blood Pressure Guideline Slide Set.pptx
PDF
FAMILY PLANNING (preventative and social medicine pdf)
PPTX
Cite It Right: A Compact Illustration of APA 7th Edition.pptx
PPTX
Key-Features-of-the-SHS-Program-v4-Slides (3) PPT2.pptx
PDF
African Communication Research: A review
PPTX
pharmaceutics-1unit-1-221214121936-550b56aa.pptx
PDF
Health aspects of bilberry: A review on its general benefits
PPTX
ACFE CERTIFICATION TRAINING ON LAW.pptx
DOCX
THEORY AND PRACTICE ASSIGNMENT SEMESTER MAY 2025.docx
PDF
Kalaari-SaaS-Founder-Playbook-2024-Edition-.pdf
BSc-Zoology-02Sem-DrVijay-Comparative anatomy of vertebrates.pdf
hemostasis and its significance, physiology
4. Diagnosis and treatment planning in RPD.pptx
IT infrastructure and emerging technologies
namma_kalvi_12th_botany_chapter_9_ppt.ppsx
growth and developement.pptxweeeeerrgttyyy
Chevening Scholarship Application and Interview Preparation Guide
POM_Unit1_Notes.pdf Introduction to Management #mba #bba #bcom #bballb #class...
anganwadi services for the b.sc nursing and GNM
Physical pharmaceutics two in b pharmacy
2025 High Blood Pressure Guideline Slide Set.pptx
FAMILY PLANNING (preventative and social medicine pdf)
Cite It Right: A Compact Illustration of APA 7th Edition.pptx
Key-Features-of-the-SHS-Program-v4-Slides (3) PPT2.pptx
African Communication Research: A review
pharmaceutics-1unit-1-221214121936-550b56aa.pptx
Health aspects of bilberry: A review on its general benefits
ACFE CERTIFICATION TRAINING ON LAW.pptx
THEORY AND PRACTICE ASSIGNMENT SEMESTER MAY 2025.docx
Kalaari-SaaS-Founder-Playbook-2024-Edition-.pdf

The Open Archives Initiative and the Sheet Music Consortium

  • 1. The Open Archives Initiative and the Sheet Music Consortium Jon Dunn, Jenn Riley IU Digital Library Program October 10, 2003
  • 2. October 10, 2003 DL Brown Bag: OAI/Sheet Music 2 Presentation outline  Jon:  OAI introduction  Sheet Music Consortium background  Jenn:  Data mapping issues  Sheet music harvester demonstration  Next steps
  • 3. October 10, 2003 DL Brown Bag: OAI/Sheet Music 3 OAI: Open Archives Initiative  Original problem: searching across e-print archives  Distributed searching hard  e.g. Z39.50  Varying search semantics, capabilities  Network, server problems  Solution: metadata harvesting  OAI-PMH: OAI Protocol for Metadata Harvesting
  • 4. October 10, 2003 DL Brown Bag: OAI/Sheet Music 4 Metadata Harvesting  Extract metadata from various sources  Build services on local copies of metadata user . . . search for “Indiana” local copy of metadata metadata harvested offline metadata harvested offline metadata harvested offline metadata harvested offline all searching, browsing, etc. performed on the metadata hereIndividual repositories can still support direct user interaction Data providers Service provider
  • 5. October 10, 2003 DL Brown Bag: OAI/Sheet Music 5 OAI-PMH roles  Data Providers  Repositories of digital content and metadata  Support harvesting of metadata via the OAI protocol  Service Providers  Harvest metadata from data providers using the OAI protocol  Implement user interface to data  Usually for searching, but other services also possible  Can be selective
  • 6. October 10, 2003 DL Brown Bag: OAI/Sheet Music 6 OAI Protocol for Metadata Harvesting  Originally developed in 1999 (Santa Fe Convention)  Original focus on E-prints  Has grown into general metadata harvesting protocol  Version 1.0: January 2001  Version 1.1: June 2001  Conform to XML Schema 1.0  Version 2.0: June 2002  Transition period through December 2002  Currently 120 registered OAI data providers (up from 53 in March 2003)
  • 7. October 10, 2003 DL Brown Bag: OAI/Sheet Music 7 OAI-PMH tech details  Carried over HTTP  Requests: HTTP GET or POST  Responses encoded in XML  Format defined via XML schema  Metadata in unqualified Dublin Core (and potentially other formats)
  • 8. October 10, 2003 DL Brown Bag: OAI/Sheet Music 8 Dublin Core elements  Coverage  Description  Type  Relation  Source  Subject  Title  Contributor  Creator  Publisher  Rights  Date  Format  Identifier  Language
  • 9. October 10, 2003 DL Brown Bag: OAI/Sheet Music 9 OAI-PMH verbs Verb Function Identify description of archive ListMetadataFormats metadata formats supported by archive ListSets sets defined by archive ListIdentifiers OAI unique ids contained in archive ListRecords listing of N records GetRecord listing of a single record
  • 10. October 10, 2003 DL Brown Bag: OAI/Sheet Music 10 OAI resources  Web site, mailing lists  Repository explorer  Data/service provider software www.openarchives.org
  • 11. October 10, 2003 DL Brown Bag: OAI/Sheet Music 11 OAI data providers at IU  OAI data provider for DLP collections  Lilly: Hohenberger Photograph Collection, DeVincent Sheet Music Collection  IUN: U.S. Steel Photograph Collection  eventually all  Eprints: Digital Library of the Commons  AISRI  ReciprocalNet
  • 12. October 10, 2003 DL Brown Bag: OAI/Sheet Music 12 OAI data provider for DLP  PHP OAI Data Provider  Developed by University of Oldenburg  PHP, mySQL database  Perl scripts used to map USMARC, other formats to DC  MARC.pm Perl module
  • 13. October 10, 2003 DL Brown Bag: OAI/Sheet Music 13 Examples of OAI service providers  UIUC Digital Gateway to Cultural Heritage Materials  http://oai.grainger.uiuc.edu/  UMich OAIster  http://www.oaister.org/  RLG Cultural Materials (licensed)  http://www.rlg.org/culturalres/  OLAC: Open Language Archives Community  http://www.language-archives.org/
  • 14. October 10, 2003 DL Brown Bag: OAI/Sheet Music 14 Sheet Music Consortium  Partners  UCLA  Johns Hopkins  IU  Goal: Integrate access to sheet music collections  Online and print collections
  • 15. October 10, 2003 DL Brown Bag: OAI/Sheet Music 15 Sheet music  Definition  Based on physical format: generally loose sheets or folio, 1-10 pages  Much is “popular music,” but not all  Variety of research uses  Currently hard to access  Variety of metadata  Much uncataloged  Many valuable collections  MLA list  At IU: Lilly, Archives of Traditional Music
  • 20. October 10, 2003 DL Brown Bag: OAI/Sheet Music 20 Sheet Music Consortium Harvester: Timeline  March 2002: Initial planning meeting at IU  Fall 2002: Initial system prototype  Winter 2002/2003: Usability evaluation, interface redesign  Focus groups and usability testing at several sites  Fall 2003 – Version 1 of system released
  • 21. October 10, 2003 DL Brown Bag: OAI/Sheet Music 21 Why did we have to map data?  OAI requires unqualified Dublin Core  Sheet Music Harvester version 1 only collected Dublin Core  Contributed data only needed to support resource discovery  Dublin Core field definitions need interpretation  For efficient searching, data from different institutions must be consistent
  • 22. October 10, 2003 DL Brown Bag: OAI/Sheet Music 22 Some mapping issues  Field formatting important, not just contents  Choices heavily influenced by LC practice  Can’t force institutions to comply with guidelines  Sheet music has many alternative titles  Creator vs. contributor  Plate numbers: they’re important, where to put and how to label?  Uncertain dates and date ranges
  • 23. October 10, 2003 DL Brown Bag: OAI/Sheet Music 23 Mapping guidelines  Examples:  Creator: Invert name. Use the authorized form of name where possible. If needed (e.g. for an alias) repeat the field for the alternative form.  Date: Date of publication. The most recent date to appear on the music, or, the actual date of publication if not present but known. Include other dates (e.g. date of composition) if known. Codes “c” for copyright and “ca.” for circa in front of the date is allowed for now. Use repeated DC fields for each date if needed.
  • 24. October 10, 2003 DL Brown Bag: OAI/Sheet Music 24 Existing metadata formats  MARC  Encoded Archival Description (EAD)  Dublin Core (DC)  Local custom formats
  • 25. October 10, 2003 DL Brown Bag: OAI/Sheet Music 25 MARC (1)  Library of Congress – mostly from Music for the Nation: American Sheet Mus  almost 50,000 records available via OAI  already had data mapped “based on” MARC to Dublin Core crosswalk  not able to alter their mapping for participation in sheet music project
  • 26. October 10, 2003 DL Brown Bag: OAI/Sheet Music 26 MARC (2)  IU – Starr collection  little authority control  determined LC MARC2DC mapping inadequate  mapping in progress using MARC.pm  Duke – Weinmann collection  rare materials emphasis  also customized own mapping  mapping in progress
  • 27. October 10, 2003 DL Brown Bag: OAI/Sheet Music 27 EAD  Duke – Historic American Sheet Music  Item level finding aid  very robust and specific  conversion was relatively simple because data was converted to EAD from collection- specific database  included virtually all information in EAD documents to DC records
  • 28. October 10, 2003 DL Brown Bag: OAI/Sheet Music 28 Dublin Core  UCLA – Archive of Popular American Music  4 types of DC records  songs  sheet music  covers et al  recordings  mapping only required inheritance of songs and sheet music data elements down to the covers level  recordings data ignored for OAI data provider purposes
  • 29. October 10, 2003 DL Brown Bag: OAI/Sheet Music 29 Local custom formats (1)  publication (location, publisher, date)  subject  call num (box, item)  title  composer/lyricist/ arranger  form of composition  instrumentation  first line  first line of chorus  performer  dedicatee  engraver/lithographer/ artist  advertisement  plate num  duplication  Johns Hopkins – Levy collection  Simple SGML DTD
  • 30. October 10, 2003 DL Brown Bag: OAI/Sheet Music 30 Local custom formats (2)  title  composer  lyricist  place of publication  publisher  copyright  first line  first line of chorus  subject  form of composition  performance medium  copies  call #  IU – DeVincent collection  Simple MS Access database  Conversion done with Perl
  • 31. October 10, 2003 DL Brown Bag: OAI/Sheet Music 31 Harvester demonstration  <http://digital.library.ucla.edu/sheetmusic>
  • 32. October 10, 2003 DL Brown Bag: OAI/Sheet Music 32 Data inconsistencies  Different depths of description  Different levels of authority control  No common subject vocabulary between collections  Despite mapping guidelines, differences in DC interpretation
  • 33. October 10, 2003 DL Brown Bag: OAI/Sheet Music 33 Next steps?  Authority control for names  Date formats  Data clean-up: what can be done at harvester end and what must we ask data providers to do?  What will more robust data format look like?  How do we make it easier for more institutions to participate?
  • 34. October 10, 2003 DL Brown Bag: OAI/Sheet Music 34 More information  Presentation on DLP web site, with links:  www.dlib.indiana.edu/workshops/bbfall2003.htm  Email:  Jon Dunn: [email protected]  Jenn Riley: [email protected]

Editor's Notes

  • #22: Unqualified DC required, but more robust formats also allowed. More on this later. Since the purpose of the harvester is discovery of resources, and a user is taken out of the harvester to view items at individual institutions, it was not necessary to force all of the information from each institution’s records into DC. We needed to define what was required for discovery only, not figure out how to squeeze every marc field into dc. The name “Dublin Core” reveals something about its purpose. It was designed to be a core set of metadata elements applicable to all types of resources. Thus it’s meant to be flexible, with a low entry barrier. This means the definitions of fields are open to wide interpretations. We needed to develop a single interpretation that all contributors followed to make searching and browsing more effective.
  • #26: Duke (rare materials emphasis) and IU (little authority control) had records in MARC too, but these weren’t contributed in phase 1 of the project.
  • #27: Duke (rare materials emphasis) and IU (little authority control) had records in MARC too, but these weren’t contributed in phase 1 of the project.
  • #28: Explain what EAD is and what a finding aid is Very specific info, for example: subject type (LCSH, AAT, TGM), dedicatee, recordings available
  • #30: Very specific for sheet music, not good even for other types of printed music.
  • #31: No authority control
  • #33: MARC has more fields than custom DBs, but custom DBs have more applicable fields. Many MARC records don’t have relator codes, so we don’t know who’s a composer and who’s a lyricist. Even if each individual collection was under name authority, they still might not interoperate. But the problem was much worse – there wasn’t even agreement on name order! Local subject vocabs in use, so even a complex mapping between LCSH, AAT, TGM wouldn’t solve the problem.
  • #34: but we don’t know how to solve these things yet