source: gsdl/trunk/etc/marc2qdc.txt@ 15076

Last change on this file since 15076 was 15018, checked in by davidb, 16 years ago

MARC mapping upgraded to support a richer set of operations, including subfields, multiple fields on one line (separated by commas), and the removal of rules, e.g. -245 at the start of a line. A MARC to Qualified Dublin Core crosswalk from the Library of Congress has been added as "etc/marc2qdc.txt". A collection can then choose to, for example, top up the mapping with its own version of the file stored in its local "etc" folder, specifying only the rules that are different. This is where a rule like "-245" might be used to override a more general rule from the main file that has all subfields in 245 mapping to one metadata item (Title). If the user specifies a different filename (through a plugin option), then they are free to devise a mapping from scratch and store it in the collection's local "etc" folder.
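The top-up behaviour described above can be sketched in Python. This is only an illustration and not Greenstone's own code: the names `parse_line` and `apply_rules` are hypothetical, and the sketch assumes that a removal line such as "-245" drops only the rules keyed on exactly that field specification, and that a collection's local rules are applied after the main file's rules. The "245$b" rule in the local list is likewise made up for the example.

```python
def parse_line(line):
    """Split one mapping line into (action, field_specs, target).

    Returns None for blank lines and comment-only lines.
    """
    text = line.split("#", 1)[0].strip()  # drop any trailing comment
    if not text:
        return None
    if text.startswith("-"):
        # Removal rule, e.g. "-245" (assumed to list one or more specs)
        specs = [s.strip() for s in text[1:].split(",") if s.strip()]
        return ("remove", specs, None)
    if "->" not in text:
        raise ValueError(f"unrecognised mapping line: {line!r}")
    left, _, target = text.partition("->")
    specs = [s.strip() for s in left.split(",") if s.strip()]
    return ("add", specs, target.strip())


def apply_rules(mapping, lines):
    """Apply mapping lines, in order, to a dict of spec -> list of targets."""
    for line in lines:
        parsed = parse_line(line)
        if parsed is None:
            continue
        action, specs, target = parsed
        for spec in specs:
            if action == "remove":
                mapping.pop(spec, None)  # assumption: exact-match removal
            else:
                mapping.setdefault(spec, []).append(target)
    return mapping


# Two rules taken from the main file, then a hypothetical local top-up.
main_rules = ["245 -> Title", "100, 110, 111 -> Creator"]
local_rules = [
    "-245",
    "245$a -> Title",
    "245$b -> Title^Alternative  # hypothetical local rule",
]

mapping = apply_rules(apply_rules({}, main_rules), local_rules)
print(mapping)
```

Keeping a list of targets per field specification reflects the main file, where the same specification (for example 530) maps to more than one metadata item.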

File size: 2.9 KB
# The following mapping is based on the Library of Congress crosswalk given at:
# http://www.loc.gov/marc/marc2dc.html

245 -> Title
130, 210, 240, 242, 246 -> Title^Alternative
730, 740 -> Title^Alternative

100, 110, 111 -> Creator
700, 710, 711 -> Creator
720 -> Creator

# at later date might want to distinguish between
# (LCSH encoding, 2nd indicator=0) and (MeSH encoding, 2nd indicator=2)
600, 610, 611, 630, 650 -> Subject # LCSH or MeSH encoding
050 -> Subject # LCC encoding
082 -> Subject # DDC encoding
080 -> Subject # UDC encoding

500-504, 507-519, 521-529 -> Description
531-539, 541-545, 547-599 -> Description
505 -> Description^TableofContents
520 -> Description^Abstract # First indicator=3

260$a$b -> Publisher

260$c$g -> Date^Created
533$d -> Date^Created
260$c -> Date^Issued
# Control field substring, not currently supported by mapping
# 008/07-10 -> Date^Issued

# Leader fields not currently supported by mapping
# Leader06, Leader07 -> Type # DCMI Type Vocabulary
655 -> Type # Subfield $2=dct

856$q -> Format # IMT encoding
300$a -> Format^Extent
533$e -> Format^Extent
340$a -> Format^Medium

856$u -> Identifier # URI encoding

786$o -> Source # URI encoding

# Control field substring, not currently supported
# 008/35-37 -> Language # ISO 639-2 encoding
041 -> Language # ISO 639-2 encoding
546 -> Language # RFC1766 encoding

775 -> Relation^IsVersionOf
786$n$t -> Relation^IsVersionOf
775,786$o -> Relation^IsVersionOf # URI encoding
775$n$t -> Relation^HasVersion
775$o -> Relation^HasVersion # URL encoding
785$n$t -> Relation^IsReplacedBy
785$o -> Relation^IsReplacedBy # URL encoding
780$n$t -> Relation^Replaces
780$o -> Relation^Replaces # URL encoding
538 -> Relation^Requires
760 -> Relation^IsPartOf
773$n$t -> Relation^IsPartOf
440,490,800,810,811,830 -> Relation^IsPartOf
760,773$o -> Relation^IsPartOf # URL encoding
774$n$t -> Relation^HasPart
774$o -> Relation^HasPart # URL encoding
510 -> Relation^IsReferencedBy
776$n$t -> Relation^IsFormatOf
530 -> Relation^IsFormatOf
776$o -> Relation^IsFormatOf # URL encoding
530$u -> Relation^IsFormatOf # URL encoding
776$n$t -> Relation^HasFormat
530 -> Relation^HasFormat
776$o -> Relation^HasFormat # URL encoding
530$u -> Relation^HasFormat # URL encoding

522, 651 -> Coverage^Spatial
255 -> Coverage^Spatial # Some 255 information equivalent to DC encoding scheme but different syntax

650$z -> Coverage^Spatial
752 -> Coverage^Spatial

043$c,044$c -> Coverage^Spatial # ISO 3166 encoding
651 -> Coverage^Spatial # Subfield $2=tgn # Encoding TGN
513$b -> Coverage^Temporal
033$a -> Coverage^Temporal

506, 540 -> Rights

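The left-hand sides in the file use three forms: a single tag (050), a tag range (500-504), and a tag followed by subfield codes (260$a$b). The Python sketch below shows one way such a field specification could be expanded into (tag, subfields) pairs. It is a hedged illustration only: the name `expand_spec` and the regular expression are assumptions, the plugin that reads this file may interpret the syntax differently, and each comma-separated item is treated independently, so a line such as "775,786$o" would first have to be split on commas by the caller.

```python
import re

# Three-digit tag, optional "-NNN" range end, optional run of "$x" subfield codes.
SPEC = re.compile(r"^(\d{3})(?:-(\d{3}))?((?:\$\w)*)$")


def expand_spec(spec):
    """Expand one field spec into a list of (tag, subfield_codes) pairs."""
    match = SPEC.match(spec.strip())
    if match is None:
        raise ValueError(f"unrecognised field spec: {spec!r}")
    first, last, subs = match.groups()
    subfields = subs.split("$")[1:]  # "$a$b" -> ["a", "b"], "" -> []
    last = last or first             # a single tag is a range of one
    return [(f"{n:03d}", subfields) for n in range(int(first), int(last) + 1)]


print(expand_spec("050"))
print(expand_spec("260$a$b"))
print(expand_spec("500-504"))
```

Tags are kept as zero-padded strings rather than integers so that fields such as 050 and 033 keep their leading zero.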