source: collections/documented-examples/trunk/marc-e/etc/collect.cfg@ 19357

Last change on this file since 19357 was 19357, checked in by kjdon, 15 years ago

added a sampleoid macro for the sample doc OID, used that in the links to documents, made OIDtype incremental so we get a stable id across platforms

  • Property svn:executable set to *
File size: 11.7 KB
Line 
1creator [email protected]
2maintainer [email protected]
3public true
4
5buildtype mgpp
6
7indexes text dc.Title
8defaultindex text
9
10levels document
11
12plugin GreenstoneXMLPlugin
13plugin MARCPlugin -OIDtype incremental
14plugin MetadataXMLPlugin
15plugin ArchivesInfPlugin
16plugin DirectoryPlugin
17
18classify AZList -metadata dc.Title -removesuffix \\s*(\\/|;|,|\\.).*
19classify AZCompactList -metadata dc.Creator -removesuffix (b\\.\\s+)?(\\d+(\\-?))(\\d+(\\.)?)?
20classify AZCompactList -metadata dc.Subject
21
22format VList "<td>[link][icon][/link]</td>
23<td>{If}{[numleafdocs],[Title],<b>[dc.Title]</b>{If}{[dc.Creator], [sibling:dc.Creator]}{If}{[dc.Publisher], <i>[dc.Publisher]</i>}}</td>{If}{[numleafdocs],<td><i>([numleafdocs])</i></td>}"
24
25#format CL2VList "<td>[link][icon][/link]</td><td>{If}{[numleafdocs],[Title],[dc.Creator]; <i>[dc.Title]</i>{If}{[dc.Publisher], <i>[dc.Publisher]</i>}}</td><td>{If}{[numleafdocs],<i>([numleafdocs])</i>}</td>"
26#format CL3VList "<td>[link][icon][/link]</td><td>{If}{[numleafdocs]{[Title],[dc.Title]{If}{[dc.Creator], [dc.Creator]}{If}{[dc.Publisher], [dc.Publisher]}}</td><td>{If}{[numleafdocs],<i>([numleafdocs])</i>}</td>"
27
28format HList "[link][highlight][ex.Title][/highlight][/link]"
29
30format DocumentHeading "[dc.Title]<br>"
31
32format DocumentText "[Text]"
33
34format DocumentButtons ""
35
36format SearchTypes "plain,form"
37
38collectionmeta .text [l=en] "_labeltext_"
39collectionmeta .Title [l=en] "_labelTitle_"
40collectionmeta .Source [l=en] "_labelSource_"
41
42collectionmeta sampleoid "D0s156"
43
44# -- English strings ----------------
45collectionmeta collectionname [l=en] "MARC example"
46
47# -- French strings ----------------
48collectionmeta collectionname [l=fr] "Exemple MARC"
49
50# -- Spanish strings ----------------
51collectionmeta collectionname [l=es] "Colección de muestra MARC"
52
53# -- Russian strings ----------------
54collectionmeta collectionname [l=ru] "ПрОЌер MARC"
55
56# -- English text ----------------
57collectionmeta collectionextra [l=en] "This collection, which contains _about:numdocs_ MARC entries, is based on the MARC records of working papers published by
58<a href=\"http://www.cs.waikato.ac.nz\">Computer Science Department</a> at <a href=\"http://www.waikato.ac.nz\">the University of Waikato</a>.
59<a href=\"_gwcgi_?l=_cgiargl_&a=d&c=_cgiargc_&d=_sampleoid_\">Here</a> is a sample document in the collection.
60
61<h3>How the collection works</h3>\n
62
63<p>
64The <a href=\"_httpcollection_/etc/collect.cfg\" target=collect.cfg>configuration file</a> uses <i>MARCPlugin</i> to process the MARC records, as well as the
65standard plugins. There are three classifiers, based on <i>dc.Title</i>,
66<i>dc.Creator</i>, and <i>dc.Subject</i> metadata. The <i>Title</i> classifier uses AZList, while the other two use AZCompactList, which groups items with the same metadata into a bookshelf. The <i>-removesuffix</i> argument for the <i>Title</i> and <i>Creator</i>
67classifiers removes suffixes from the metadata string (<i>dc.Title</i> and
68<i>dc.Creator</i> respectively). This is specified as a PERL regular
69expression, and trims characters (such as trailing punctuation)
70from the strings for display.
71<p>
72The <i>VList</i> format statement controls the display of search results and all classifiers. For bookshelves, the number of leaf documents is displayed on the right-hand side. For documents, <i>dc.Title</i> is displayed, along with <i>dc.Creator</i> and <i>dc.Publisher</i>. <i>[sibling:dc.Creator]</i> is used as dc.Creator has multiple values, and specifies that all values be output, not just the first one.
73<p>
74
75The MARC plugin uses a special file to map MARC field numbers to
76Greenstone-style metadata. This file resides in the
77<i>greenstone/etc</i> directory, and is called
78<a href=\"_httpprefix_/etc/marc2dc.txt\" target=marc2dc>marc2dc.txt</a>.
79It lists the correspondences between MARC field numbers and Greenstone
80metadata. Any MARC fields that are not listed simply do not
81appear as metadata, though they are still present in the
82Greenstone document. Each line in the file has the format
83<blockquote>
84&lt;MARC field number&gt; -&gt; GreenstoneMetadataName
85</blockquote>
86Lines in the file that begin with \"\#\" are comments. \n
87<p>
88The standard version of this file is loosely based on
89the MARC to Dublin Core mapping found at <a
90href=\"http://www.loc.gov/marc/marc2dc.html\">http://www.loc.gov/marc/marc2dc.html</a>
91(which assumes USMARC/MARC21).
92<p>
93Multiple MARC fields may map to a single Dublin Core field.
94For example, fields 720 (\"Uncontrolled name\"), 100 (\"Personal name\"),
95110 (\"Corporate name\") and 111 (\"Meeting name\") all map to
96<i>dc.Creator</i>. Actual MARC records normally
97define only one of these fields, and anyway Greenstone allows
98multi-valued metadata.\n
99<p>
100Some mappings are dependent on subfields. For example, MARC field 260 contains
101information about publication and distribution. Subfields \"c\" (Date of Publication) and \"g\" (Date of manufacture) are mapped to <i>dc.Date</i>, using the following mapping line:
102<blockquote>
103260$c$g -&gt; dc.Date
104</blockquote>
105
106Greenstone also provides a file for mapping MARC to <b>qualified</b> dublin core: <a href=\"_httpprefix_/etc/marc2qdc.txt\" target=marc2qdc>greenstone/etc/marc2qdc.txt</a>. This can be used by the MARC plugin by setting the <i>-metadata_mapping_file</i> option to \"marc2qdc.txt\".
107"
108
109# -- French text ----------------
110collectionmeta collectionextra [l=fr] "Cette collection, which contains _about:numdocs_ MARC entries, is based on the MARC records of working papers published by
111<a href=\"http://www.cs.waikato.ac.nz\">Computer Science Department</a> at <a href=\"http://www.waikato.ac.nz\">the University of Waikato</a>.
112<a href=\"_gwcgi_?l=_cgiargl_&a=d&c=_cgiargc_&d=_sampleoid_\">Voici</a> un document échantillon dans la collection.
113
114<h3>Comment marche cette collection ?</h3>
115
116<p>
117Le <a href=\"_httpcollection_/etc/collect.cfg\" target=collect.cfg>fichier de configuration</a> utilise <i>ZipPlug</i> et <i>MarcPlug</i> à coté des trois qui sont utilisés en standard.
118Il y a trois classificateurs basés sur les méta-données <i>dc.Title</i>, <i>dc.Creator</i> et <i>dc.Subject</i>. _text1_
119 Ceci est spécifié comme une expression
120réguliÚre de PERL et ajuste pour l\'affichage des caractÚres (comme une ponctuation finale).
121<p>
122_text2_
123
124<p>
125Le module Marc utilise un fichier spécial pour la mise en correspondance des numéros de champ MARC avec les méta-donnée de type Greenstone. Ce fichier se trouve dans le répertoire <i>gsdl/etc</i>,
126et est appelé <a href=\"_httpprefix_/etc/marc2dc.txt\" target=marc2dc>marc2dc.txt</a>. Il liste les correspondences entre les numéros de champ MARC et les méta-données Greenstone. Tous les champs MARC qui ne sont pas listés ne vont pas apparaître comme méta-données, bien qu\'ils spient toujours présents dans le document Greenstone. Chaque ligne dans le fichier à le format:
127
128<blockquote>
129&lt;numéro de champ MARC&gt; -&gt; NonMetadonnéeGreenstone
130</blockquote>
131
132Les lignes dans le fichier qui commence par \"\#\" sont des commentaires.
133
134<p>
135La version standard de ce fichier est approximativement basée sur la mise en correspondance MARC - Dublin Core trouvé à <a href=\"http://www.loc.gov/marc/marc2dc.html\">http://www.loc.gov/marc/marc2dc.html</a>
136(qui suppose USMARC/MARC21).
137_text3_
138"
139
140# -- Spanish text ----------------
141collectionmeta collectionextra [l=es] "Esta colección, which contains _about:numdocs_ MARC entries, is based on the MARC records of working papers published by
142<a href=\"http://www.cs.waikato.ac.nz\">Computer Science Department</a> at <a href=\"http://www.waikato.ac.nz\">the University of Waikato</a>. He <a href=\"_gwcgi_?l=_cgiargl_&a=d&c=_cgiargc_&d=_sampleoid_\">aquí</a> un documento muestra de la colección.
143<p>
144
145<h3>Cómo trabaja esta colección</h3>
146<p>
147
148El <a href=\"_httpcollection_/etc/collect.cfg\" target=collect.cfg>archivo de configuración</a> utiliza <i>MARCPlugin</i> además de los tres plugins estándar. Existen tres clasificadores basados en los metadatos <i>dc.Título</i>, <i>dc.Creador</i> y <i>dc.Asunto</i>. _text1_ Esto se especifica como una expresión regular PERL y ordena los caracteres (como la puntuación que se encuentra al final) a partir de las cadenas para su presentación en pantalla.
149<p>
150_text2_
151
152<p>
153
154El plugin de MARC utiliza un archivo especial para mapear y convertir los números de campo MARC a metadatos estilo Greenstone. Este archivo reside en el directorio <i>gsdl/etc</i> y se llama <a href=\"_httpprefix_/etc/marc2dc.txt\" target=marc2dc>marc2dc.txt</a>. Incluye una lista de las correspondencias entre los números de campo MARC y los metadatos Greenstone. Cualquier campo MARC que no esté incluido en la lista simplemente no aparece como metadato, aunque siga estando presente en el documento Greenstone. Cada línea del archivo tiene el siguiente formato:
155<p>
156
157<blockquote>
158&lt;MARC field number&gt; -&gt; GreenstoneMetadataName
159</blockquote>
160<p>
161
162Las líneas en el archivo que empiezan con \"\#\" son comentarios.
163<p>
164
165La versión estándar de este archivo se basa libremente en el mapeo de MARC a Dublin Core que se encuentra en <a href=\"http://www.loc.gov/marc/marc2dc.html\">http://www.loc.gov/marc/marc2dc.html</a> (el cual asume USMARC/MARC21).
166
167_text3_
168"
169
170# -- Russian text ----------------
171collectionmeta collectionextra [l=ru] "This collection, which contains _about:numdocs_ MARC entries, is based on the MARC records of working papers published by
172<a href=\"http://www.cs.waikato.ac.nz\">Computer Science Department</a> at <a href=\"http://www.waikato.ac.nz\">the University of Waikato</a>. <a href=\"_gwcgi_?l=_cgiargl_&a=d&c=_cgiargc_&d=_sampleoid_\">ЗЎесь</a> преЎставлеМ тОпПвПй ЎПкуЌеМт ЎаММПй кПллекцОО.
173
174<h3>Как рабПтает кПллекцОя</h3>
175<p>
176
177<a href=\"_httpcollection_/etc/collect.cfg\" target=collect.cfg>КПМфОгурацОПММый файл</a> ОспПльзует плагОМы <i>ZIPPlug</i> О <i>MARCPlug</i>, пПЌОЌП третьегП стаМЎарта. ИЌеются трО классОфОкатПра, ПсМПваММые Ма ЌетаЎаММых
178 <i>НазваМОя (dc.Title)</i>, <i>СПзЎателя (dc.Creator)</i> О <i>ПреЎЌета (dc.Subject)</i>. _text1_ ЭтП требует устаМПвкО PERL Ўля правОльМПгП выражеМОя, О урезает зМакО (тОпа кПМечМых
179 зМакПв препОМаМОя) Оз стрПкО Ўля ПтПбражеМОя.
180<p>
181_text2_
182<p>
183
184ПлагОМ MARC ОспПльзует спецОальМый файл Ўля сПпПставлеМОя МПЌера пПля в MARC О ЌетаЎаММых
185 Greenstone. ЭтПт файл Мах
186ПЎОтся в ЎОректПрОО <i>gsdl/etc</i> О Мазывается <a href=\"_httpprefix_/etc/marc2dc.txt\" target=marc2dc>marc2dc.txt</a>. ОМ преЎставляет спОсПк сППтветствОя ЌежЎу МПЌераЌО пПлей сОстеЌы MARC О ЌетаЎаММыЌО Greenstone. ᅵюбые пПля MARC, кПтПрые Ме вМесеМы в спОсПк, прПстП Ме пПявляются как ЌетаЎаММые, х
187Птя ПМО прОсутствуют в ЎПкуЌеМте Greenstone. КажЎая стрПка в этПЌ файле ОЌеет слеЎующОй фПрЌат.
188
189<blockquote>
190&lt;MARC field number&gt; -&gt; GreenstoneMetadataName
191</blockquote>
192СтрПкО в ЎаММПЌ файле, кПтПрые МачОМаются сП зМака \"\#\", являются кПЌЌеМтарОяЌО.
193<p>
194
195СтаМЎартМая версОя этПгП файла свПбПЎМП базОруется Ма MARC в ПпОсаМОО Dublin Core, кПтПрПе вПспрОМОЌает USMARC/MARC21 О кПтПрПе ЌПжМП МайтО Ма <a href=\"http://www.loc.gov/marc/marc2dc.html\">http://www.loc.gov/marc/marc2dc.html</a>.
196
197_text3_
198"
Note: See TracBrowser for help on using the repository browser.