source: collections/documented-examples/trunk/marc-e/etc/collect.cfg@ 19249

Last change on this file since 19249 was 19249, checked in by kjdon, 12 years ago

updated description for 2.82

  • Property svn:executable set to *
File size: 11.7 KB
creator greenstone@cs.waikato.ac.nz
maintainer greenstone@cs.waikato.ac.nz
public true

buildtype mgpp

indexes text dc.Title
defaultindex text

levels document

plugin GreenstoneXMLPlugin
plugin MARCPlugin
plugin MetadataXMLPlugin
plugin ArchivesInfPlugin
plugin DirectoryPlugin
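
# A hedged sketch, not part of the original configuration: the collection
# description in this file says MARCPlugin can use the qualified Dublin Core
# mapping by setting -metadata_mapping_file to "marc2qdc.txt". The MARCPlugin
# line above might then look like the commented-out line below. Whether the
# indexes, classifiers and format statements would need different metadata
# names is an assumption to check against marc2qdc.txt.
#plugin MARCPlugin -metadata_mapping_file marc2qdc.txt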

classify AZList -metadata dc.Title -removesuffix \\s*(\\/|;|,|\\.).*
classify AZCompactList -metadata dc.Creator -removesuffix (b\\.\\s+)?(\\d+(\\-?))(\\d+(\\.)?)?
classify AZCompactList -metadata dc.Subject
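
# Illustration only. The sample value below is made up and is not taken from
# the collection. Assuming the doubled backslashes are unescaped when this file
# is read, the dc.Title pattern is the Perl regular expression
#   \s*(\/|;|,|\.).*
# which matches from the first "/", ";", "," or "." (plus any whitespace just
# before it) to the end of the string, so a value such as
#   Data compression / by A. Author.
# would be reduced to
#   Data compression
# The dc.Creator pattern, (b\.\s+)?(\d+(\-?))(\d+(\.)?)?, is aimed at date
# information such as "1950-" or "b. 1950" that follows a name.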

format VList "<td>[link][icon][/link]</td>
<td>{If}{[numleafdocs],[Title],<b>[dc.Title]</b>{If}{[dc.Creator], [sibling:dc.Creator]}{If}{[dc.Publisher], <i>[dc.Publisher]</i>}}</td>{If}{[numleafdocs],<td><i>([numleafdocs])</i></td>}"

#format CL2VList "<td>[link][icon][/link]</td><td>{If}{[numleafdocs],[Title],[dc.Creator]; <i>[dc.Title]</i>{If}{[dc.Publisher], <i>[dc.Publisher]</i>}}</td><td>{If}{[numleafdocs],<i>([numleafdocs])</i>}</td>"
#format CL3VList "<td>[link][icon][/link]</td><td>{If}{[numleafdocs],[Title],[dc.Title]{If}{[dc.Creator], [dc.Creator]}{If}{[dc.Publisher], [dc.Publisher]}}</td><td>{If}{[numleafdocs],<i>([numleafdocs])</i>}</td>"

format HList "[link][highlight][ex.Title][/highlight][/link]"

format DocumentHeading "[dc.Title]<br>"

format DocumentText "[Text]"

format DocumentButtons ""

format SearchTypes "plain,form"

collectionmeta .text [l=en] "_labeltext_"
collectionmeta .Title [l=en] "_labelTitle_"
collectionmeta .Source [l=en] "_labelSource_"


# -- English strings ----------------
collectionmeta collectionname [l=en] "MARC example"

# -- French strings ----------------
collectionmeta collectionname [l=fr] "Exemple MARC"

# -- Spanish strings ----------------
collectionmeta collectionname [l=es] "Colección de muestra MARC"

# -- Russian strings ----------------
collectionmeta collectionname [l=ru] "Пример MARC"

# -- English text ----------------
collectionmeta collectionextra [l=en] "This collection, which contains _about:numdocs_ MARC entries, is based on the MARC records of working papers published by the
<a href=\"http://www.cs.waikato.ac.nz\">Computer Science Department</a> at <a href=\"http://www.waikato.ac.nz\">the University of Waikato</a>.
<a href=\"_gwcgi_?l=_cgiargl_&a=d&c=_cgiargc_&d=HASH0127daf20f8b3eec5c5db834s9\">Here</a> is a sample document in the collection.

<h3>How the collection works</h3>\n

<p>
The <a href=\"_httpcollection_/etc/collect.cfg\" target=collect.cfg>configuration file</a> uses <i>MARCPlugin</i> to process the MARC records, as well as the
standard plugins. There are three classifiers, based on <i>dc.Title</i>,
<i>dc.Creator</i>, and <i>dc.Subject</i> metadata. The <i>Title</i> classifier uses AZList, while the other two use AZCompactList, which groups items with the same metadata into a bookshelf. The <i>-removesuffix</i> argument for the <i>Title</i> and <i>Creator</i>
classifiers removes suffixes from the metadata string (<i>dc.Title</i> and
<i>dc.Creator</i> respectively). It is specified as a Perl regular
expression, and trims characters (such as trailing punctuation)
from the strings for display.
<p>
The <i>VList</i> format statement controls the display of search results and all classifiers. For bookshelves, the number of leaf documents is displayed on the right-hand side. For documents, <i>dc.Title</i> is displayed, along with <i>dc.Creator</i> and <i>dc.Publisher</i>. <i>[sibling:dc.Creator]</i> is used because dc.Creator has multiple values; it specifies that all values be output, not just the first one.
<p>

The MARC plugin uses a special file to map MARC field numbers to
Greenstone-style metadata. This file resides in the
<i>greenstone/etc</i> directory, and is called
<a href=\"_httpprefix_/etc/marc2dc.txt\" target=marc2dc>marc2dc.txt</a>.
It lists the correspondences between MARC field numbers and Greenstone
metadata. Any MARC fields that are not listed do not
appear as metadata, though they are still present in the
Greenstone document. Each line in the file has the format
<blockquote>
&lt;MARC field number&gt; -&gt; GreenstoneMetadataName
</blockquote>
Lines in the file that begin with \"\#\" are comments. \n
<p>
The standard version of this file is loosely based on
the MARC to Dublin Core mapping found at <a
href=\"http://www.loc.gov/marc/marc2dc.html\">http://www.loc.gov/marc/marc2dc.html</a>
(which assumes USMARC/MARC21).
<p>
Multiple MARC fields may map to a single Dublin Core field.
For example, fields 720 (\"Uncontrolled name\"), 100 (\"Personal name\"),
110 (\"Corporate name\") and 111 (\"Meeting name\") all map to
<i>dc.Creator</i>. Actual MARC records normally
define only one of these fields, and in any case Greenstone allows
multi-valued metadata.\n
<p>
Some mappings are dependent on subfields. For example, MARC field 260 contains
information about publication and distribution. Subfields \"c\" (Date of publication) and \"g\" (Date of manufacture) are mapped to <i>dc.Date</i>, using the following mapping line:
<blockquote>
260$c$g -&gt; dc.Date
</blockquote>

Greenstone also provides a file for mapping MARC to <b>qualified</b> Dublin Core: <a href=\"_httpprefix_/etc/marc2qdc.txt\" target=marc2qdc>greenstone/etc/marc2qdc.txt</a>. This can be used by the MARC plugin by setting the <i>-metadata_mapping_file</i> option to \"marc2qdc.txt\".
"
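
# A minimal sketch of mapping-file lines that match the English description
# above. These lines would belong in a mapping file such as marc2dc.txt, not in
# collect.cfg, and they are written from that description rather than copied
# from the distributed file, so the real file may differ:
#   # lines beginning with a hash are comments
#   100 -> dc.Creator
#   110 -> dc.Creator
#   111 -> dc.Creator
#   720 -> dc.Creator
#   260$c$g -> dc.Date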

# -- French text ----------------
collectionmeta collectionextra [l=fr] "Cette collection, which contains _about:numdocs_ MARC entries, is based on the MARC records of working papers published by the
<a href=\"http://www.cs.waikato.ac.nz\">Computer Science Department</a> at <a href=\"http://www.waikato.ac.nz\">the University of Waikato</a>.
<a href=\"_gwcgi_?l=_cgiargl_&a=d&c=_cgiargc_&d=HASH0127daf20f8b3eec5c5db834s9\">Voici</a> un document échantillon dans la collection.

<h3>Comment marche cette collection ?</h3>

<p>
Le <a href=\"_httpcollection_/etc/collect.cfg\" target=collect.cfg>fichier de configuration</a> utilise <i>MARCPlugin</i> à côté des modules standard.
Il y a trois classificateurs basés sur les méta-données <i>dc.Title</i>, <i>dc.Creator</i> et <i>dc.Subject</i>. _text1_
 Ceci est spécifié comme une expression
régulière de Perl et supprime pour l\'affichage des caractères (comme une ponctuation finale).
<p>
_text2_

<p>
Le module Marc utilise un fichier spécial pour la mise en correspondance des numéros de champ MARC avec les méta-données de type Greenstone. Ce fichier se trouve dans le répertoire <i>gsdl/etc</i>,
et est appelé <a href=\"_httpprefix_/etc/marc2dc.txt\" target=marc2dc>marc2dc.txt</a>. Il liste les correspondances entre les numéros de champ MARC et les méta-données Greenstone. Tous les champs MARC qui ne sont pas listés ne vont pas apparaître comme méta-données, bien qu\'ils soient toujours présents dans le document Greenstone. Chaque ligne dans le fichier a le format :

<blockquote>
&lt;numéro de champ MARC&gt; -&gt; NomMetadonnéeGreenstone
</blockquote>

Les lignes dans le fichier qui commencent par \"\#\" sont des commentaires.

<p>
La version standard de ce fichier est approximativement basée sur la mise en correspondance MARC - Dublin Core trouvée à <a href=\"http://www.loc.gov/marc/marc2dc.html\">http://www.loc.gov/marc/marc2dc.html</a>
(qui suppose USMARC/MARC21).
_text3_
"

# -- Spanish text ----------------
collectionmeta collectionextra [l=es] "Esta colección, which contains _about:numdocs_ MARC entries, is based on the MARC records of working papers published by the
<a href=\"http://www.cs.waikato.ac.nz\">Computer Science Department</a> at <a href=\"http://www.waikato.ac.nz\">the University of Waikato</a>. He <a href=\"_gwcgi_?l=_cgiargl_&a=d&c=_cgiargc_&d=HASH0127daf20f8b3eec5c5db834s9\">aquí</a> un documento muestra de la colección.
<p>

<h3>Cómo trabaja esta colección</h3>
<p>

El <a href=\"_httpcollection_/etc/collect.cfg\" target=collect.cfg>archivo de configuración</a> utiliza <i>MARCPlugin</i> además de los plugins estándar. Existen tres clasificadores basados en los metadatos <i>dc.Title</i>, <i>dc.Creator</i> y <i>dc.Subject</i>. _text1_ Esto se especifica como una expresión regular de Perl y recorta caracteres (como la puntuación que se encuentra al final) de las cadenas para su presentación en pantalla.
<p>
_text2_

<p>

El plugin de MARC utiliza un archivo especial para mapear y convertir los números de campo MARC a metadatos estilo Greenstone. Este archivo reside en el directorio <i>gsdl/etc</i> y se llama <a href=\"_httpprefix_/etc/marc2dc.txt\" target=marc2dc>marc2dc.txt</a>. Incluye una lista de las correspondencias entre los números de campo MARC y los metadatos Greenstone. Cualquier campo MARC que no esté incluido en la lista simplemente no aparece como metadato, aunque siga estando presente en el documento Greenstone. Cada línea del archivo tiene el siguiente formato:
<p>

<blockquote>
&lt;MARC field number&gt; -&gt; GreenstoneMetadataName
</blockquote>
<p>

Las líneas en el archivo que empiezan con \"\#\" son comentarios.
<p>

La versión estándar de este archivo se basa libremente en el mapeo de MARC a Dublin Core que se encuentra en <a href=\"http://www.loc.gov/marc/marc2dc.html\">http://www.loc.gov/marc/marc2dc.html</a> (el cual asume USMARC/MARC21).

_text3_
"

# -- Russian text ----------------
collectionmeta collectionextra [l=ru] "This collection, which contains _about:numdocs_ MARC entries, is based on the MARC records of working papers published by the
<a href=\"http://www.cs.waikato.ac.nz\">Computer Science Department</a> at <a href=\"http://www.waikato.ac.nz\">the University of Waikato</a>. <a href=\"_gwcgi_?l=_cgiargl_&a=d&c=_cgiargc_&d=HASH0127daf20f8b3eec5c5db834s9\">Здесь</a> представлен типовой документ данной коллекции.

<h3>Как работает коллекция</h3>
<p>

<a href=\"_httpcollection_/etc/collect.cfg\" target=collect.cfg>Конфигурационный файл</a> использует плагин <i>MARCPlugin</i>, помимо стандартных плагинов. Имеются три классификатора, основанные на метаданных <i>Названия (dc.Title)</i>, <i>Создателя (dc.Creator)</i> и <i>Предмета (dc.Subject)</i>. _text1_ Это задаётся в виде регулярного выражения Perl и обрезает символы (например, конечные знаки препинания) в строках для отображения.
<p>
_text2_
<p>

Плагин MARC использует специальный файл для сопоставления номера поля в MARC и метаданных Greenstone. Этот файл находится в директории <i>gsdl/etc</i> и называется <a href=\"_httpprefix_/etc/marc2dc.txt\" target=marc2dc>marc2dc.txt</a>. Он представляет список соответствия между номерами полей системы MARC и метаданными Greenstone. Любые поля MARC, которые не внесены в список, просто не появляются как метаданные, хотя они присутствуют в документе Greenstone. Каждая строка в этом файле имеет следующий формат.

<blockquote>
&lt;MARC field number&gt; -&gt; GreenstoneMetadataName
</blockquote>
Строки в данном файле, которые начинаются со знака \"\#\", являются комментариями.
<p>

Стандартная версия этого файла свободно базируется на MARC в описании Dublin Core, которое воспринимает USMARC/MARC21 и которое можно найти на <a href=\"http://www.loc.gov/marc/marc2dc.html\">http://www.loc.gov/marc/marc2dc.html</a>.

_text3_
"