source: documented-examples/trunk/marc-e/resources/collectionConfig.properties@ 36275

Last change on this file since 36275 was 36275, checked in by anupama, 2 months ago

Ported MARC-e DEC collection to GS3. As always, the collection descriptions may need rewriting.

File size: 3.4 KB
name=MARC example
.Title=
.text=
.Source=
sampleoid=D0s156

shortDescription=<p>This collection, which contains _about:numdocs_ MARC entries, is based on the MARC records of working papers published by the <a href="http://www.cs.waikato.ac.nz">Computer Science Department</a> at <a href="http://www.waikato.ac.nz">the University of Waikato</a>. <a href="?a=d&d=_sampleoid_">Here</a> is a sample document in the collection.</p>

description1=<h3>How the collection works</h3><p>The <a href="_httpcollection_/etc/collect.cfg" target=collect.cfg>configuration file</a> uses <i>MARCPlugin</i> to process the MARC records, as well as the standard plugins. There are three classifiers, based on <i>dc.Title</i>, <i>dc.Creator</i>, and <i>dc.Subject</i> metadata. The <i>Title</i> classifier uses AZList, while the other two use AZCompactList, which groups items with the same metadata into a bookshelf. The <i>-removesuffix</i> argument for the <i>Title</i> and <i>Creator</i> classifiers removes suffixes from the metadata string (<i>dc.Title</i> and <i>dc.Creator</i> respectively). The suffix is specified as a Perl regular expression and is used to trim characters (such as trailing punctuation) from the strings for display.</p>
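# Illustrative sketch only, not copied from this collection's collect.cfg: the three
# classifiers described in description1 might be declared roughly as follows. The
# -metadata option name is assumed from common Greenstone classifier usage, and
# <perl-regex> is a hypothetical placeholder; the real -removesuffix patterns are not shown here.
#   classify AZList        -metadata dc.Title   -removesuffix "<perl-regex>"
#   classify AZCompactList -metadata dc.Creator -removesuffix "<perl-regex>"
#   classify AZCompactList -metadata dc.Subject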
description2=<p>The <i>VList</i> format statement controls the display of search results and all classifiers. For bookshelves, the number of leaf documents is displayed on the right-hand side. For documents, <i>dc.Title</i> is displayed, along with <i>dc.Creator</i> and <i>dc.Publisher</i>. <i>[sibling:dc.Creator]</i> is used because <i>dc.Creator</i> can have multiple values; it specifies that all values be output, not just the first one.</p>

description3=<p>The MARC plugin uses a special file to map MARC field numbers to Greenstone-style metadata. This file resides in the <i>greenstone/etc</i> directory, and is called <a href="_httpsite_/etc/marc2dc.txt" target=marc2dc>marc2dc.txt</a>. It lists the correspondences between MARC field numbers and Greenstone metadata. Any MARC fields that are not listed simply do not appear as metadata, though they are still present in the Greenstone document. Each line in the file has the format <blockquote> &lt;MARC field number&gt; -&gt; GreenstoneMetadataName </blockquote> Lines in the file that begin with "\#" are comments.</p>

description4=<p>The standard version of this file is loosely based on the MARC to Dublin Core mapping found at <a href="http://www.loc.gov/marc/marc2dc.html">http://www.loc.gov/marc/marc2dc.html</a> (which assumes USMARC/MARC21).</p>

description5=<p>Multiple MARC fields may map to a single Dublin Core field. For example, fields 720 ("Uncontrolled name"), 100 ("Personal name"), 110 ("Corporate name") and 111 ("Meeting name") all map to <i>dc.Creator</i>. Actual MARC records normally define only one of these fields, and in any case Greenstone allows multi-valued metadata.</p>

description6=<p>Some mappings depend on subfields. For example, MARC field 260 contains information about publication and distribution. Subfields "c" (Date of publication) and "g" (Date of manufacture) are mapped to <i>dc.Date</i>, using the following mapping line: <blockquote> 260$c$g -&gt; dc.Date </blockquote> Greenstone also provides a file for mapping MARC to <b>qualified</b> Dublin Core: <a href="_httpsite_/etc/marc2qdc.txt" target=marc2qdc>greenstone/etc/marc2qdc.txt</a>. The MARC plugin uses this file when its <i>-metadata_mapping_file</i> option is set to "marc2qdc.txt".</p>
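# Illustrative sketch only, not copied from the shipped marc2dc.txt: using the line format
# and the field numbers given in description3, description5 and description6, entries in
# the mapping file would look roughly like this:
#   # lines that begin with a hash are comments
#   100 -> dc.Creator
#   110 -> dc.Creator
#   111 -> dc.Creator
#   720 -> dc.Creator
#   260$c$g -> dc.Date
# Selecting the qualified Dublin Core mapping would then amount to a plugin line along
# these lines (the exact collect.cfg syntax is an assumption):
#   plugin MARCPlugin -metadata_mapping_file marc2qdc.txt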