source: gsdl/trunk/runtime-src/src/oaiservr/README.txt@ 19294

Last change on this file since 19294 was 14280, checked in by xiao, 17 years ago

add comments about how to specify the baseURL in oai.cfg

OAI Server Configuration Instructions
-------------------------------------

Greenstone must be configured before it can serve collections over OAI. This consists of three steps:

1) Server-wide configuration to support vital OAI commands such as Identify.
   This allows the OAI client to obtain information on the maintainer of
   the server, etc. See the OAI documentation for further information on
   the Identify verb. These configurations are kept in gsdl/etc/oai.cfg.

2) Configuring collections to be 'active' OAI collections; just as collections
   in Greenstone can be 'private' or 'public', collections can similarly be
   'visible' or 'invisible' to OAI. By default, collections are NOT served
   via OAI - you have to enable them by adding each collection's name to a
   list in gsdl/etc/oai.cfg.

3) Mapping your collection metadata to a "Namespace" supported by OAI. At
   present, Greenstone supports the Dublin Core metadata format alone in OAI,
   and metadata fields in a Greenstone collection must be mapped into the
   Dublin Core namespace. This configuration can be done either server-wide,
   for all collections, in gsdl/etc/oai.cfg or in the etc/collect.cfg of
   each individual collection.

This documentation explains how to set up Greenstone's configuration for steps 1-3 above, and how to install, deploy and test OAI on your Greenstone system.

Part 1: Server-Wide Configuration Information
---------------------------------------------

Your OAI server should provide some basic information on the collections it holds as a whole. For this purpose, all collections on your server are treated the same: you cannot have different information for each collection. In this section, we describe what information you should provide, and how to configure Greenstone correctly to provide it.

The basic server-wide information is:
 * The OAI repository (server) name - this is a human readable form such
   as "The Research Authority's OAI Server".
 * The central administrative email address for the OAI repository.
 * The base URL for the repository.
 * Which version of OAI the repository supports/delivers.
 * A free text item of information about the OAI repository - e.g. its
   purpose, size, origins, etc.

This information is set by the following configuration items in gsdl/etc/oai.cfg. The syntax rules for these configuration items are the same as for gsdl/etc/main.cfg - e.g. strings containing spaces should be enclosed in quotation marks. See the main Greenstone configuration documentation for further information.

 * repositoryName
   One item only, free text.
 * maintainer
   One item only, as a valid email address.
 * baseURL
   One item only, see the example in the oai.cfg file.
 * httpdomain
   NB: in the gsdlsite.cfg file, NOT oai.cfg.
   Should not require specific configuration if your Greenstone install
   is already operational through your web server.
 * oaiversion
   One item only, which should be either "1.1" or "2.0".
 * oaiinfo
   Numerous items, each usually a label and value pair.
   An example item may be:
     oaiinfo sponsor "The Big Cheese Corporation"

Sample items are already provided in the default oai.cfg file - please ensure that you complete these in your preferred text editor.
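
As a rough illustration of the items listed above, the following Python sketch reads a hypothetical oai.cfg fragment and checks the single-valued items. It is not how Greenstone itself reads the file. It assumes that '#' starts a comment and that double-quoted strings can be split with Python's shlex module, in line with the main.cfg convention mentioned above. Every value in the sample, including the baseURL, is invented for illustration.

```python
import shlex

# Hypothetical oai.cfg fragment; every value below is invented for illustration.
SAMPLE_CFG = """
# server-wide OAI settings
repositoryName "The Research Authority's OAI Server"
maintainer     admin@example.org
baseURL        http://example.org/gsdl/cgi-bin/oaiserver
oaiversion     2.0
oaiinfo        sponsor "The Big Cheese Corporation"
"""

# Items that the text above describes as "one item only" in oai.cfg.
SINGLE_VALUED = ("repositoryName", "maintainer", "baseURL", "oaiversion")


def parse_cfg(text):
    """Return {item name: [list of value tokens, one list per line]}."""
    items = {}
    for line in text.splitlines():
        # Assumption: '#' starts a comment and double quotes group words.
        tokens = shlex.split(line, comments=True)
        if tokens:
            items.setdefault(tokens[0], []).append(tokens[1:])
    return items


def check_server_items(items):
    """Return a list of human-readable problems with the server-wide items."""
    problems = []
    for name in SINGLE_VALUED:
        occurrences = items.get(name, [])
        if len(occurrences) != 1 or len(occurrences[0]) != 1:
            problems.append(f"{name}: expected exactly one value")
    for values in items.get("oaiversion", []):
        if values and values[0] not in ("1.1", "2.0"):
            problems.append('oaiversion: should be "1.1" or "2.0"')
    return problems


items = parse_cfg(SAMPLE_CFG)
problems = check_server_items(items)
print(problems or "server-wide items look complete")
```

A check of this kind is only a convenience before restarting the server; the default oai.cfg shipped with Greenstone remains the reference for the exact form of each item.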


Part 2: Enabling Collections
----------------------------

In addition to the items above, one further sort of item needs to be added to gsdl/etc/oai.cfg - a list of the collections that should be made available to OAI clients. This list is supplied in one or more lines in gsdl/etc/oai.cfg in the form:

  oaicollection OAIDemo HDL

The first item in each line should be the same - "oaicollection" - followed by one or more collection names. Each collection name should be the name of the folder/directory in the gsdl/collect directory which holds the collection.
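
The rule above (one or more "oaicollection" lines, each naming one or more collection directories) can be sketched in Python as follows. This is an illustrative reading of the format, not Greenstone's own parser. The second sample line and the collection name "Photos" on it are hypothetical, as is the helper that compares names against a directory listing.

```python
import shlex

# Hypothetical oai.cfg lines. The first line is the example from the text;
# the second line is invented to show that several lines may be used.
SAMPLE_CFG = """
oaicollection OAIDemo HDL
oaicollection Photos
"""


def oai_collections(text):
    """Collect collection names from every 'oaicollection' line, in order."""
    names = []
    for line in text.splitlines():
        # Assumption: '#' starts a comment, tokens are whitespace-separated.
        tokens = shlex.split(line, comments=True)
        if tokens and tokens[0] == "oaicollection":
            for name in tokens[1:]:
                if name not in names:  # ignore names repeated across lines
                    names.append(name)
    return names


def missing_collections(names, existing_dirs):
    """Return the names that have no matching directory under gsdl/collect."""
    return [name for name in names if name not in existing_dirs]


collections = oai_collections(SAMPLE_CFG)
print(collections)  # ['OAIDemo', 'HDL', 'Photos']
```

In practice the second argument to missing_collections could be the result of os.listdir() on your gsdl/collect directory, which would flag any name in the list that does not match a collection folder.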

Once these items have been added to gsdl/etc/oai.cfg, the final stage of the OAI configuration can begin: mapping from a collection's own metadata fields into the Dublin Core metadata fields used by OAI. This is detailed in the following section of this document.


Part 3: Mapping fields
----------------------

In Greenstone, metadata fields are created by the collection designer. These may follow an accepted standard - e.g. Dublin Core - be a modification or extension of an accepted standard, or a bespoke format specific to the needs of the particular collection. As the Greenstone installer, you should have decided upon one of these three approaches already. You may have chosen one of the standard metadata structures incorporated into the Greenstone Librarian Interface (GLI), or created your own using your own metadata files and automatically extracted information from your document files. Whatever approach you used to obtain the metadata in your collection, and whatever fields you used, the approach to linking these fields (mapping them) to the OAI Dublin Core format is the same.

The mapping of fields can be done in two ways: globally for all collections, or specifically for one collection. The syntax for the configuration items to achieve these different mappings is the same. The file in which they are placed determines whether they apply to all collections or just one. Global mappings are placed with the other OAI configuration items in gsdl/etc/oai.cfg, whilst collection-specific mappings are placed in the etc/collect.cfg file of the collection itself.

Mapping items in the configuration file take the following format:

  oaimapping <greenstone fieldname> <dublin core fieldname>

The syntax of each of the two fieldnames given above requires a little more detail; we will take the Dublin Core fieldname first.

The Dublin Core fieldname should be given in the form "DC:<fieldname>", as OAI can support other metadata formats, and thus we use "DC:" to be clear that we are mapping to Dublin Core. The fieldnames available for Dublin Core can be found on the central Dublin Core website at http://www.dublincore.org/.

The Greenstone fieldname can be in two formats. The simpler one is just the name of the field to map from. The more complex one places the name of the collection, followed by a colon, in front of the field name: e.g. "demo:Title" would mean the metadata field "Title" in the collection called "demo". Using this, all mappings can be stored in the central gsdl/etc/oai.cfg file if you wish, and we recommend this approach.

Let us recap at this point with a few example items for a gsdl/etc/oai.cfg:

oaimapping Author dc.Creator
oaimapping dls.Title dc.Title
oaimapping Photos:fileformat dc.Format

The first item maps any occurrence of 'Author' in any collection to the Dublin Core 'Creator' field (there is no 'Author' field in DC). The second item maps 'dls.Title' to 'dc.Title'. Finally, in the specific collection 'Photos', we will map the unique metadata field 'fileformat' to the Dublin Core 'Format' field.
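
To make the difference between global and collection-specific mappings concrete, here is a small Python sketch that looks up the three example items above. It only illustrates the lookup logic described in this section and is not Greenstone's implementation. In particular, it assumes that a collection-specific mapping takes precedence over a global one and that the first matching global item wins; neither rule is stated in this document.

```python
# The three example items above, as (greenstone fieldname, dublin core fieldname).
MAPPINGS = [
    ("Author", "dc.Creator"),
    ("dls.Title", "dc.Title"),
    ("Photos:fileformat", "dc.Format"),
]


def split_field(greenstone_field):
    """Split 'collection:field' into (collection, field).

    A fieldname without a colon is global, so its collection is None.
    """
    collection, sep, field = greenstone_field.partition(":")
    return (collection, field) if sep else (None, greenstone_field)


def map_field(collection, field, mappings=MAPPINGS):
    """Return the Dublin Core field for a metadata field, or None if unmapped.

    Assumption: a collection-specific item beats a global item, and the
    first global item that matches is used.
    """
    global_match = None
    for source, target in mappings:
        source_collection, source_field = split_field(source)
        if source_field != field:
            continue
        if source_collection == collection:
            return target  # specific to this collection
        if source_collection is None and global_match is None:
            global_match = target  # remember the first global match
    return global_match


print(map_field("demo", "Author"))        # dc.Creator (global item)
print(map_field("Photos", "fileformat"))  # dc.Format (Photos only)
print(map_field("demo", "fileformat"))    # None (no item applies to demo)
```

The last call shows the practical effect of the collection prefix: 'fileformat' is mapped only for 'Photos', so the same field in another collection stays unmapped unless a global item is added.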


Installation and Testing
------------------------