source: trunk/gsdl/docs/makecol.txt@ 917

Last change on this file since 917 was 917, checked in by sjboddie, 24 years ago

added docs directory - Cris makecol.txt doc and TODO list

How to Make a Collection - A Quick Introduction

 Cristian Francu
 [email protected]
 Jan 12, 2000

First, go to the directory where you installed GSDL. To make sure that
you can run the Perl scripts, source either setup.bash or setup.csh,
depending on the shell you're using:

source setup.bash

or

source setup.csh

These scripts set the variables GSDLHOME, GSDLOS and PATH. You can
also include them in .cshrc or .profile to have the variables set
automatically.

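As a quick sanity check after sourcing the setup script, something
like the following could confirm that the variables are in place. This
is only a sketch: the function name check_gsdl_env is made up for
illustration and is not part of GSDL.

```shell
# check_gsdl_env: hypothetical helper, not part of GSDL.
# Succeeds only if GSDLHOME and GSDLOS are set and non-empty.
check_gsdl_env() {
    if [ -z "${GSDLHOME:-}" ]; then
        echo "GSDLHOME is not set; source setup.bash or setup.csh first" >&2
        return 1
    fi
    if [ -z "${GSDLOS:-}" ]; then
        echo "GSDLOS is not set; source setup.bash or setup.csh first" >&2
        return 1
    fi
    echo "GSDLHOME=$GSDLHOME GSDLOS=$GSDLOS"
}

check_gsdl_env || echo "environment not ready"
```
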
Next, run mkcol.pl to create the collection. This Perl script creates
the necessary environment for the collection, such as its directories
and the file collect.cfg. The script mkcol.pl is located in the
directory

bin/script

This directory contains all the scripts that you'll need, so it's a
good idea to take a look at it.

If you run mkcol.pl without arguments, it will tell you how to use it:

$ mkcol.pl

 usage: mkcol.pl [options] collection-name

 options:
  -creator email        Your email address
  -maintainer email     The current maintainer's email address
  -public true|false    If this collection has anonymous access
  -beta true|false      If this collection is still under development

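Putting the usage message together, a call for a collection named
"tutorial" might look like the sketch below. The email address is a
placeholder and the option values are only illustrative. The snippet
prints the command instead of running it, so it can be tried outside a
GSDL installation.

```shell
# Placeholder values; replace them with your own.
collection="tutorial"
creator="someone@example.org"

# Options come first, the collection name last, as in the usage message.
cmd="mkcol.pl -creator $creator -maintainer $creator -public true -beta true $collection"

# Dry run: show the command. To execute it, run the printed line in a
# shell where setup.bash or setup.csh has been sourced.
echo "$cmd"
```
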
After running mkcol.pl the collection will reside in
collect/<collection-name>. The next thing you should do is edit the
file

collect/<collection-name>/etc/collect.cfg

You should do at least two things. The first is to add a line like
this:

collectionmeta iconcollection "http://sequence.rutgers.edu/~gsdl/collect/cstr/images/cstr.jpg"

This line sets the icon of the collection (the image that users will
click to access the collection once it's online). Make sure you type
the full URL of the image between quotes. You should do this now,
because if you want to change the icon later you have to rebuild the
collection, which is a time-consuming operation. Hey, gurus, is there
any simpler way to change the icon of the collection once the
collection is already built?

The second thing you should do in the collect.cfg file is put the
proper plugins on these lines:

plugin GMLPlug
plugin TEXTPlug
plugin ArcPlug
plugin RecPlug

The plugins you need depend on the format of your documents. If the
documents are plain text, or in GSDL's own format, named GML, you
don't need to change anything. If your documents are in other formats
you should look for a suitable plugin in the directory

perllib/plugins

A very useful plugin is HTMLPlug, which can process files with .html
and .htm file extensions. You would normally replace the TEXTPlug
plugin with the one you want to use. Say your collection is in HTML
format; then you would change the plugin lines to:

plugin GMLPlug
plugin HTMLPlug
plugin ArcPlug
plugin RecPlug

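If you prefer not to edit the file by hand, the same swap could be
scripted. This is a minimal sketch that assumes the plugin lines look
exactly like the ones above, with nothing after the plugin name. The
function name swap_text_plugin is made up for illustration, and the
demonstration works on a scratch file rather than a real collect.cfg.

```shell
# swap_text_plugin FILE: hypothetical helper, not part of GSDL.
# Prints FILE with the "plugin TEXTPlug" line replaced by "plugin HTMLPlug".
swap_text_plugin() {
    sed 's/^plugin[[:space:]][[:space:]]*TEXTPlug[[:space:]]*$/plugin HTMLPlug/' "$1"
}

# Demonstration on a scratch file holding the default plugin lines.
cfg=$(mktemp)
printf 'plugin GMLPlug\nplugin TEXTPlug\nplugin ArcPlug\nplugin RecPlug\n' > "$cfg"
swap_text_plugin "$cfg"
rm -f "$cfg"
```

To apply it for real, you could write the output to a new file (for
example collect.cfg.new), check the result, and then move it into
place.
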
You're finally done with collect.cfg. Suppose you are creating a
collection named "tutorial". The next thing you should do is go to the
directory collect/tutorial and create two directories, import and
archives:

cd collect/tutorial
mkdir import
mkdir archives

The material to be indexed should reside in the 'import' directory.
You can either copy it there, or create links to its directory. The
material to be indexed can contain directories and subdirectories. The
building script will go recursively into them and search for files to
be indexed. This is what the plugin RecPlug does.

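The two ways of populating 'import' could look like the sketch below.
It runs in a scratch directory, so it is safe to try. The paths and
file names (docs, reports, report.txt) are placeholders, and it
assumes symbolic links are the kind of link meant above.

```shell
# Scratch area standing in for a GSDL installation (placeholder paths).
work=$(mktemp -d)
mkdir -p "$work/docs/reports"            # pretend source material
echo "sample text" > "$work/docs/reports/report.txt"

col="$work/collect/tutorial"
mkdir -p "$col/import" "$col/archives"

# Option 1: copy the material, keeping its subdirectories.
cp -R "$work/docs" "$col/import/docs-copy"

# Option 2: link to the original directory instead of copying it.
ln -s "$work/docs" "$col/import/docs-link"

# List every regular file reachable from import. The -L option makes
# find follow symbolic links.
find -L "$col/import" -type f | sort
```

Remove the scratch directory ($work) when you are done experimenting.
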
So, the next thing to do is make sure the documents to be indexed are
in the import directory. You are now ready to run the processing
scripts. The fastest way to build a collection is in two steps:

1. process the documents in the 'import' directory and generate their
   equivalents in .gml format in the 'archives' directory

2. process the documents in the 'archives' directory (now in .gml
   format) and create the necessary indexes in the 'building' directory

For the first step just run the script import.pl:

import.pl tutorial

Depending on the size of your documents this might take anywhere from
minutes to hours. You might also want to redirect stdout and stderr to
files to capture any errors. You can also change the verbosity of the
script; just run it without arguments and you'll get a complete list
of options.

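One way to capture the output is a small wrapper like the one below.
The function name run_logged and the log file names are made up for
illustration. The wrapper works with any command, so it is
demonstrated here with echo rather than import.pl.

```shell
# run_logged NAME COMMAND [ARGS...]: hypothetical helper, not part of GSDL.
# Runs COMMAND, sending stdout to NAME.out and stderr to NAME.err,
# and returns the command's exit status.
run_logged() {
    name=$1
    shift
    "$@" > "$name.out" 2> "$name.err"
}

# Demonstration with a harmless command in a scratch directory.
logdir=$(mktemp -d)
run_logged "$logdir/demo" echo "hello"
cat "$logdir/demo.out"
```

Inside a GSDL installation the call would be something like
"run_logged import import.pl tutorial", which leaves import.out and
import.err in the current directory.
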
For the second step run the script buildcol.pl:

buildcol.pl tutorial

Again, depending on the size of the material to be processed, this may
take minutes to hours. Keep in mind that you must have enough space on
your hard drive for both steps, as the .gml documents take up about
the same amount of space as the original documents.

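A rough way to check this beforehand is to compare the size of the
source material with the free space on the same file system. The
sketch below uses the standard du and df tools. The function name
has_room_for_copy is made up, and the check only covers the rule of
thumb above (a .gml copy of about the same size as the originals); the
space needed for the indexes is not accounted for.

```shell
# has_room_for_copy DIR: hypothetical helper, not part of GSDL.
# Succeeds if the file system holding DIR has at least as many free
# kilobytes as DIR currently occupies.
has_room_for_copy() {
    used_kb=$(du -sk "$1" | awk '{print $1}')
    free_kb=$(df -Pk "$1" | awk 'NR == 2 {print $4}')
    echo "used: ${used_kb} KB, free: ${free_kb} KB"
    [ "$free_kb" -ge "$used_kb" ]
}

# Demonstration on a small scratch directory.
scratch=$(mktemp -d)
echo "sample" > "$scratch/doc.txt"
has_room_for_copy "$scratch" && echo "enough room"
rm -rf "$scratch"
```
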
If everything went fine, you should now have a directory named
'building' under collect/tutorial. That directory contains the results
of processing your documents. In order to use them you have to move
the contents of the 'building' directory to a new directory named
'index'. First create it:

cd collect/tutorial
mkdir index

Then move the contents:

mv building/* index

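The two commands above could be wrapped with a couple of sanity
checks, for example refusing to run when 'building' is missing or
empty. This is only a sketch: the function name install_index is made
up, and it is demonstrated on a scratch directory with a dummy file
rather than on real build output.

```shell
# install_index COLLECTION_DIR: hypothetical helper, not part of GSDL.
# Moves the contents of COLLECTION_DIR/building into COLLECTION_DIR/index.
install_index() {
    coldir=$1
    if [ ! -d "$coldir/building" ]; then
        echo "no 'building' directory in $coldir" >&2
        return 1
    fi
    # ls -A prints nothing for an empty directory.
    if [ -z "$(ls -A "$coldir/building")" ]; then
        echo "'building' is empty; did buildcol.pl succeed?" >&2
        return 1
    fi
    mkdir -p "$coldir/index"
    mv "$coldir/building"/* "$coldir/index"/
}

# Demonstration with a dummy file standing in for the real indexes.
demo=$(mktemp -d)
mkdir "$demo/building"
echo "dummy" > "$demo/building/placeholder.idx"
install_index "$demo" && ls "$demo/index"
```
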
As long as your collect.cfg file contains the line

public true

and the collection built successfully, the GSDL software should
automatically notice your new collection. The collection should now
appear on the main page, which can be accessed at:

http://hostname.domain.edu/cgi-bin/library?a=p&p=home

(Replace hostname.domain.edu with the name of your server.)

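A quick way to confirm the setting is to search collect.cfg for it.
The sketch below assumes the line is written as 'public true' with
spaces or tabs between the two words, as shown above. The function
name is_public is made up, and the demonstration uses a scratch file
in place of a real collect.cfg.

```shell
# is_public FILE: hypothetical helper, not part of GSDL.
# Succeeds if FILE contains a "public true" line.
is_public() {
    grep -q '^[[:space:]]*public[[:space:]][[:space:]]*true[[:space:]]*$' "$1"
}

# Demonstration on a scratch file.
cfg=$(mktemp)
printf 'plugin GMLPlug\npublic true\n' > "$cfg"
if is_public "$cfg"; then
    echo "collection is public"
else
    echo "collection is not public"
fi
rm -f "$cfg"
```
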
Keep in mind that these instructions are just a jump start to get you
up and running quickly. There are more options you can use, and you
can explore more of GSDL by reading the documentation carefully. You
can also email the creators for further details.