Index: documentation/trunk/tutorials/xml-source/tutorial_en.xml
===================================================================
--- documentation/trunk/tutorials/xml-source/tutorial_en.xml (revision 25997)
+++ documentation/trunk/tutorials/xml-source/tutorial_en.xml (revision 26000)
@@ -1099,5 +1099,5 @@
-Greenstone comes with the PDFPlugin which can handle older versions of PDF, but can't cope by default with newer PDF files. However, a Greenstone extension making use of PDFBox, an open-source PDF conversion tool, is available if you want Greenstone to extract text from more recent PDF files. This tutorial will cover how to install the PDFBox extension for Greenstone and how to switch on its functionality in the Greenstone Librarian Interface.
+By default the PDFPlugin can process PDF versions 1.4 and older. The PDFBox extension for Greenstone allows text from more recent PDF files to be extracted. The extension uses PDFBox, an open-source PDF conversion tool. This tutorial will cover how to install the PDFBox extension for Greenstone and how to switch on its functionality in the Greenstone Librarian Interface to process text from newer versions of PDF.Obtaining and installing the PDFBox extension for Greenstone
@@ -3196,86 +3196,5 @@
-
-
-Downloading over OAI
-
-
-
-
-
-The previous exercise did not obtain the data from an external OAI-PMH server. This missing step is accomplished either by running a command-line program or by using the panel in the Librarian Interface. This exercise shows you how to do this using both methods.
-
-
-Downloading using the Librarian Interface
-
-
-In the Librarian Interface, switch to the panel. Select from the list of download types on the left hand side.
-
-
-In the box, type in the following URL:
-http://rocky.dlib.vt.edu/~jcdlpix/cgi-bin/OAI/jcdlpix.pl
-
-
-We want to download the documents as well as the metadata, so tick the checkbox.
-
-
-If your computer is behind a firewall or proxy server, you will need to edit the proxy settings in the Librarian Interface. Click the button. Switch on the checkbox. Enter the proxy server address and port number in the and boxes. Click .
-
-
-Now click . If you have set proxy information in , a popup will ask for your user name and password. Once the download has started, a progress bar appears in the lower half of the panel that reports on how the downloading process is doing.
-
-
-Downloaded files are stored in a top-level folder called that appears on the left-hand side of the panel. These files can then be added to a collection.
-
-
-Downloading using the command line
-
-
-For command line downloading to work, your computer must have a direct connection to the Internet—being behind a firewall may interfere with the ability to download the information. You will need to use the Librarian Interface for downloading if you are behind a firewall.
-
-
-Close the Librarian Interface.
-We will work with the OAI collection used in exercise . You may have noticed that its internal name is .
-
-
-In a text editor (e.g. WordPad), open the collection's configuration file, which is in Greenstone → collect → oaiservi → etc → collect.cfg. Add the following line (all on one line):
-acquire OAI -src http://rocky.dlib.vt.edu/~jcdlpix/cgi-bin/OAI/jcdlpix.pl -getdoc
-Although the position of this line is not critical, we recommend that you place it near the beginning of the file, after the public and creator lines but before the index line. Save the file and quit the editor.
-
-
-Delete the contents of the collection's import folder. This contains the canned version of the collection files, put there during the previous exercise. Now we want to witness the data arriving anew from the external OAI server.
-
-
-Open a DOS window to access the command-line prompt. This facility should be located somewhere within your menu, but details vary between different Windows systems. If you cannot locate it, select and enter cmd in the popup window that appears.
-
-
-In the DOS window, move to the home directory where you installed Greenstone. This is accomplished by something like:
-cd C:\Program Files\Greenstone
-
-
-Type:
-setup.bat
-to set up the ability to run Greenstone command-line programs.
-
-
-Change directory into the folder containing the OAI Services Provider collection you built in the last exercise.
-cd collect\oaiservi
-
-Even though the collection name used capital letters the directory generated by the Librarian Interface is all lowercase.
-
-
-
-Run:
- perl -S importfrom.pl oaiservi
-
-Greenstone will immediately set to work and generate a stream of diagnostic output. The importfrom.pl program connects to the OAI data provider specified in collection configuration file (it does this for each "acquire" line in the file) and exports all the records on that site.
-
-
-
-The downloaded files are saved in the collection's import folder. Once the command is finished, everything is in place and the collection is ready to be built. Confirm you have successfully acquired the OAI records by rebuilding the collection.
-
-
-
-
+Setting up your Greenstone OAI Server
@@ -3285,11 +3204,11 @@
-Greenstone 2 collections are not enabled for OAI out of the box. To make a collection available to serve up over OAI, some minor adjustments need to be made first.
-This tutorial will look at how to make an existing collection available over OAI and how to get it validated against the Open Archives validator.
+Greenstone 2 collections are not enabled for OAI out of the box. To make a collection available for serving up over OAI, some minor adjustments need to be made first.
+This tutorial will look at how to make an existing collection available over OAI and testing its accessibility by getting it validated against the Open Archives validator.Use a text editor to open the file etc/oai.cfg located in your Greenstone installation folder. The oai.cfg configuration file contains properties that control the behaviour and features of your Greenstone OAI server.
-The basic properties to edit in order to get your collection served by the inbuilt OAI server are the , and . Look up these properties in the file.
-For and , type in some values that make sense for your digital library. For example:
+The basic properties to edit in order to get your collection served by the inbuilt OAI server are the repositoryName, repositoryID and oaicollection. Look up these properties in the file.
+For repositoryName and repositoryID, type in some values that make sense for your digital library. For example:repositoryName "Greenstone"
repositoryID "greenstone"
@@ -3308,40 +3227,83 @@
Press the button and you will end up on your Digital Library home page as usual. Adjust the URL so that instead of the suffix, it says .The page that loads now will contain an error message () saying that you've provided an illegal OAI verb. This is because the OAI specification requires you to provide more instruction in the URL as to what you want. The specification defines verbs and possible arguments to them.
-A basic verb is , which requests the OAI server to return some information about the OAI repository it's serving. Adjust the URL once more by suffixing so that your URL now looks like:
+A basic verb is , which requests the OAI server to return some information about the OAI repository that it's serving. Adjust the URL once more by suffixing , so that your URL now looks like:http://<domain>/greenstone/cgi-bin/oaiserver.cgi?verb=IdentifyVisiting this page now gives some information about your Greenstone OAI repository.
-Although the data transmitted over OAI is in the form of XML, Greenstone uses a stylesheet to transform that XML response into a user-friendly, structured web page you see when you perform the request (thereby visiting the response page). This allows and other verbs in the OAI specification to be shown in the main Greenstone OAI Server pages as link buttons. You can see these in the main Greenstone (or ) page, as a row of links starting with "Identify" at the top and in the lower end of the page.
+Although the data transmitted over OAI is in the form of XML, Greenstone uses a stylesheet to transform that XML response into a user-friendly, structured web page that you see when you perform the request (as happens when you visit the response page). This allows and other verbs in the OAI specification to be shown in the main Greenstone OAI Server pages as link buttons. You can see these verbs represented in the main Greenstone (or ) page as a row of links, starting with "Identify" at the top and in the lower end of the page.Clicking on the links will execute that verb as a request and return the response from your Greenstone OAI server as a structured web page. Try clicking on all the links.OAI defines a concept called a . In Greenstone, the OAI Set concept is mapped to the practical Greenstone collection. The link to the verb will therefore request the Greenstone OAI server to list all the collections that have been enabled for OAI.
-Click on the ListSets button link and have a look.
-The response page for the verb will show you that your backdrop collection is one of the collections available over OAI in your Greenstone repository.
-
-
-You will see a couple of buttons next to each collection (or ) listed here. The first is Identifiers and the second Records. Click on the Identifiers button for the backdrop Set. This will list all the IDs of the documents contained in your OAI collection. If you look at the IDs, they look similar enough to Greenstone's internal document IDs, but with an additional prefix (oai:<repositoryID>:setname, where was set by you in the oai.cfg configuration file).
-
-
-Click the browser Back button to get back to the ListSets page and press the Records button located next to the backdrop collection created in A Simple image collection tutorial.
-As you would have specified some Dublin Core (dc) metadata for some of the images in the backdrop collection, the page that loads will display this information for each document in the collection (Set).
-Greenstone's OAI at present supports 3 metadata formats, as is explained in the comments in the oai.cfg file. Of these three, the OAI standard for Dublin Core, , is the one pertinent to this tutorial. If your collection specifies metadata for a different metadata set format, you can use the oai.cfg file to tell Greenstone how to map the metadata fields of your chosen metadata set format into the Dublin Core metadata set supported by the Greenstone OAI server (or one of the other metadata sets it supports).
-Look in the oai.cfg file again and scroll down to the section on , which will explain and provide examples for how to specify such mappings from your metadata format to one that Greenstone's OAI server uses. For instance, the demo collection comes enabled for OAI upon installation, and specifies some mappings from its metadata format to . Its metadata is mapped using the following line in the oai.cfg configuration file:
+Click on the ListSets link and have a look.
+The response page for the verb will show you that your backdrop collection (created in the Simple image collection tutorial) is one of the collections available over OAI in your Greenstone repository.
+
+
+You will see a couple of buttons next to each collection (or ) listed here. The first is Identifiers and the second Records. Click on the Identifiers button for the backdrop Set. This will list all the IDs of the documents contained in your OAI collection. If you look at the IDs, they look similar enough to Greenstone's internal document IDs, but with an additional prefix (oai:<repositoryID>:<setname>, where repositoryID was set by you in the configuration file, and setname is the name of the collection).
+
+
+Click the browser Back button to get back to the ListSets page and press the Records button located next to the backdrop collection.
+If you specified some Dublin Core (dc) metadata for each images in the backdrop collection, then the page that loads will display this information for each document in the collection (Set).
+Greenstone's OAI at present supports 3 metadata formats, as is explained in the instructive comments in the oai.cfg file. Of these three, the OAI standard for Dublin Core, , is the one pertinent to this tutorial. If your collection specifies metadata for a different metadata set format, you can use the oai.cfg file to tell Greenstone how to map the metadata fields of your chosen metadata set format into the Dublin Core metadata set supported by the Greenstone OAI server (or one of the other metadata sets it supports).
+Look in the oai.cfg file again and scroll down to the section on oaimapping, which will explain and provide examples for how to specify such mappings from your metadata format to one that Greenstone's OAI server uses. For instance, the demo collection comes enabled for OAI upon installation, and specifies some mappings from its metadata format to . Its metadata is mapped to using the following line in the oai.cfg configuration file (note the use of case):oaimapping dls.Title oai_dc.titleBecause the backdrop collection uses DC metadata already, no mapping is required.
+
+Validating the Greenstone OAI server
+
+
+In this section, you'll be testing that you've set up your Greenstone OAI server correctly so that it's accessible over OAI. For this part of the exercise, you need to be on a networked computer and your host computer needs to be visible to the outside world. (That is, when you provide the full name of your computer, someone else in the world should be able to find that computer by typing its URL into their browser's address field.)
+
+
+We'll be using an external OAI client to access our up-and-running Greenstone OAI server. It's not just any OAI client either, but an OAI Server validator.
+
+
+You will want to be running the included Apache web server. So if you're on Windows and using the Local Library Server, quit it and rename the application in your Greenstone installation folder to server.not. Then use the menu shortcut to the Greenstone Server once more, to now launch the Apache web server.
+
+
+For this exercise, we will be visiting the Open Archives Validator, for which your OAIserver needs to provide a valid email address. In a text editor, open up your greenstone installation's etc/oai.cfg file and set the value of the maintainer field to your email address.
+Note that by default, your Greenstone installation will make the demo collection available over OAI. This collection has been set up with a dummy (and invalid) email address for the creator and maintainer fields in the collection's collect.cfg file. You will need to open up collect/demo/etc/collect.cfg and clear the email values for the creator and maintainer properties (or else set these to a valid email again). Otherwise the OpenArchives validator will resort to using the demo collection's default dummy email to send the initial validation results to. Alternatively, you can simply remove the demo collection from being listed in the oai.cfg file's oaicollection property, which will cease to make the demo collection available over OAI.
+Note also that, if you wish to specify contact emails at a collection level, you will need to edit your greenstone installation's collect/<collection-name>/etc/collect.cfg file for those collections and set the creator and maintainer fields to the desired email address.
+
+
+If your collection contains document items for which you have not assigned any (Dublin Core, dc) metadata, the OAI validation can fail because it is dependent on having Metadata Formats listed even on a per record (per document) basis. Therefore, if your document has no dc metadata assigned, Greenstone won't know what OAI-supported metadata format is used by that document in order to list it.
+In practice, this means that you either have to assign one or more dc.* metadata to each document in your OAI collection, or you will have to set up an oaimapping in the oai.cfg file to map existing metadata of whichever format to dc.* metadata.
+For instance, if you created an image collection without assigning any metadata and are happy to use the Title or Source metadata that Greenstone extracted for each image (, ) as the image document's "title", you could map either of these metadata to in the file oai.cfg. To do so, you'd open up oai.cfg in an editor, go down to the section specifying the oaimapping properties and add a new line:
+oaimapping Title oai_dc.title
+(Or: oaimapping SourceFile oai_dc.title).
+This step will not be not necessary for the backdrop collection if you had assigned any dc.* metadata for each image in the collection.
+Note: If the demo collection that comes with a Greenstone installation is not built, it will either need to be built before submitting your OAI server for inspection by the Open Archives validator, or you will need to adjust the oai.cfg file once more by removing the mention of demo from the oaicollection property. This is because the demo collection is mentioned as being set up for OAI in the oai.cfg file. However, if this collection is unbuilt, it will not be accessible to the OAI validator and so your oaiserver may fail tests due to this oversight.
+
+
+If you are working with legacy collections (built before Greenstone version 2.85) you may have to rebuild them if you plan to make them available over OAI and be compliant with the Open Archives validator. Rebuilding old collections will recalculate the value for the repository. This calculation is different from Greenstone 2.85 onwards.
+
+
+Next you will need to set up your Greenstone server to be accessible from outside, so that external OAI clients can access it.
+Go to the File → Settings menu of your Greenstone server interface dialog and check the option and also check the option (or the option) as its address resolution method.
+
+
+Press the button in the Greenstone Server Interface dialog that says (or it may say ). Your Digital Library home page will open up in a browser tab. Adjust this URL to have a suffix of oaiserver.cgi in place of the terminating library.cgi, then copy the resulting URL and visit http://www.openarchives.org/Register/ValidateSite.
+
+
+The Open Archives Validator page will request the URL to your Greenstone OAI server. Paste the URL you have in your copy buffer into the field provided for this, and press the Validate baseURL button to start running the tests. You will be told to check your email to continue the remaining tests and get the validation report.
+If the validator does not recognise the URL, make sure you have given the full domain of your host machine rather than just the host name. Alternatively, visit the page again and check that works. If it doesn't, maybe your machine is not set up to be accessible to outside networks. Check your proxy settings, make sure you've set up port forwarding and that your firewall is not interfering.
+
-
+
-Connecting to an OAI server from GLI
+Downloading over OAI
-
-
+
+
-GLI can serve as an OAI client application: it can connect to a remote OAI server and retrieve metadata, even download documents. In the previous tutorial, we set up the Greenstone's OAI server and set up the backdrop collection to be served over OAI. In this tutorial we will use GLI to connect to that OAI server and download OAI metadata for the A Simple image collection and even download its documents.
-
+GLI can serve as an OAI client application: it can connect to a remote OAI server and retrieve metadata, even download documents. The tutorial did not obtain the data from an external OAI-PMH server. This missing step is accomplished either by running a command-line program or by using the panel in the Librarian Interface. This exercise explains you would do this using both methods. In the previous exercise, we set up the Greenstone server to serve the Simple image collection (backdrop) over OAI. In this tutorial, we will use GLI to connect to that OAI server and download OAI metadata for the Simple image collection and even download its documents. The principle is the same if you wish to connect to other OAI servers.
+
+
+Downloading using the Librarian Interface
+Launch GLI. This should launch the Greenstone server as well, if this is not already running, so that the OAI server is also up and running.
@@ -3351,76 +3313,76 @@
-On the right, set the Source URL field to contain the URL to your Greenstone OAI server. It would be of the form
+On the right, set the field to contain the URL to your Greenstone OAI server. It would be of the formhttp://<hostname:portnumber>/greenstone/cgi-bin/oaiserver.cgi
-Make sure that you can generally access this URL from your browser.
-
-
-If at this stage you press the button (in the central row of buttons), a dialog will pop up with basic details about the OAI server. At the end, it will diplay the names of the sets available at the OAI Server. In our example, would be listed as one of the setNames.
-
-
-Tick the checkbox as well as the checkbox. For the latter, type backdrop for the set name. Then tick , and add jpg to the list of comma separated values for it so that it becomes
+Make sure that you can generally access this URL from your browser.
+
+
+If your computer is behind a firewall or proxy server, you will need to edit the proxy settings in the Librarian Interface. Click the button. Switch on the checkbox. Enter the proxy server address and port number in the and boxes. Click to get back to the section of the panel.
+
+
+If at this stage you were to press the button (in the central row of buttons), a dialog will pop up with basic details about the OAI server. At the end, it will diplay the names of the sets available via that OAI Server. In our example, (the Simple Image collection) would be listed as one of the setNames.
+
+
+Tick the checkbox as well as the checkbox. For the latter, type for the name. Then tick , and add to the list of comma separated values for it so that it becomesjpg,doc,pdf,ppt
-Next, tick and set it to 10. There will be 9 images in the collection, so we don't really need to set the Max records value, but this is a helpful feature that you can use when downloading from an OAI server.
-
-
-Finally, press the button that's located beside the button. GLI will start downloading oai metadata. Moreover, because we have ticked the checkbox, it will also be retrieving actual documents, but not more than 10, because of the limit of 10 that we've placed on the number of records to download.
-
-
-After a while, it will have finished downloading. Change to the panel, and on the left-hand side, open up the folder. This is where Greenstone stores files you downloaded using the panel. In this case, it will contain a folder wherein the oai metadata files and images that you've just downloaded from your own Greenstone OAI server is stored.
-
-
-You can now drag and drop these downloaded files into a new Greenstone collection. Because there are *.oai files among them, GLI will offer to add the . Accept, and go to the section of the panel. There, you will find at the end of your plugin list. Select it and press the button so that it is listed above the . Because appears earlier in the plugin pipeline, it processes the metadata in the oai files, rather than letting the more general process their contents.
-
-
-Move onto the panel and press the build button. During this stage, the will extract the metadata in the oai files and attach them to the associated jpg file. You can see this once the collection has been built, by switching to the panel and clicking on an oai file, as no metadata is set for such files. If you then click on a jpg file and scroll down, there will be metadata names that start with ex.dc. This refers to Greenstone-extracted Dublin Core metadata. and will be set to the values you had assigned the images in the tutorial A Simple Image Collection. Greenstone will have added additional ex.dc metadata in the form of , which is the source URL for this image.
-
-
-If you wish, you can now set up this collection in a manner similar to how the backdrop collection was set up in A Simple Image Collection. Don't forget to copy any specific format statements, then rebuild it and Preview the collection.
-
-
-
-
-
-Connecting to the Greenstone OAI server from the outside world
-
-
-
-
-
-For this exercise, you need to be on a networked computer and your host computer needs to be visible to the outside world.
-(That is, when you provide the full name of your computer, someone else in the world should be able to find that computer by typing its URL into their browser's address field.)
-
-
-For now though, we proceed to using an external OAI client to access our up-and-running Greenstone OAI server. It's not just any OAI client either, but an OAI Server validator.
-
-
-You will want to be running the included Apache web server. So if you're on Windows and using the Local Library Server, quit it and rename the application in your Greenstone installation folder to server.not. Then use the menu shortcut to the Greenstone Server once more, to now launch the Apache web server.
-
-
-For this exercise, we will visit the Open Archives Validator, for which your OAIserver needs to provide a valid email address. In a text editor, open up your greenstone installation's etc/oai.cfg file and set the value of the field to your email address.
-Note that by default, your Greenstone installation will make the demo collection available over OAI. This collection has been set up with a dummy (and invalid) email address for the and fields in the collection's collect.cfg file. You will need to open up collect/demo/etc/collect.cfg and clear the email values for the and properties (or else set these to a valid email again). Otherwise the OpenArchives validator will resort to using the demo collection's default dummy email to send the initial validation results to. Alternatively, you can simply remove the demo collection from being listed in the oai.cfg file's oaicollection property, which will cease to make the demo collection available over OAI.
-Note also that, if you wish to specify contact emails at a collection level, you will need to edit your greenstone installation's collect/<collection-name>/etc/collect.cfg file for those collections and set the and fields to the desired email address.
-
-
-If your collection contains document items for which you have not assigned any (Dublin Core, dc) metadata, the OAI validation can fail because it is dependent on having Metadata Formats listed even on a per record (per document) basis. And if your document has no dc metadata assigned, Greenstone won't know what OAI-supported metadata format is used by that document in order to list it.
-In practice, this means that you either have to assign one or more dc.* metadata to each document in your OAI collection, or you will have to set up an oaimapping in the oai.cfg file to map existing metadata of whichever format to dc.* metadata.
-For instance, if you created an image collection without assigning any metadata and are happy to use the Title or Source metadata that Greenstone extracted for each image (, ) as the image document's "title", you could map either of these metadata to in the file oai.cfg. To do so, you'd open up oai.cfg in an editor, go down to the section specifying the oaimapping properties and add a new line:
-oaimapping Title oai_dc.title
-(Or: oaimapping SourceFile oai_dc.title).
-This step is not necessary for the backdrop collection, since each image in the collection was assigned some dc.* metadata.
-
-
-If you are working with legacy collections (built before Greenstone version 2.85) you may have to rebuild them if you plan to make them available over OAI and compliant with the Open Archives validator. Rebuilding old collections will recalculate the for the repository. This calculation is different from Greenstone 2.85 onwards.
-
-
-Next you will need to set up your Greenstone server to be accessible from outside, so that external OAI clients can access it.
-Go to the File → Settings menu of your Greenstone server interface dialog and check the option and also check the option (or the option) as its address resolution method.
-
-
-Press the button in the Greenstone Server Interface dialog that says (or it may say ). Your Digital Library home page will open up in a browser tab. Adjust this URL to have a suffix of oaiserver.cgi in place of the terminating library.cgi, then copy the resulting URL and visit http://www.openarchives.org/Register/ValidateSite.
-
-
-The Open Archives Validator page will request the URL to your Greenstone OAI server. Paste the URL you have in your copy buffer into the field provided for this, and press the Validate baseURL button to start running the tests. You will be told to check your email to continue the remaining tests and get the validation report.
-If the validator does not recognise the URL, make sure you have given the full domain of your host machine rather than just the host name. Alternatively, visit the page again and check that works. If it doesn't, maybe your machine is not set up to be accessible to outside networks. Check you proxy settings, make sure you've set up port forwarding and that your firewall is not interfering.
+Next, tick and set it to 10. There will be 9 images in the collection, so we don't really need to set the Max records value, but this is a helpful feature that you can use when downloading from an OAI server.
+
+
+Finally, click , located beside the button. If you have set proxy information in , a popup will ask for your user name and password. Once the download has started, a progress bar appears in the lower half of the panel that reports on how the downloading process is doing. GLI will download oai metadata and, because we have ticked the checkbox, it will also be retrieving actual documents, but not more than 10, because of the limit of 10 that we've placed on the number of records to download.
+
+
+After a while, it will have finished downloading. Change to the panel, and on the left-hand side, open up the folder. This is where Greenstone stores files you downloaded using the panel. In this case, it will contain a folder wherein the oai metadata files and images that you've just downloaded from your own Greenstone OAI server is stored. These files can then be added to a collection, as will be covered later in this tutorial.
+
+
+Downloading using the command line
+
+
+For command line downloading to work, your computer must have a direct connection to the Internet—being behind a firewall may interfere with the ability to download the information. You will need to use the Librarian Interface for downloading if you are behind a firewall.
+
+
+Close the Librarian Interface.
+
+
+Open a DOS window to access the command-line prompt. This facility should be located somewhere within your menu, but details vary between different Windows systems. If you cannot locate it, select and enter cmd in the popup window that appears.
+
+
+Before you start, you must set up your Greenstone environment in the terminal. In the DOS window, move to the home directory where you installed Greenstone. This is accomplished by something like:
+cd C:\Program Files\Greenstone
+
+
+Type:
+setup.bat
+to set up the ability to run Greenstone command-line programs. On Linux/Mac, you would run source setup.bash.
+
+
+GLI uses a perl script, downloadfrom.pl, to do the downloading. This can be run on the command line, outside of GLI.
+
+
+The script can download using several different protocols. These are specified using the option. To see the available options for download mode, run perl -S downloadfrom.pl -h. This shows that the current options are: . For OAI downloading, use -mode OAI
+
+
+To see the options for downloading using the OAI mode, you can run perl -S downloadinfo.pl OAIDownload. The options are the same as you can see in the GLI OAI download panel.
+
+
+We'll use the set and max_records OAI Download options to download a maximum of 5 OAI records from the backdrop collection at your Greenstone's OAI server, which was made available over OAI as a in the previous tutorial:
+perl -S downloadfrom.pl -mode OAI -url http://<hostname:portnumber>/greenstone/cgi-bin/oaiserver.cgi -set backdrop -max_records 5
+The records (and optionally documents) will be downloaded into the folder where the downloadfrom.pl script is run from. To change this, use the -cache_dir full-path-to-folder option and set its value to the full path of the destination folder you choose.
+
+You can import the downloaded documents into a new Greenstone collection and build them in the usual manner.
+
+Building the downloaded documents in GLI
+
+
+If you used GLI to download documents over OAI, as seen in the first part of the tutorial, you can find the downloaded items in the folder in the filesystem view on the left side of the panel.
+If you used the command line to download documents, the downloaded files will be stored wherever you ran the script from.
+
+
+Open GLI, locate files you downloaded over OAI and drag and drop these into a new Greenstone collection called . Because there are *.oai files among those downloaded, GLI will offer to add the .
+
+
+Switch to the panel and press the build button. During this stage, the will extract the metadata in the files and attach them to the associated files of the downloaded backdrop collection. You can see this once the collection has been built by switching to the panel and clicking on an oai file, as no metadata is set for such files. However, if you then click on a jpg file and scroll down, there will be metadata names that start with ex.dc. This refers to Greenstone-extracted Dublin Core metadata. and will be set to the values you had assigned the images in the tutorial A Simple Image Collection. Greenstone will have added additional ex.dc metadata in the form of , which is the source URL for this image.
+
+
+If you wish, you can now set up this collection in a manner similar to how the backdrop collection was set up in . Don't forget to copy in any specific format statements, then rebuild and preview the collection.