Changeset 19690 for documentation/trunk

Timestamp:
31.05.2009 10:50:42
Author:
kjdon
Message:

a few changes to dspace_to_geenstone, advanced_scanned_image, multimedia_collection_build tutorials. got rid of text for diff gs versions - now just applies to 2.82. visual changes only, ie untested :-)

Files:
1 modified

  • documentation/trunk/tutorials/xml-source/tutorial_en.xml

    r19680 r19690  
    24432443<Text id="0618a">We now clean up the <AutoText key="coredm::_Global:labelTitle_"/> browser.</Text> 
    24442444</Comment> 
    2445 <Comment> 
    2446 <Text id="0621">We are going to use the <AutoText text="removesuffix"/> option of a classifier, and to do this we must put the Librarian Interface into a different mode in versions before 2.82. Since Greenstone 2.82, the interface supports three levels of user: <AutoText key="glidict::Preferences.Mode.Assistant"/>, who can add documents and metadata to collections, and create new ones whose structure mirrors that of existing collections; <AutoText key="glidict::Preferences.Mode.Librarian"/>, who can, in addition, design new collections, but cannot use specialist IT features (e.g. regular expressions); and <AutoText key="glidict::Preferences.Mode.Expert"/>, who can perform all functions. For Greenstone versions before 2.82, there is another user type between <AutoText key="glidict::Preferences.Mode.Librarian"/> and <AutoText key="glidict::Preferences.Mode.Expert"/>&mdash;<AutoText text="Library Systems Specialist"/>, who can use all design features, but cannot perform troubleshooting tasks (e.g. interpreting debugging output from Perl programs).</Text> 
    2447 </Comment> 
    2448 <Comment> 
    2449 <Text id="0622a">For Greenstone 2.82, skip the next step.</Text> 
    2450 <Text id="0622">Now we switch to <AutoText text="Library Systems Specialist"/> mode.</Text> 
    2451 </Comment> 
    2452 <NumberedItem> 
    2453 <Text id="0624">To switch modes, click <Menu><AutoText key="glidict::Menu.File"/> &rarr; <AutoText key="glidict::Menu.File_Options"/> &rarr; <AutoText key="glidict::Preferences.Mode"/></Menu> and change to <AutoText text="Library Systems Specialist"/>. Note from the description that appears that you need to be able to formulate regular expressions to use this mode fully. That is what we do below.</Text> 
    2454 </NumberedItem> 
    2455 <NumberedItem> 
    2456 <Text id="0635">Next we return to our <AutoText key="coredm::_Global:labelTitle_"/> browser and clean it up. The aim is to amalgamate variants of titles by stripping away extraneous text. For example, we would like to treat <AutoText text="ANTHOLOGY 1" type="quoted"/>, <AutoText text="ANTHOLOGY 2" type="quoted"/> and <AutoText text="ANTHOLOGY 3" type="quoted"/> the same for grouping purposes. To achieve this:</Text> 
     2445<NumberedItem> 
     2446<Text id="0621">We are going to use the <AutoText text="removesuffix"/> classifier option. The aim is to amalgamate variants of titles by stripping away extraneous text. For example, we would like to treat <AutoText text="ANTHOLOGY 1" type="quoted"/>, <AutoText text="ANTHOLOGY 2" type="quoted"/> and <AutoText text="ANTHOLOGY 3" type="quoted"/> the same for grouping purposes. To achieve this:</Text> 
    24572447<BulletList> 
    24582448<Bullet> 
     
    24662456<Text id="0638"><b>Build</b> the collection and <b>preview</b> the result. Observe how many more times similar titles have been amalgamated under the same bookshelf. Test your understanding of regular expressions by trying to rationalize the amalgamations. (Note: <AutoText text="[[:punct:]]" type="italics"/> stands for any punctuation character.) The icons beside the Word and PDF documents are not the correct ones, but that will be fixed in the next format statement.</Text> 
    24672457</NumberedItem> 
    2468 <Comment> 
    2469 <Text id="0639">The previous exercise was done in <AutoText text="Library Systems Specialist"/> mode (for versions before 2.82) because it requires the use of regular expressions, and now the <AutoText text="Library Systems Specialist"/> mode doesn't exist anymore and regular expressions can be edited in <AutoText key="glidict::Preferences.Mode.Librarian"/> mode.</Text> 
    2470 </Comment> 
    24712458<Comment> 
    24722459<Text id="0640">One powerful use of regular expressions in the exercise was to clean up the <AutoText key="coredm::_Global:labelTitle_"/> browser. Perhaps the best way of doing this would be to have proper title metadata. The metadata extracted from HTML files is messy and inconsistent, and this was reflected in the original <AutoText key="coredm::_Global:labelTitle_" type="italics"/> browser. Defining proper title metadata would be simple but rather laborious. Instead, we have opted to use regular expressions in the <AutoText text="AZCompactList"/> classifier to clean up the title metadata. This is difficult to understand, and a bit fiddly to do, but if you can cope with its idiosyncrasies it provides a quick way to clean up the extracted metadata and avoid having to enter a large amount of metadata.</Text> 
     
    28392826</NumberedItem> 
    28402827<NumberedItem> 
    2841 <Text id="sc25b">For versions before 2.82, you need to change the mode in the Librarian Interface to <AutoText text="Library Systems Specialist"/> (or <AutoText key="glidict::Preferences.Mode.Expert"/>) mode (using <Menu><AutoText key="glidict::Menu.File"/> &rarr; <AutoText key="glidict::Menu.File_Options"/> &rarr; <AutoText key="glidict::Preferences.Mode"/></Menu>), because you will need to change the order of plugins, and use regular expressions in the plugin options.</Text> 
    2842 <Comment> 
    2843 <Text id="ep-23c-1">For version 2.71, you'll need to close GLI now then restart it to get the list of plugins to update properly.</Text> 
    2844 </Comment> 
    2845 </NumberedItem> 
    2846 <NumberedItem> 
    28472828<Text id="sc25c">Go to the <AutoText key="glidict::CDM.GUI.Plugins"/> section of the <AutoText key="glidict::GUI.Design"/> panel, and add a new <AutoText text="PagedImagePlugin"/> plugin. Enable the <AutoText text="create_screenview"/> option, set the <AutoText text="documenttype"/> option to <AutoText text="hierarchy"/> and set the <AutoText text="process_exp"/> option to <AutoText text="xml.*\.item$"/>.</Text> 
    28482829</NumberedItem> 
     
    31203101<Content> 
    31213102<NumberedItem> 
    3122 <Text id="0761">First, change to <AutoText text="Library Systems Specialist"/> (or <AutoText key="glidict::Preferences.Mode.Expert"/>) mode (using <Menu><AutoText key="glidict::Menu.File"/> &rarr; <AutoText key="glidict::Menu.File_Options"/></Menu>), because you will need to change the order of plug-ins in the <AutoText key="glidict::GUI.Design"/> panel.</Text> 
    3123 </NumberedItem> 
    3124 <NumberedItem> 
    3125 <Text id="0763">Start a <b>new collection</b> called <b>StoneD</b> and fill out its fields appropriately. Leave the metadata set at Dublin Core, the default.</Text> 
    3126 </NumberedItem> 
    3127 <NumberedItem> 
    3128 <Text id="0765">Now add <AutoText text="DSpacePlugin" />. Leave the plugin options at their defaults and press <AutoText key="glidict::General.OK" type="button"/>.</Text> 
     3103<Text id="0763">Start a <b>new collection</b> called <b>StoneD</b> and fill out its fields appropriately. </Text> 
     3104</NumberedItem> 
     3105<NumberedItem> 
     3106<Text id="0765">In the <AutoText key="glidict::GUI.Design"/> panel add <AutoText text="DSpacePlugin" />. Leave the plugin options at their defaults and press <AutoText key="glidict::General.OK" type="button"/>.</Text> 
    31293107</NumberedItem> 
    31303108<NumberedItem> 
     
    31473125</Comment> 
    31483126<NumberedItem> 
    3149 <Text id="0773">Select <AutoText text="DSpacePlugin" /> and click <AutoText key="glidict::CDM.PlugInManager.Configure" type="button"/>. Switch on its configuration option <AutoText text="first_inorder_ext"/>. Set its value to <AutoText text="pdf,doc,rtf" type="quoted"/> in the popup window that appears and press <AutoText key="glidict::General.OK" type="button"/>.</Text> 
     3127<Text id="0773">Select <AutoText text="DSpacePlugin" /> and click <AutoText key="glidict::CDM.PlugInManager.Configure" type="button"/>. Switch on its configuration option <AutoText text="first_inorder_ext"/>. Set its value to <AutoText text="pdf,doc,rtf" type="quoted"/>.</Text> 
    31503128</NumberedItem> 
    31513129<NumberedItem> 
     
    31603138<Text id="0776">The DSpace exported files contain Dublin Core metadata for title and author (amongst other things).</Text> 
    31613139<NumberedItem> 
    3162 <Text id="0778">In the <AutoText key="glidict::GUI.Design"/> panel, select <AutoText key="glidict::CDM.GUI.Indexes"/>. Delete the <AutoText key="metadata::ex.Title"/> and <AutoText key="metadata::ex.Source"/> indexes, and add one for <AutoText key="metadata::dc.Title"/> and another for <AutoText key="metadata::dc.Contributor"/>. Rename the <AutoText key="metadata::dc.Contributor"/> index by going to the <AutoText key="glidict::CDM.GUI.SearchMetadata"/> section in the <AutoText key="glidict::GUI.Format"/> panel, select this index and change its value to "authors". Greenstone 2.82 automatically detects index names, so <AutoText key="metadata::dc.Title"/> index will be named as <AutoText key="coredm::_Global:labelTitle_"/></Text> 
    3163 </NumberedItem> 
    3164 <NumberedItem> 
    3165 <Text id="0779">Go back to the <AutoText key="glidict::GUI.Design"/> panel, select <AutoText key="glidict::CDM.GUI.Classifiers"/> and <b>delete</b> both <AutoText text="AZList" /> classifiers (<AutoText key="metadata::ex.Title"/> and <AutoText key="metadata::ex.Source"/>). Add an <AutoText text="AZList" /> classifier for <AutoText key="metadata::dc.Title"/> and an <AutoText text="AZCompactList" /> classifier for <AutoText key="metadata::dc.Contributor"/>.</Text> 
     3140<Text id="0778">In the <AutoText key="glidict::GUI.Design"/> panel, select <AutoText key="glidict::CDM.GUI.Indexes"/>. Delete the <AutoText key="metadata::ex.Source"/> index, and add one for <AutoText key="metadata::dc.Contributor"/>. Rename the <AutoText key="metadata::dc.Contributor"/> index by going to the <AutoText key="glidict::CDM.GUI.SearchMetadata"/> section in the <AutoText key="glidict::GUI.Format"/> panel, select this index and change its value to "_labelAuthor_". Using a macro for an index name means that it will display in the correct language (assuming that the macro has been translated). You can check <Path>Greenstone &rarr; macros &rarr; english.dm</Path> to see which macros are available.</Text> 
     3141</NumberedItem> 
     3142<NumberedItem> 
     3143<Text id="0779">Go back to the <AutoText key="glidict::GUI.Design"/> panel, select <AutoText key="glidict::CDM.GUI.Classifiers"/>. Select the <AutoText key="metadata::ex.Source"/> <AutoText text="List" /> classifier and click <AutoText key="glidict::CDM.ClassifierManager.Configure" type="button"/>. Change the <AutoText text="metadata"/> option to <AutoText key="metadata::dc.Contributor"/>. Activate the <AutoText text="bookshelf_type"/> option and set its value to <AutoText text="always"/>.</Text> 
    31663144</NumberedItem> 
    31673145<NumberedItem> 
     
    31723150</NumberedItem> 
    31733151<NumberedItem> 
    3174 <Text id="0782">Also, let's add a format statement for the classifier based on <AutoText key="metadata::dc.Contributor"/> metadata. In the <AutoText key="glidict::CDM.FormatManager.Feature"/> menu (under <AutoText key="glidict::CDM.GUI.Formats"/> on the  <AutoText key="glidict::GUI.Format"/> panel), select the item that says:</Text> 
     3152<Text id="0782">Also, let's add a format statement for the classifier based on <AutoText key="metadata::dc.Contributor"/> metadata. In the <AutoText key="glidict::CDM.FormatManager.Feature"/> menu (under <AutoText key="glidict::CDM.GUI.Formats"/> on the  <AutoText key="glidict::GUI.Format"/> panel), select the item that starts with:</Text> 
    31753153<Indent> 
    3176 CL2: AZCompactList -metadata <AutoText key="metadata::dc.Contributor" type="plain"/> 
     3154CL2: List -metadata <AutoText key="metadata::dc.Contributor" type="plain"/> 
    31773155</Indent>  
    31783156</NumberedItem>
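
As a rough illustration of the `process_exp` value `xml.*\.item$` that step sc25c sets on PagedImagePlugin, the sketch below applies the same expression to a few file paths. It assumes the expression is tested against each file's path as an unanchored search. The paths and the helper name `matching_paths` are invented for this example and do not come from the tutorial. Greenstone plugins are written in Perl; Python's `re` module is used here only because this particular pattern behaves the same way in both.

```python
import re

# process_exp value as given in the tutorial text (step sc25c).
PROCESS_EXP = re.compile(r"xml.*\.item$")

# Hypothetical file paths, invented for illustration only.
candidate_paths = [
    "import/xml/gazette/page_01.item",      # has "xml" and ends in ".item"
    "import/xml/gazette/page_01.item.bak",  # does not end in ".item"
    "import/images/page_01.item",           # ends in ".item" but has no "xml"
    "import/xml/gazette/page_01.tif",       # has "xml" but wrong extension
]


def matching_paths(paths):
    """Return the paths in which the expression matches somewhere."""
    return [p for p in paths if PROCESS_EXP.search(p)]


if __name__ == "__main__":
    for path in matching_paths(candidate_paths):
        print(path)
```

Under these assumptions only the first path is selected: the expression needs the literal text `xml` somewhere in the path, followed later by a name that ends in `.item`.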