Ignore:
Timestamp:
2009-04-20T21:59:14+12:00 (15 years ago)
Author:
kjdon
Message:

updated config file collectionmeta

File:
1 edited

Legend:

Unmodified
Added
Removed
  • collections/documented-examples/trunk/wiki-e/etc/collect.cfg

    r19060 r19148  
    1515plugin      WordPlugin
    1616plugin      NulPlugin
     17plugin      MediaWikiPlugin -searchbox_div_exp <div([^>]*)id=(\"|')p-search(\"|')(.|\n)*?</div> -show_toc -delete_toc -remove_title_suffix_exp " - (.*)" -toc_exp <table([^>]*)id=(\"|')toc(\"|')(.|\n)*?</table>\n -nav_div_exp <div([^>]*)id=(\"|')p-navigation(\"|')(.|\n)*?</div> -delete_searchbox -delete_nav
    1718plugin      MetadataXMLPlugin
    18 plugin      MediaWikiPlugin -searchbox_div_exp <div([^>]*)id=(\"|')p-search(\"|')(.|\n)*?</div> -show_toc -delete_toc -remove_title_suffix_exp " - (.*)" -toc_exp <table([^>]*)id=(\"|')toc(\"|')(.|\n)*?</table>\n -nav_div_exp <div([^>]*)id=(\"|')p-navigation(\"|')(.|\n)*?</div> -delete_searchbox -delete_nav
    1919plugin      ArchivesInfPlugin
    2020plugin      DirectoryPlugin
     
    3737format DocumentButtons "Detach|Highlight"
    3838
    39 format SearchTypes "plain,form"
     39format SearchTypes "plain"
    4040
    41 collectionmeta  collectionname [l=en] "wiki"
    42 collectionmeta  .document:text [l=en] "text"
    43 collectionmeta  .document:Title [l=en] "titles"
    44 collectionmeta  .document:Source [l=en] "filenames"
    45 collectionmeta  .text [l=en] "text"
    46 collectionmeta  .Title [l=en] "titles"
    47 collectionmeta  .Source [l=en] "filenames"
     41collectionmeta  .text  "_labeltext_"
     42collectionmeta  .Title  "_labeltitle_"
     43collectionmeta  .Source "_labelSource_"
    4844
    4945# -- English text ----------------
    50 collectionmeta  collectionextra [l=en] "This demonstration collection is made from the Greenstone Wiki website. It shows off the new feature of building a Greenstone collection from a MediaWiki website in Greestone.\n
     46collectionmeta  collectionname [l=en] "MediaWiki collection"
     47collectionmeta  collectionextra [l=en] "
     48This demonstration collection is made from the Greenstone Wiki website.
     49It shows off the new feature of building a Greenstone collection from a
     50MediaWiki website in Greestone.
    5151
    52 <h3>How the collection works</h3>\n
     52<h3>How the collection works</h3>
    5353
    5454<p>
    55 The <a href=\"_httpcollection_/etc/collect.cfg\" target=collect.cfg>collection configuration file</a> contains nine plugins <i>MediaWikiPlug</i>, <i>ImagePlug</i>, <i>ZipPlug</i>, <i>PDFPlug</i>, <i>PPTPlug</i>, <i>WordPlug</i> (along with the standard three <i>GAPlug</i>, <i>ArcPlug</i> and <i>RecPlug</i>). The <i>MediaWikiPlug</i> handles the HTML pages downloaded from a MediaWiki website, while the <i>ImagePlug</i>, <i>ZipPlug</i>, <i>PDFPlug</i>, <i>PPTPlug</i>, <i>WordPlug</i> handles the image, zip, PDF, PPT and Word files associated with the Greenstone Wiki. \n
     55The <a href=\"_httpcollection_/etc/collect.cfg\" target=collect.cfg>collection
     56 configuration file</a> contains the plugins <i>MediaWikiPlugin</i>,
     57<i>ImagePlugin</i>, <i>ZipPlugin</i>, <i>PDFPlugin</i>,
     58<i>PowerPointPlugin</i>, <i>WordPlugin</i> (along with the standard
     59plugins <i>GreenstoneXMLPlugin</i>, <i>ArchivesInfPlugin</i> and
     60<i>DirectoryPlugin</i>). The <i>MediaWikiPlugin</i> handles the HTML pages
     61downloaded from a MediaWiki website, while <i>ImagePlugin</i>,
     62<i>ZipPlugin</i>, <i>PDFPlugin</i>, <i>PowerPointPlugin</i> and
     63<i>WordPlugin</i> handle the image, zip, PDF, PowerPoint and Word
     64files associated with the Greenstone Wiki.
    5665
    5766<p>
    58 To build the collection from a MediaWiki website, first add the new <i>MediaWiki</i> download plug on the <i>Download</i> panel of GLI. This plug works in a similar way with the <i>Web</i> download plug, but is speically for crawling MediaWiki websites.\n
     67To build a collection from a MediaWiki website, first download the wiki
     68files using the <i>MediaWiki</i> option on the <i>Download</i> panel of GLI.
     69This download type works in a similar way to a the <i>Web</i> download, but
     70is specially designed for crawling MediaWiki websites.\n
    5971
     72<p>Once the files are downloaded, copy them into a collection using the <i>Gather</i>, dragging them from the <i>Downloaded Files</i> folder in the Workspace tree on the left-hand side.
     73 
    6074<p>
    61 In the <i>Document Plugins</i> section of the <i>Design</i> panel, add <i>MediaWikiPlug</i>. <i>MediaWikiPlug</i> has 5 options, <i>show_toc</i>, <i>toc_exp</i>, <i>delete_toc</i>, <i>delete_nav</i>, <i>nav_exp</i>. Switch on <i>show_toc</i> will copy the table of contents on the website\'s main page to the collection\'s about page. This needs to specify a regex expression for the table of contents in the <i>toc_exp</i> option. Select the <i>delete_toc</i> option will surpress the table of content on each Wiki page. <i>delete_nav</i> and <i>nav_exp</i> options are for supressing the navigation toolbar, which is normally on the left side of a wiki page, and the <i>nav_exp</i> option is an regular expression of the position of the navigation toolbar. \n
     75In the <i>Document Plugins</i> section of the <i>Design</i> panel,
     76add <i>MediaWikiPlugin</i>. <i>MediaWikiPlugin</i> has several specific
     77options which control aspects of page presentation, such as whether or not
     78the table of contents, navigation toolbars and search box are shown on each
     79page. Configure these options based on how you want the pages to appear. You can see the options used by this collection in its
     80<a href=\"_httpcollection_/etc/collect.cfg\" target=collect.cfg>collection
     81 configuration file</a>.
    6282"
    63 collectionmeta  iconcollection [l=en] "_httpprefix_/collect/wiki/images/en/wiki.gif"
    64 collectionmeta  iconcollectionsmall [l=en] "_httpprefix_/collect/wiki/images/en/wiki.gif"
Note: See TracChangeset for help on using the changeset viewer.