Timestamp:
20.04.2009 21:59:14 (11 years ago)
Author:
kjdon
Message:

updated config file collectionmeta

Files:
1 modified

  • collections/documented-examples/trunk/wiki-e/etc/collect.cfg

    r19060 r19148  
    1515plugin      WordPlugin 
    1616plugin      NulPlugin 
     17plugin      MediaWikiPlugin -searchbox_div_exp <div([^>]*)id=(\"|')p-search(\"|')(.|\n)*?</div> -show_toc -delete_toc -remove_title_suffix_exp " - (.*)" -toc_exp <table([^>]*)id=(\"|')toc(\"|')(.|\n)*?</table>\n -nav_div_exp <div([^>]*)id=(\"|')p-navigation(\"|')(.|\n)*?</div> -delete_searchbox -delete_nav 
    1718plugin      MetadataXMLPlugin 
    18 plugin      MediaWikiPlugin -searchbox_div_exp <div([^>]*)id=(\"|')p-search(\"|')(.|\n)*?</div> -show_toc -delete_toc -remove_title_suffix_exp " - (.*)" -toc_exp <table([^>]*)id=(\"|')toc(\"|')(.|\n)*?</table>\n -nav_div_exp <div([^>]*)id=(\"|')p-navigation(\"|')(.|\n)*?</div> -delete_searchbox -delete_nav 
    1919plugin      ArchivesInfPlugin 
    2020plugin      DirectoryPlugin 
     
    3737format DocumentButtons "Detach|Highlight" 
    3838 
    39 format SearchTypes "plain,form" 
     39format SearchTypes "plain" 
    4040 
    41 collectionmeta  collectionname [l=en] "wiki" 
    42 collectionmeta  .document:text [l=en] "text" 
    43 collectionmeta  .document:Title [l=en] "titles" 
    44 collectionmeta  .document:Source [l=en] "filenames" 
    45 collectionmeta  .text [l=en] "text" 
    46 collectionmeta  .Title [l=en] "titles" 
    47 collectionmeta  .Source [l=en] "filenames" 
     41collectionmeta  .text  "_labeltext_" 
     42collectionmeta  .Title  "_labeltitle_" 
     43collectionmeta  .Source "_labelSource_" 
    4844 
    4945# -- English text ---------------- 
    50 collectionmeta  collectionextra [l=en] "This demonstration collection is made from the Greenstone Wiki website. It shows off the new feature of building a Greenstone collection from a MediaWiki website in Greestone.\n 
     46collectionmeta  collectionname [l=en] "MediaWiki collection" 
     47collectionmeta  collectionextra [l=en] " 
     48This demonstration collection is made from the Greenstone Wiki website.  
     49It shows off the new feature of building a Greenstone collection from a  
     50MediaWiki website in Greestone. 
    5151 
    52 <h3>How the collection works</h3>\n 
     52<h3>How the collection works</h3> 
    5353 
    5454<p> 
    55 The <a href=\"_httpcollection_/etc/collect.cfg\" target=collect.cfg>collection configuration file</a> contains nine plugins <i>MediaWikiPlug</i>, <i>ImagePlug</i>, <i>ZipPlug</i>, <i>PDFPlug</i>, <i>PPTPlug</i>, <i>WordPlug</i> (along with the standard three <i>GAPlug</i>, <i>ArcPlug</i> and <i>RecPlug</i>). The <i>MediaWikiPlug</i> handles the HTML pages downloaded from a MediaWiki website, while the <i>ImagePlug</i>, <i>ZipPlug</i>, <i>PDFPlug</i>, <i>PPTPlug</i>, <i>WordPlug</i> handles the image, zip, PDF, PPT and Word files associated with the Greenstone Wiki. \n 
     55The <a href=\"_httpcollection_/etc/collect.cfg\" target=collect.cfg>collection 
     56 configuration file</a> contains the plugins <i>MediaWikiPlugin</i>,  
     57<i>ImagePlugin</i>, <i>ZipPlugin</i>, <i>PDFPlugin</i>,  
     58<i>PowerPointPlugin</i>, <i>WordPlugin</i> (along with the standard  
     59plugins <i>GreenstoneXMLPlugin</i>, <i>ArchivesInfPlugin</i> and  
     60<i>DirectoryPlugin</i>). The <i>MediaWikiPlugin</i> handles the HTML pages  
     61downloaded from a MediaWiki website, while <i>ImagePlugin</i>,  
     62<i>ZipPlugin</i>, <i>PDFPlugin</i>, <i>PowerPointPlugin</i> and  
     63<i>WordPlugin</i> handle the image, zip, PDF, PowerPoint and Word  
     64files associated with the Greenstone Wiki.  
    5665 
    5766<p> 
    58 To build the collection from a MediaWiki website, first add the new <i>MediaWiki</i> download plug on the <i>Download</i> panel of GLI. This plug works in a similar way with the <i>Web</i> download plug, but is speically for crawling MediaWiki websites.\n 
     67To build a collection from a MediaWiki website, first download the wiki  
     68files using the <i>MediaWiki</i> option on the <i>Download</i> panel of GLI.  
     69This download type works in a similar way to a the <i>Web</i> download, but  
     70is specially designed for crawling MediaWiki websites.\n 
    5971 
     72<p>Once the files are downloaded, copy them into a collection using the <i>Gather</i>, dragging them from the <i>Downloaded Files</i> folder in the Workspace tree on the left-hand side. 
     73  
    6074<p> 
    61 In the <i>Document Plugins</i> section of the <i>Design</i> panel, add <i>MediaWikiPlug</i>. <i>MediaWikiPlug</i> has 5 options, <i>show_toc</i>, <i>toc_exp</i>, <i>delete_toc</i>, <i>delete_nav</i>, <i>nav_exp</i>. Switch on <i>show_toc</i> will copy the table of contents on the website\'s main page to the collection\'s about page. This needs to specify a regex expression for the table of contents in the <i>toc_exp</i> option. Select the <i>delete_toc</i> option will surpress the table of content on each Wiki page. <i>delete_nav</i> and <i>nav_exp</i> options are for supressing the navigation toolbar, which is normally on the left side of a wiki page, and the <i>nav_exp</i> option is an regular expression of the position of the navigation toolbar. \n 
     75In the <i>Document Plugins</i> section of the <i>Design</i> panel,  
     76add <i>MediaWikiPlugin</i>. <i>MediaWikiPlugin</i> has several specific  
     77options which control aspects of page presentation, such as whether or not  
     78the table of contents, navigation toolbars and search box are shown on each  
     79page. Configure these options based on how you want the pages to appear. You can see the options used by this collection in its  
     80<a href=\"_httpcollection_/etc/collect.cfg\" target=collect.cfg>collection 
     81 configuration file</a>. 
    6282" 
    63 collectionmeta  iconcollection [l=en] "_httpprefix_/collect/wiki/images/en/wiki.gif" 
    64 collectionmeta  iconcollectionsmall [l=en] "_httpprefix_/collect/wiki/images/en/wiki.gif"
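
The -toc_exp and -nav_div_exp values passed to MediaWikiPlugin in this changeset are ordinary regular expressions, so they can be tried against a saved wiki page before building. The following is a minimal sketch only: it is not the plugin's own (Perl) code, the helper names and the sample HTML are hypothetical, and the patterns are copied from the config with the \" escaping removed.

```python
import re

# Patterns taken from the plugin line in collect.cfg, with the
# config-file escaping of double quotes (\") undone.
TOC_EXP = r"""<table([^>]*)id=("|')toc("|')(.|\n)*?</table>\n"""
NAV_DIV_EXP = r"""<div([^>]*)id=("|')p-navigation("|')(.|\n)*?</div>"""

# Hypothetical stand-in for a downloaded MediaWiki page.
SAMPLE_PAGE = (
    '<div id="p-navigation"><ul><li>Main Page</li></ul></div>\n'
    '<table id="toc" class="toc"><tr><td>Contents</td></tr></table>\n'
    '<p>Article body</p>\n'
)


def extract_block(html, pattern):
    """Return the first block matched by pattern, or None if absent."""
    match = re.search(pattern, html)
    return match.group(0) if match else None


def delete_block(html, pattern):
    """Return html with the first block matched by pattern removed."""
    return re.sub(pattern, "", html, count=1)


# Roughly what -show_toc needs: the table of contents block itself.
toc_block = extract_block(SAMPLE_PAGE, TOC_EXP)

# Roughly what -delete_toc and -delete_nav ask for: the page without them.
cleaned = delete_block(delete_block(SAMPLE_PAGE, TOC_EXP), NAV_DIV_EXP)

if __name__ == "__main__":
    print(toc_block)
    print(cleaned)
```

If a pattern does not match the HTML produced by a particular wiki skin, extract_block returns None, which is a quick way to see that the expression needs adjusting for that site.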