Changeset 17748

Show
Ignore:
Timestamp:
06.11.2008 14:49:17 (11 years ago)
Author:
kjdon
Message:

plugin and plugout name changes

Files:
1 modified

Legend:

Unmodified
Added
Removed
  • gsdl/trunk/perllib/strings.properties

    r17587 r17748  
    8585buildcol.debug:Print output to STDOUT. 
    8686 
    87 buildcol.desc:PERL script used to build a greenstone collection from GA documents. 
     87buildcol.desc:PERL script used to build a greenstone collection from archive documents. 
    8888 
    8989buildcol.faillog:Fail log filename. This log receives the filenames of any files which fail to be processed. 
     
    236236import.complete:Import complete 
    237237 
    238 import.debug:Print imported text to STDOUT (for GA importing) 
    239  
    240 import.desc:PERL script used to import files into a GA format ready for building. 
     238import.debug:Print imported text to STDOUT (for GreenstoneXML importing) 
     239 
     240import.desc:PERL script used to import files into a format (GreenstoneXML or GreenstoneMETS) ready for building. 
    241241 
    242242import.faillog:Fail log filename. This log receives the filenames of any files which fail to be processed. 
     
    269269import.OIDmetadata:Specifies the metadata element that hold's the document's unique identifier, for use with -OIDtype=assigned. 
    270270 
    271 import.saveas:This is to decide the archives format to be generated. The default setting is to GA. 
    272  
    273 import.saveas.GA:Will generate Greenstone Archive format. 
    274  
    275 import.saveas.METS:Will generate METS format. 
     271import.saveas:Format that the archive files should be saved as.  
    276272 
    277273import.out:Filename or handle to print output status to. 
     
    338334export.complete:Export complete 
    339335 
    340 export.debug:Print exported text to STDOUT (for GA exporting) 
     336export.debug:Print exported text to STDOUT (for GreenstoneXML exporting) 
    341337 
    342338export.desc:PERL script used to export files in a Greenstone collection to another format. 
     
    363359export.saveas.FedoraMETS:METS format using the Fedora profile. 
    364360 
    365 export.saveas.GA:Greenstone Archive format 
     361export.saveas.GreenstoneXML:Greenstone XML Archive format 
    366362 
    367363export.saveas.MARCXML:MARC XML format (an XML version of MARC 21) 
    368  
    369364 
    370365export.out:Filename or handle to print output status to. 
     
    650645# Plugin option descriptions 
    651646# 
    652 AbstractPlugin.desc:A simple base plugin for plugins that don't inherit from BasePlugin. 
    653  
    654647AcronymExtractor.adding:adding 
    655648 
     
    670663AcronymExtractor.markup_acronyms:Add acronym metadata into document text. 
    671664 
    672 ArchivesInfPlugin.desc:Plugin which recurses through an archives.inf file (i.e. the file generated in the archives directory when an import is done), processing each file it finds. 
     665ArchivesInfPlugin.desc:Plugin which reads through an archives.inf file (i.e. the file generated in the archives directory when an import is done), processing each file it finds. 
    673666 
    674667AutoExtractMetadata.desc: Base plugin that brings together all the extractor functionality from the Extractor plugins. 
     
    744737ConvertBinaryFile.convert_to.pagedimg_png:A series of images in PNG format.  
    745738 
    746 ConvertBinaryFile.desc:This plugin is inherited by such plugins as WordPlugin, PPTPlugin, PSPlugin, RTFPlugin and PDFPlugin. It facilitates the conversion of these document types to either HTML, TEXT or a series of images. It works by dynamically loading an appropriate secondary plugin (HTMLPlugin, StructuredHTMLPlugin, PagedImagePlugin or TextPlugin) based on the plugin argument 'convert_to'.  
     739ConvertBinaryFile.desc:This plugin is inherited by such plugins as WordPlugin, PowerPointPlugin, PostScriptPlugin, RTFPlugin and PDFPlugin. It facilitates the conversion of these document types to either HTML, TEXT or a series of images. It works by dynamically loading an appropriate secondary plugin (HTMLPlugin, StructuredHTMLPlugin, PagedImagePlugin or TextPlugin) based on the plugin argument 'convert_to'.  
    747740 
    748741ConvertBinaryFile.keep_original_filename:Keep the original filename for the associated file, rather than converting to doc.pdf, doc.doc etc. 
     
    770763DirectoryPlugin.use_metadata_files:(DEPRECATED - Add MetadataXMLPlugin to the list of plugins instead) Read metadata from metadata XML files. 
    771764 
    772 DBPlugin.desc:A plugin that imports records from a database. This uses perl's DBI module, which includes back-ends for mysql, postgresql, comma separated values (CSV), MS Excel, ODBC, sybase, etc... Extra modules may need to be installed to use this. See <GSDLHOME>/etc/packages/example.dbi for an example config file.  
     765DatabasePlugin.desc:A plugin that imports records from a database. This uses perl's DBI module, which includes back-ends for mysql, postgresql, comma separated values (CSV), MS Excel, ODBC, sybase, etc... Extra modules may need to be installed to use this. See <GSDLHOME>/etc/packages/example.dbi for an example config file.  
    773766 
    774767DSpacePlugin.desc:A plugin that takes a collection of documents exported from DSpace and imports them into Greenstone. 
     
    804797FOXPlugin.desc:Plugin to process a Foxbase dbt file. This plugin provides the basic functionality to read in the dbt and dbf files and process each record. This general plugin should be overridden for a particular database to process the appropriate fields in the file. 
    805798 
    806 GAPlugin.desc:Processes Greenstone Archive XML documents. Note that this plugin does no syntax checking (though the XML::Parser module tests for well-formedness). It's assumed that the Greenstone Archive files conform to their DTD. 
     799GreenstoneXMLPlugin.desc:Processes Greenstone Archive XML documents. Note that this plugin does no syntax checking (though the XML::Parser module tests for well-formedness). It's assumed that the Greenstone Archive files conform to their DTD. 
    807800 
    808801GISExtractor.desc:Helper extractor plugin for extracting placenames from text. Requires GIS extension to Greenstone. 
     
    814807GISExtractor.place_list:When extracting placements, include list of placenames at start of the document. Requires GIS extension to Greenstone. 
    815808 
    816 GMLPlugin.desc:Plugin which processes a GML format document. It assumes that gml tags are all in lower-case. 
    817  
    818809HBPlugin.desc:Plugin which processes an HTML book directory. This plugin is used by the Humanity Library collections and does not handle input encodings other than ascii or extended ascii. This code is kind of ugly and could no doubt be made to run faster, by leaving it in this state I hope to encourage people to make their collections use BookPlugin instead ;-)\n\nUse BookPlugin if creating a new collection and marking up files like the Humanity Library collections. BookPlugin accepts all input encodings but expects the marked up files to be cleaner than those used by the Humanity Library collections 
    819810 
    820811HBPlugin.encoding.iso_8859_1:Latin1 (western languages) 
     812 
     813HTMLImagePlugin.aggressiveness:Range of related text extraction techniques to use. 
     814HTMLImagePlugin.aggressiveness.1:Filename, path, ALT text only. 
     815HTMLImagePlugin.aggressiveness.2:All of 1, plus caption where available. 
     816HTMLImagePlugin.aggressiveness.3:All of 2, plus near paragraphs where available. 
     817HTMLImagePlugin.aggressiveness.4:All of 3, plus previous headers (<h1>, <h2>...) where available. 
     818HTMLImagePlugin.aggressiveness.5:All of 4, plus textual references where available. 
     819HTMLImagePlugin.aggressiveness.6:All of 4, plus page metatags (title, keywords, etc). 
     820HTMLImagePlugin.aggressiveness.7:All of 6, 5 and 4 combined. 
     821HTMLImagePlugin.aggressiveness.8:All of 7, plus repeat caption, filename, etc (raise ranking of more relevant results). 
     822HTMLImagePlugin.aggressiveness.9:All of 1, plus full text of source page. 
     823 
     824HTMLImagePlugin.caption_length:Maximum length of captions (in characters). 
     825 
     826HTMLImagePlugin.convert_params:Additional parameters for ImageMagicK convert on thumbnail creation. For example, '-raise' will give a three dimensional effect to thumbnail images. 
     827 
     828HTMLImagePlugin.desc:A plugin for extracting images and associated text from webpages. 
     829 
     830HTMLImagePlugin.document_text:Add image text as document:text (otherwise IndexedText metadata field). 
     831 
     832HTMLImagePlugin.index_pages:Index the pages along with the images. Otherwise reference the pages at the source URL. 
     833 
     834HTMLImagePlugin.max_near_text:Maximum characters near images to extract. 
     835 
     836HTMLImagePlugin.min_height:Pixels. Skip images shorter than this. 
     837 
     838HTMLImagePlugin.min_near_text:Minimum characters of near text or caption to extract. 
     839 
     840HTMLImagePlugin.min_size:Bytes. Skip images smaller than this. 
     841 
     842HTMLImagePlugin.min_width:Pixels. Skip images narrower than this. 
     843 
     844HTMLImagePlugin.neartext_length:Target length of near text (in characters). 
     845 
     846HTMLImagePlugin.no_cache_images:Don't cache images (point to URL of original). 
     847 
     848HTMLImagePlugin.smallpage_threshold:Images on pages smaller than this (bytes) will have the page (title, keywords, etc) meta-data added. 
     849 
     850HTMLImagePlugin.textrefs_threshold:Threshold for textual references. Lower values mean the algorithm is less strict. 
     851 
     852HTMLImagePlugin.thumb_size:Max thumbnail size. Both width and height. 
    821853 
    822854HTMLPlugin.assoc_files:Perl regular expression of file extensions to associate with html documents. 
     
    9961028PDFPlugin.zoom:The factor by which to zoom the PDF for output (this is only useful if -complex is set). 
    9971029 
    998 PPTPlugin.desc:A plugin for importing Microsoft PowerPoint files. 
    999  
    1000 PPTPlugin.windows_scripting:Use MicroSoft Windows scripting technology (Visual Basic for Applications) to get PPT to convert document to various image types (e.g. JPEG,PNG,GIF) rather than rely on the open source package ppttohtml.  
     1030PostScriptPlugin.desc:This is a \"poor man's\" ps to text converter. If you are serious, consider using the PRESCRIPT package, which is available for download at http://www.nzdl.org/html/software.html 
     1031 
     1032PostScriptPlugin.extract_date:Extract date from PS header. 
     1033 
     1034PostScriptPlugin.extract_pages:Extract pages from PS header. 
     1035 
     1036PostScriptPlugin.extract_title:Extract title from PS header. 
     1037 
     1038PowerPointPlugin.desc:A plugin for importing Microsoft PowerPoint files. 
     1039 
     1040PowerPointPlugin.windows_scripting:Use MicroSoft Windows scripting technology (Visual Basic for Applications) to get PPT to convert document to various image types (e.g. JPEG,PNG,GIF) rather than rely on the open source package ppttohtml.  
    10011041 
    10021042PrintInfo.bad_general_option:The %s plugin uses an incorrect option. Check your collect.cfg configuration file. 
     
    10081048ProCitePlugin.entry_separator:The string used to separate multiple values for single metadata fields in ProCite database records. 
    10091049 
    1010 PSPlugin.desc:This is a \"poor man's\" ps to text converter. If you are serious, consider using the PRESCRIPT package, which is available for download at http://www.nzdl.org/html/software.html 
    1011  
    1012 PSPlugin.extract_date:Extract date from PS header. 
    1013  
    1014 PSPlugin.extract_pages:Extract pages from PS header. 
    1015  
    1016 PSPlugin.extract_title:Extract title from PS header. 
    1017  
    10181050ReadTextFile.could_not_extract_encoding:WARNING: encoding could not be extracted from %s - defaulting to %s 
    10191051 
     
    10821114TextPlugin.desc:Creates simple single-level document. Adds Title metadata of first line of text (up to 100 characters long). 
    10831115 
    1084 TextPlugin.title_sub:Substitution expression to modify string stored as Title. Used by, for example, PSPlugin to remove "Page 1" etc from text used as the title. 
     1116TextPlugin.title_sub:Substitution expression to modify string stored as Title. Used by, for example, PostScriptPlugin to remove "Page 1" etc from text used as the title. 
    10851117 
    10861118UnknownPlugin.assoc_field:Name of the metadata field that will hold the associated file's name. 
     
    10951127 
    10961128UnknownPlugin.srcicon:Specify a macro name (without underscores) to use as srcicon metadata. 
    1097  
    1098 W3ImagePlugin.aggressiveness:Range of related text extraction techniques to use. 
    1099 W3ImagePlugin.aggressiveness.1:Filename, path, ALT text only. 
    1100 W3ImagePlugin.aggressiveness.2:All of 1, plus caption where available. 
    1101 W3ImagePlugin.aggressiveness.3:All of 2, plus near paragraphs where available. 
    1102 W3ImagePlugin.aggressiveness.4:All of 3, plus previous headers (<h1>, <h2>...) where available. 
    1103 W3ImagePlugin.aggressiveness.5:All of 4, plus textual references where available. 
    1104 W3ImagePlugin.aggressiveness.6:All of 4, plus page metatags (title, keywords, etc). 
    1105 W3ImagePlugin.aggressiveness.7:All of 6, 5 and 4 combined. 
    1106 W3ImagePlugin.aggressiveness.8:All of 7, plus repeat caption, filename, etc (raise ranking of more relevant results). 
    1107 W3ImagePlugin.aggressiveness.9:All of 1, plus full text of source page. 
    1108  
    1109 W3ImagePlugin.caption_length:Maximum length of captions (in characters). 
    1110  
    1111 W3ImagePlugin.convert_params:Additional parameters for ImageMagicK convert on thumbnail creation. For example, '-raise' will give a three dimensional effect to thumbnail images. 
    1112  
    1113 W3ImagePlugin.desc:A plugin for extracting images and associated text from webpages. 
    1114  
    1115 W3ImagePlugin.document_text:Add image text as document:text (otherwise IndexedText metadata field). 
    1116  
    1117 W3ImagePlugin.index_pages:Index the pages along with the images. Otherwise reference the pages at the source URL. 
    1118  
    1119 W3ImagePlugin.max_near_text:Maximum characters near images to extract. 
    1120  
    1121 W3ImagePlugin.min_height:Pixels. Skip images shorter than this. 
    1122  
    1123 W3ImagePlugin.min_near_text:Minimum characters of near text or caption to extract. 
    1124  
    1125 W3ImagePlugin.min_size:Bytes. Skip images smaller than this. 
    1126  
    1127 W3ImagePlugin.min_width:Pixels. Skip images narrower than this. 
    1128  
    1129 W3ImagePlugin.neartext_length:Target length of near text (in characters). 
    1130  
    1131 W3ImagePlugin.no_cache_images:Don't cache images (point to URL of original). 
    1132  
    1133 W3ImagePlugin.smallpage_threshold:Images on pages smaller than this (bytes) will have the page (title, keywords, etc) meta-data added. 
    1134  
    1135 W3ImagePlugin.textrefs_threshold:Threshold for textual references. Lower values mean the algorithm is less strict. 
    1136  
    1137 W3ImagePlugin.thumb_size:Max thumbnail size. Both width and height. 
    11381129 
    11391130WordPlugin.desc:A plugin for importing Microsoft Word documents. 
     
    12241215FedoraMETSPlugout.fedora_namespace:The prefix used in Fedora for process ids (PIDS) e.g. greenstone:HASH0122efe4a2c58d0 
    12251216 
    1226 GAPlugout.desc:Greenstone Archive format. 
     1217GreenstoneXMLPlugout.desc:Greenstone XML Archive format. 
    12271218 
    12281219GreenstoneMETSPlugout.desc:METS format using the Greenstone profile.