Changeset 17748 for gsdl/trunk/perllib


Ignore:
Timestamp:
2008-11-06T14:49:17+13:00 (15 years ago)
Author:
kjdon
Message:

plugin and plugout name changes

File:
1 edited

Legend:

Unmodified
Added
Removed
  • gsdl/trunk/perllib/strings.properties

    r17587 r17748  
    8585buildcol.debug:Print output to STDOUT.
    8686
    87 buildcol.desc:PERL script used to build a greenstone collection from GA documents.
     87buildcol.desc:PERL script used to build a greenstone collection from archive documents.
    8888
    8989buildcol.faillog:Fail log filename. This log receives the filenames of any files which fail to be processed.
     
    236236import.complete:Import complete
    237237
    238 import.debug:Print imported text to STDOUT (for GA importing)
    239 
    240 import.desc:PERL script used to import files into a GA format ready for building.
     238import.debug:Print imported text to STDOUT (for GreenstoneXML importing)
     239
     240import.desc:PERL script used to import files into a format (GreenstoneXML or GreenstoneMETS) ready for building.
    241241
    242242import.faillog:Fail log filename. This log receives the filenames of any files which fail to be processed.
     
    269269import.OIDmetadata:Specifies the metadata element that hold's the document's unique identifier, for use with -OIDtype=assigned.
    270270
    271 import.saveas:This is to decide the archives format to be generated. The default setting is to GA.
    272 
    273 import.saveas.GA:Will generate Greenstone Archive format.
    274 
    275 import.saveas.METS:Will generate METS format.
     271import.saveas:Format that the archive files should be saved as.
    276272
    277273import.out:Filename or handle to print output status to.
     
    338334export.complete:Export complete
    339335
    340 export.debug:Print exported text to STDOUT (for GA exporting)
     336export.debug:Print exported text to STDOUT (for GreenstoneXML exporting)
    341337
    342338export.desc:PERL script used to export files in a Greenstone collection to another format.
     
    363359export.saveas.FedoraMETS:METS format using the Fedora profile.
    364360
    365 export.saveas.GA:Greenstone Archive format
     361export.saveas.GreenstoneXML:Greenstone XML Archive format
    366362
    367363export.saveas.MARCXML:MARC XML format (an XML version of MARC 21)
    368 
    369364
    370365export.out:Filename or handle to print output status to.
     
    650645# Plugin option descriptions
    651646#
    652 AbstractPlugin.desc:A simple base plugin for plugins that don't inherit from BasePlugin.
    653 
    654647AcronymExtractor.adding:adding
    655648
     
    670663AcronymExtractor.markup_acronyms:Add acronym metadata into document text.
    671664
    672 ArchivesInfPlugin.desc:Plugin which recurses through an archives.inf file (i.e. the file generated in the archives directory when an import is done), processing each file it finds.
     665ArchivesInfPlugin.desc:Plugin which reads through an archives.inf file (i.e. the file generated in the archives directory when an import is done), processing each file it finds.
    673666
    674667AutoExtractMetadata.desc: Base plugin that brings together all the extractor functionality from the Extractor plugins.
     
    744737ConvertBinaryFile.convert_to.pagedimg_png:A series of images in PNG format.
    745738
    746 ConvertBinaryFile.desc:This plugin is inherited by such plugins as WordPlugin, PPTPlugin, PSPlugin, RTFPlugin and PDFPlugin. It facilitates the conversion of these document types to either HTML, TEXT or a series of images. It works by dynamically loading an appropriate secondary plugin (HTMLPlugin, StructuredHTMLPlugin, PagedImagePlugin or TextPlugin) based on the plugin argument 'convert_to'.
     739ConvertBinaryFile.desc:This plugin is inherited by such plugins as WordPlugin, PowerPointPlugin, PostScriptPlugin, RTFPlugin and PDFPlugin. It facilitates the conversion of these document types to either HTML, TEXT or a series of images. It works by dynamically loading an appropriate secondary plugin (HTMLPlugin, StructuredHTMLPlugin, PagedImagePlugin or TextPlugin) based on the plugin argument 'convert_to'.
    747740
    748741ConvertBinaryFile.keep_original_filename:Keep the original filename for the associated file, rather than converting to doc.pdf, doc.doc etc.
     
    770763DirectoryPlugin.use_metadata_files:(DEPRECATED - Add MetadataXMLPlugin to the list of plugins instead) Read metadata from metadata XML files.
    771764
    772 DBPlugin.desc:A plugin that imports records from a database. This uses perl's DBI module, which includes back-ends for mysql, postgresql, comma separated values (CSV), MS Excel, ODBC, sybase, etc... Extra modules may need to be installed to use this. See <GSDLHOME>/etc/packages/example.dbi for an example config file.
     765DatabasePlugin.desc:A plugin that imports records from a database. This uses perl's DBI module, which includes back-ends for mysql, postgresql, comma separated values (CSV), MS Excel, ODBC, sybase, etc... Extra modules may need to be installed to use this. See <GSDLHOME>/etc/packages/example.dbi for an example config file.
    773766
    774767DSpacePlugin.desc:A plugin that takes a collection of documents exported from DSpace and imports them into Greenstone.
     
    804797FOXPlugin.desc:Plugin to process a Foxbase dbt file. This plugin provides the basic functionality to read in the dbt and dbf files and process each record. This general plugin should be overridden for a particular database to process the appropriate fields in the file.
    805798
    806 GAPlugin.desc:Processes Greenstone Archive XML documents. Note that this plugin does no syntax checking (though the XML::Parser module tests for well-formedness). It's assumed that the Greenstone Archive files conform to their DTD.
     799GreenstoneXMLPlugin.desc:Processes Greenstone Archive XML documents. Note that this plugin does no syntax checking (though the XML::Parser module tests for well-formedness). It's assumed that the Greenstone Archive files conform to their DTD.
    807800
    808801GISExtractor.desc:Helper extractor plugin for extracting placenames from text. Requires GIS extension to Greenstone.
     
    814807GISExtractor.place_list:When extracting placements, include list of placenames at start of the document. Requires GIS extension to Greenstone.
    815808
    816 GMLPlugin.desc:Plugin which processes a GML format document. It assumes that gml tags are all in lower-case.
    817 
    818809HBPlugin.desc:Plugin which processes an HTML book directory. This plugin is used by the Humanity Library collections and does not handle input encodings other than ascii or extended ascii. This code is kind of ugly and could no doubt be made to run faster, by leaving it in this state I hope to encourage people to make their collections use BookPlugin instead ;-)\n\nUse BookPlugin if creating a new collection and marking up files like the Humanity Library collections. BookPlugin accepts all input encodings but expects the marked up files to be cleaner than those used by the Humanity Library collections
    819810
    820811HBPlugin.encoding.iso_8859_1:Latin1 (western languages)
     812
     813HTMLImagePlugin.aggressiveness:Range of related text extraction techniques to use.
     814HTMLImagePlugin.aggressiveness.1:Filename, path, ALT text only.
     815HTMLImagePlugin.aggressiveness.2:All of 1, plus caption where available.
     816HTMLImagePlugin.aggressiveness.3:All of 2, plus near paragraphs where available.
     817HTMLImagePlugin.aggressiveness.4:All of 3, plus previous headers (<h1>, <h2>...) where available.
     818HTMLImagePlugin.aggressiveness.5:All of 4, plus textual references where available.
     819HTMLImagePlugin.aggressiveness.6:All of 4, plus page metatags (title, keywords, etc).
     820HTMLImagePlugin.aggressiveness.7:All of 6, 5 and 4 combined.
     821HTMLImagePlugin.aggressiveness.8:All of 7, plus repeat caption, filename, etc (raise ranking of more relevant results).
     822HTMLImagePlugin.aggressiveness.9:All of 1, plus full text of source page.
     823
     824HTMLImagePlugin.caption_length:Maximum length of captions (in characters).
     825
     826HTMLImagePlugin.convert_params:Additional parameters for ImageMagicK convert on thumbnail creation. For example, '-raise' will give a three dimensional effect to thumbnail images.
     827
     828HTMLImagePlugin.desc:A plugin for extracting images and associated text from webpages.
     829
     830HTMLImagePlugin.document_text:Add image text as document:text (otherwise IndexedText metadata field).
     831
     832HTMLImagePlugin.index_pages:Index the pages along with the images. Otherwise reference the pages at the source URL.
     833
     834HTMLImagePlugin.max_near_text:Maximum characters near images to extract.
     835
     836HTMLImagePlugin.min_height:Pixels. Skip images shorter than this.
     837
     838HTMLImagePlugin.min_near_text:Minimum characters of near text or caption to extract.
     839
     840HTMLImagePlugin.min_size:Bytes. Skip images smaller than this.
     841
     842HTMLImagePlugin.min_width:Pixels. Skip images narrower than this.
     843
     844HTMLImagePlugin.neartext_length:Target length of near text (in characters).
     845
     846HTMLImagePlugin.no_cache_images:Don't cache images (point to URL of original).
     847
     848HTMLImagePlugin.smallpage_threshold:Images on pages smaller than this (bytes) will have the page (title, keywords, etc) meta-data added.
     849
     850HTMLImagePlugin.textrefs_threshold:Threshold for textual references. Lower values mean the algorithm is less strict.
     851
     852HTMLImagePlugin.thumb_size:Max thumbnail size. Both width and height.
    821853
    822854HTMLPlugin.assoc_files:Perl regular expression of file extensions to associate with html documents.
     
    9961028PDFPlugin.zoom:The factor by which to zoom the PDF for output (this is only useful if -complex is set).
    9971029
    998 PPTPlugin.desc:A plugin for importing Microsoft PowerPoint files.
    999 
    1000 PPTPlugin.windows_scripting:Use MicroSoft Windows scripting technology (Visual Basic for Applications) to get PPT to convert document to various image types (e.g. JPEG,PNG,GIF) rather than rely on the open source package ppttohtml.
     1030PostScriptPlugin.desc:This is a \"poor man's\" ps to text converter. If you are serious, consider using the PRESCRIPT package, which is available for download at http://www.nzdl.org/html/software.html
     1031
     1032PostScriptPlugin.extract_date:Extract date from PS header.
     1033
     1034PostScriptPlugin.extract_pages:Extract pages from PS header.
     1035
     1036PostScriptPlugin.extract_title:Extract title from PS header.
     1037
     1038PowerPointPlugin.desc:A plugin for importing Microsoft PowerPoint files.
     1039
     1040PowerPointPlugin.windows_scripting:Use MicroSoft Windows scripting technology (Visual Basic for Applications) to get PPT to convert document to various image types (e.g. JPEG,PNG,GIF) rather than rely on the open source package ppttohtml.
    10011041
    10021042PrintInfo.bad_general_option:The %s plugin uses an incorrect option. Check your collect.cfg configuration file.
     
    10081048ProCitePlugin.entry_separator:The string used to separate multiple values for single metadata fields in ProCite database records.
    10091049
    1010 PSPlugin.desc:This is a \"poor man's\" ps to text converter. If you are serious, consider using the PRESCRIPT package, which is available for download at http://www.nzdl.org/html/software.html
    1011 
    1012 PSPlugin.extract_date:Extract date from PS header.
    1013 
    1014 PSPlugin.extract_pages:Extract pages from PS header.
    1015 
    1016 PSPlugin.extract_title:Extract title from PS header.
    1017 
    10181050ReadTextFile.could_not_extract_encoding:WARNING: encoding could not be extracted from %s - defaulting to %s
    10191051
     
    10821114TextPlugin.desc:Creates simple single-level document. Adds Title metadata of first line of text (up to 100 characters long).
    10831115
    1084 TextPlugin.title_sub:Substitution expression to modify string stored as Title. Used by, for example, PSPlugin to remove "Page 1" etc from text used as the title.
     1116TextPlugin.title_sub:Substitution expression to modify string stored as Title. Used by, for example, PostScriptPlugin to remove "Page 1" etc from text used as the title.
    10851117
    10861118UnknownPlugin.assoc_field:Name of the metadata field that will hold the associated file's name.
     
    10951127
    10961128UnknownPlugin.srcicon:Specify a macro name (without underscores) to use as srcicon metadata.
    1097 
    1098 W3ImagePlugin.aggressiveness:Range of related text extraction techniques to use.
    1099 W3ImagePlugin.aggressiveness.1:Filename, path, ALT text only.
    1100 W3ImagePlugin.aggressiveness.2:All of 1, plus caption where available.
    1101 W3ImagePlugin.aggressiveness.3:All of 2, plus near paragraphs where available.
    1102 W3ImagePlugin.aggressiveness.4:All of 3, plus previous headers (<h1>, <h2>...) where available.
    1103 W3ImagePlugin.aggressiveness.5:All of 4, plus textual references where available.
    1104 W3ImagePlugin.aggressiveness.6:All of 4, plus page metatags (title, keywords, etc).
    1105 W3ImagePlugin.aggressiveness.7:All of 6, 5 and 4 combined.
    1106 W3ImagePlugin.aggressiveness.8:All of 7, plus repeat caption, filename, etc (raise ranking of more relevant results).
    1107 W3ImagePlugin.aggressiveness.9:All of 1, plus full text of source page.
    1108 
    1109 W3ImagePlugin.caption_length:Maximum length of captions (in characters).
    1110 
    1111 W3ImagePlugin.convert_params:Additional parameters for ImageMagicK convert on thumbnail creation. For example, '-raise' will give a three dimensional effect to thumbnail images.
    1112 
    1113 W3ImagePlugin.desc:A plugin for extracting images and associated text from webpages.
    1114 
    1115 W3ImagePlugin.document_text:Add image text as document:text (otherwise IndexedText metadata field).
    1116 
    1117 W3ImagePlugin.index_pages:Index the pages along with the images. Otherwise reference the pages at the source URL.
    1118 
    1119 W3ImagePlugin.max_near_text:Maximum characters near images to extract.
    1120 
    1121 W3ImagePlugin.min_height:Pixels. Skip images shorter than this.
    1122 
    1123 W3ImagePlugin.min_near_text:Minimum characters of near text or caption to extract.
    1124 
    1125 W3ImagePlugin.min_size:Bytes. Skip images smaller than this.
    1126 
    1127 W3ImagePlugin.min_width:Pixels. Skip images narrower than this.
    1128 
    1129 W3ImagePlugin.neartext_length:Target length of near text (in characters).
    1130 
    1131 W3ImagePlugin.no_cache_images:Don't cache images (point to URL of original).
    1132 
    1133 W3ImagePlugin.smallpage_threshold:Images on pages smaller than this (bytes) will have the page (title, keywords, etc) meta-data added.
    1134 
    1135 W3ImagePlugin.textrefs_threshold:Threshold for textual references. Lower values mean the algorithm is less strict.
    1136 
    1137 W3ImagePlugin.thumb_size:Max thumbnail size. Both width and height.
    11381129
    11391130WordPlugin.desc:A plugin for importing Microsoft Word documents.
     
    12241215FedoraMETSPlugout.fedora_namespace:The prefix used in Fedora for process ids (PIDS) e.g. greenstone:HASH0122efe4a2c58d0
    12251216
    1226 GAPlugout.desc:Greenstone Archive format.
     1217GreenstoneXMLPlugout.desc:Greenstone XML Archive format.
    12271218
    12281219GreenstoneMETSPlugout.desc:METS format using the Greenstone profile.
Note: See TracChangeset for help on using the changeset viewer.