Ignore:
Timestamp:
2008-06-16T12:51:51+12:00 (16 years ago)
Author:
kjdon
Message:

renamed lots of keys - the ones where there wasn't a simple mapping between old plugin name and new one - eg because the string had been moved to a different plugin. Also reordered them in english version, hopefully this will reorder them in other files once they have been through translator

File:
1 edited

Legend:

Unmodified
Added
Removed
  • gsdl/trunk/perllib/strings.properties

    r15968 r16017  
    4343
    4444common.invalid_options:Invalid arguments: %s
     45
     46common.true:true
     47
     48common.false:false
    4549
    4650#
     
    652656#
    653657
     658AcronymExtractor.adding:adding
     659
     660AcronymExtractor.already_seen:already seen
     661
     662AcronymExtractor.desc:Helper extractor plugin for location and marking up acronyms in text.
     663
     664AcronymExtractor.done_acronym_extract:done extracting acronyms.
     665
     666AcronymExtractor.done_acronym_markup:done acronym markup.
     667
     668AcronymExtractor.extract_acronyms:Extract acronyms from within text and set as metadata.
     669
     670AcronymExtractor.extracting_acronyms:extracting acronyms
     671
     672AcronymExtractor.marking_up_acronyms:marking up acronyms
     673
     674AcronymExtractor.markup_acronyms:Add acronym metadata into document text.
     675
    654676ArchivesInfPlugin.desc:Plugin which recurses through an archives.inf file (i.e. the file generated in the archives directory when an import is done), processing each file it finds.
    655677
    656 BasePlugin.adding:adding
    657 
    658 BasePlugin.already_seen:already seen
    659 
    660 BasePlugin.bad_general_option:The %s plugin uses an incorrect option. Check your collect.cfg configuration file.
     678AutoExtractMetadata.desc: Base plugin that brings together all the extractor functionality from the Extractor plugins.
     679
     680AutoExtractMetadata.extracting:extracting
     681
     682AutoExtractMetadata.first:Comma separated list of first sizes to extract from the text into a metadata field. The field is called 'FirstNNN'.
     683
     684BasePlugin.associate_ext:Causes files with the same root filename as the document being processed by the plugin AND a filename extension from the comma separated list provided by this argument to be associated with the document being processed rather than handled as a separate list.
     685
     686BasePlugin.associate_tail_re:XXX
    661687
    662688BasePlugin.block_exp:Files matching this regular expression will be blocked from being passed to any later plugins in the list. This has no real effect other than to prevent lots of warning messages about input files you don't care about. Each plugin might have a default block_exp. e.g. by default HTMLPlug blocks any files with .gif, .jpg, .jpeg, .png or .css file extensions.
    663689
    664 BasePlugin.associate_ext:Causes files with the same root filename as the document being processed by the plugin AND a filename extension from the comma separated list provided by this argument to be associated with the document being processed rather than handled as a separate list.
    665 
    666 BasePlugin.could_not_extract_encoding:WARNING: encoding could not be extracted from %s - defaulting to %s
    667 
    668 BasePlugin.could_not_extract_language:WARNING: language could not be extracted from %s - defaulting to %s
    669 
    670 BasePlugin.could_not_open_for_reading:could not open %s for reading
     690BasePlugin.desc:Base class for all the import plugins.
     691
     692BasePlugin.dummy_text:This document has no text.
     693
     694BasePlugin.encoding.ascii:Plain 7 bit ascii. This may be a bit faster than using iso_8859_1. Beware of using this when the text may contain characters outside the plain 7 bit ascii set though (e.g. German or French text containing accents), use iso_8859_1 instead.
     695
     696BasePlugin.encoding.unicode:Just unicode.
     697
     698BasePlugin.encoding.utf8:Either utf8 or unicode -- automatically detected.
     699
     700BasePlugin.filename_encoding:The encoding of the source file filenames.
     701
     702BasePlugin.filename_encoding.auto:Automatically detect the encoding of the filename.
    671703
    672704BasePlugin.no_cover_image:Do not look for a prefix.jpg file (where prefix is the same prefix as the file being processed) and associate it as a cover image.
    673705
    674 BasePlugin.default_encoding:Use this encoding if -input_encoding is set to 'auto' and the text categorization algorithm fails to extract the encoding or extracts an encoding unsupported by Greenstone. This option can take the same values as -input_encoding.
    675 
    676 BasePlugin.default_language:If Greenstone fails to work out what language a document is the 'Language' metadata element will be set to this value. The default is 'en' (ISO 639 language symbols are used: en = English). Note that if -input_encoding is not set to 'auto' and -extract_language is not set, all documents will have their 'Language' metadata set to this value.
    677 
    678 BasePlugin.desc:Base class for all the import plugins.
    679 
    680 BasePlugin.done_acronym_extract:done extracting acronyms.
    681 
    682 BasePlugin.done_acronym_markup:done acronym markup.
    683 
    684 BasePlugin.done_email_extract:done extracting e-mail addresses.
    685 
    686 BasePlugin.dummy_text:This document has no text.
    687 
    688 BasePlugin.empty_file:file contains no text
    689 
    690 BasePlugin.extract_acronyms:Extract acronyms from within text and set as metadata.
    691 
    692 BasePlugin.extract_email:Extract email addresses as metadata.
    693 
    694 BasePlugin.extract_historical_years:Extract time-period information from historical documents.  This is stored as metadata with the document. There is a search interface for this metadata, which you can include in your collection by adding the statement, "format QueryInterface DateSearch" to your collection configuration file.
    695 
    696 BasePlugin.extract_language:Identify the language of each document and set 'Language' metadata. Note that this will be done automatically if -input_encoding is 'auto'.
    697 
    698 BasePlugin.extracting:extracting
    699 
    700 BasePlugin.extracting_acronyms:extracting acronyms
    701 
    702 BasePlugin.extract_keyphrases:Extract keyphrases automatically with Kea (default settings).
    703 
    704 BasePlugin.extract_keyphrases_kea4:Extract keyphrases automatically with Kea 4.0 (default settings). Kea 4.0 is a new version of Kea that has been developed for controlled indexing of documents in the domain of agriculture.
    705 
    706 BasePlugin.extract_keyphrase_options:Options for keyphrase extraction with Kea. For example: mALIWEB - use ALIWEB extraction model; n5 - extract 5 keyphrase;, eGBK - use GBK encoding.
    707 
    708 BasePlugin.extracting_emails:extracting e-mail addresses
    709 
    710 BasePlugin.file_has_no_text:ERROR: %s contains no text
    711 
    712 BasePlugin.first:Comma separated list of first sizes to extract from the text into a metadata field. The field is called 'FirstNNN'.
    713 
    714 BasePlugin.input_encoding:The encoding of the source documents. Documents will be converted from these encodings and stored internally as utf8.
    715 BasePlugin.input_encoding.ascii:Plain 7 bit ascii. This may be a bit faster than using iso_8859_1. Beware of using this on a collection of documents that may contain characters outside the plain 7 bit ascii set though (e.g. German or French documents containing accents), use iso_8859_1 instead.
    716 BasePlugin.input_encoding.auto:Use text categorization algorithm to automatically identify the encoding of each source document. This will be slower than explicitly setting the encoding but will work where more than one encoding is used within the same collection.
    717 
    718 BasePlugin.input_encoding.unicode:Just unicode.
    719 
    720 BasePlugin.input_encoding.utf8:Either utf8 or unicode -- automatically detected.
    721 
    722 BasePlugin.keyphrases:keyphrases
    723 
    724 BasePlugin.marking_up_acronyms:marking up acronyms
    725 
    726 BasePlugin.markup_acronyms:Add acronym metadata into document text.
    727 
    728 BasePlugin.maximum_century:The maximum named century to be extracted as historical metadata (e.g. 14 will extract all references up to the 14th century).
    729 
    730 BasePlugin.maximum_year:The maximum historical date to be used as metadata (in a Common Era date, such as 1950).
    731 
    732 BasePlugin.missing_kea:Error: The Kea software could not be found at %s. Please download Kea %s from http://www.nzdl.org/Kea and install it in this directory.
    733 
    734 BasePlugin.must_be_implemented:BasPlug::read function must be implemented in sub-class for recursive plugins
    735 
    736 BasePlugin.no_bibliography:Do not try to block bibliographic dates when extracting historical dates.
    737 
    738706BasePlugin.process_exp:A perl regular expression to match against filenames. Matching filenames will be processed by this plugin. For example, using '(?i).html?\$' matches all documents ending in .htm or .html (case-insensitive).
    739707
    740 BasePlugin.read_denied:Read permission denied for %s
    741 
    742 BasePlugin.separate_cjk:Insert spaces between Chinese/Japanese/Korean characters to make each character a word. Use if text is not segmented.
    743 
    744708BasePlugin.smart_block:Block files in a smarter way than just looking at filenames.
    745709
    746710BasePlugin.stems:stems
    747711
    748 BasePlugin.unsupported_encoding:WARNING: %s appears to be encoded in an unsupported encoding (%s) - using %s
    749 
    750 BasePlugin.wrong_encoding:WARNING: %s was read using %s encoding but appears to be encoded as %s.
    751 
    752712BibTexPlugin.desc:BibTexPlug reads bibliography files in BibTex format. BibTexPlug creates a document object for every reference in the file. It is a subclass of SplitPlug, so if there are multiple records, all are read.
    753713
     
    755715
    756716ConvertBinaryFile.apply_fribidi:Run the "fribidi" Unicode Bidirectional Algorithm program over the converted file (for right-to-left text).
     717
    757718ConvertBinaryFile.convert_to:Plugin converts to TEXT or HTML or various types of Image (e.g. JPEG, GIF, PNG).
     719
    758720ConvertBinaryFile.convert_to.auto:Automatically select the format converted too.  Format chosen depends on input document type, for example Word will automatically be converted to HTML, whereas PowerPoint will be converted to Greenstone's PagedImage format.
     721
    759722ConvertBinaryFile.convert_to.html:HTML format.
     723
    760724ConvertBinaryFile.convert_to.text:Plain text format.
     725
    761726ConvertBinaryFile.convert_to.pagedimg_jpg:JPEG format.
     727
    762728ConvertBinaryFile.convert_to.pagedimg_gif:GIF format.
     729
    763730ConvertBinaryFile.convert_to.pagedimg_png:PNG format.
    764731
    765 ConvertToPlugin.desc:This plugin is inherited by such plugins as WordPlug, PPTPlug, PSPlug, RTFPlug and PDFPlug. It facilitates the conversion of these document types to either HTML, TEXT or a series of images. It works by dynamically loading an appropriate secondary plugin (HTMLPlug, StructuredHTMLPlug, PagedImgPlug or TEXTPlug) based on the plugin argument 'convert_to'.
     732ConvertBinaryFile.desc:This plugin is inherited by such plugins as WordPlugin, PPTPlugin, PSPlugin, RTFPlugin and PDFPlugin. It facilitates the conversion of these document types to either HTML, TEXT or a series of images. It works by dynamically loading an appropriate secondary plugin (HTMLPlugin, StructuredHTMLPlugin, PagedImagePlugin or TextPlugin) based on the plugin argument 'convert_to'.
     733
    766734ConvertBinaryFile.keep_original_filename:Keep the original filename for the associated file, rather than converting to doc.pdf, doc.doc etc.
     735
    767736ConvertBinaryFile.use_strings:If set, a simple strings function will be called to extract text if the conversion utility fails.
    768737
    769 ConvertToRogPlugin.desc:A plugin that inherits from RogPlug.
     738ConvertToRogPlugin.desc:A plugin that inherits from RogPlugin.
    770739
    771740CSVPlugin.desc:A plugin for files in comma-separated value format. A new document will be created for each line of the file.
    772741
     742DateExtractor.desc:Helper extractor plugin for extracting historical date information from text.
     743
     744DateExtractor.extract_historical_years:Extract time-period information from historical documents.  This is stored as metadata with the document. There is a search interface for this metadata, which you can include in your collection by adding the statement, "format QueryInterface DateSearch" to your collection configuration file.
     745
     746DateExtractor.maximum_century:The maximum named century to be extracted as historical metadata (e.g. 14 will extract all references up to the 14th century).
     747
     748DateExtractor.maximum_year:The maximum historical date to be used as metadata (in a Common Era date, such as 1950).
     749
     750DateExtractor.no_bibliography:Do not try to block bibliographic dates when extracting historical dates.
     751
     752DirectoryPlugin.desc:A plugin which recurses through directories processing each file it finds.
     753
     754DirectoryPlugin.recheck_directories:After the files in an import directory have been processed, re-read the directory to discover any new files created.
     755
     756DirectoryPlugin.use_metadata_files:(DEPRECATED - Add MetadataXMLPlugin to the list of plugins instead) Read metadata from metadata XML files.
     757
    773758DBPlugin.desc:A plugin that imports records from a database. This uses perl's DBI module, which includes back-ends for mysql, postgresql, comma separated values (CSV), MS Excel, ODBC, sybase, etc... Extra modules may need to be installed to use this. See <GSDLHOME>/etc/packages/example.dbi for an example config file.
    774759
    775 DBPlugin.title_sub:Substitution expression to modify string stored as Title. Used by, for example, PSPlug to remove "Page 1" etc from text used as the title.
    776 
    777 DSpacePlugin.desc:DSpacePlug takes a collection of documents exported from DSpace and imports them into Greenstone.
     760DSpacePlugin.desc:A plugin that takes a collection of documents exported from DSpace and imports them into Greenstone.
    778761   
    779762DSpacePlugin.first_inorder_ext: This is used to identify the primary stream of DSpace collection document. With this option, the system will treat the defined ext types of document in sequence to look for the possible primary stream.
     763
    780764DSpacePlugin.first_inorder_mime:This is used to identify the primary data stream of DSpace collection document.With this option, the system will treat the defined mime types of document in sequence to look for the possible primary stream.
     765
    781766DSpacePlugin.only_first_doc:This is used to identify the primary data stream of DSpace collection document.With this option, the system will treat the first document in dublic_core metadata file as the possible primary stream.
    782767
    783 EmailPlugin.desc:Email plug reads email files.  These are named with a simple number (i.e. as they appear in maildir folders) or with the extension .mbx (for mbox mail file format).\nDocument text: The document text consists of all the text after the first blank line in the document.\nMetadata (not Dublin Core!):\n\t\$Headers      All the header content (optional, not stored by default)\n\t\$Subject      Subject: header\n\t\$To           To: header\n\t\$From         From: header\n\t\$FromName     Name of sender (where available)\n\t\$FromAddr     E-mail address of sender\n\t\$DateText     Date: header\n\t\$Date         Date: header in GSDL format (eg: 19990924)
     768EmailAddressExtractor.desc:Helper extractor plugin for discovering email addresses in text.
     769
     770EmailAddressExtractor.done_email_extract:done extracting e-mail addresses.
     771
     772EmailAddressExtractor.extracting_emails:extracting e-mail addresses
     773
     774EmailAddressExtractor.extract_email:Extract email addresses as metadata.
     775
     776EmailPlugin.desc:A plugin that reads email files.  These are named with a simple number (i.e. as they appear in maildir folders) or with the extension .mbx (for mbox mail file format).\nDocument text: The document text consists of all the text after the first blank line in the document.\nMetadata (not Dublin Core!):\n\t\$Headers      All the header content (optional, not stored by default)\n\t\$Subject      Subject: header\n\t\$To           To: header\n\t\$From         From: header\n\t\$FromName     Name of sender (where available)\n\t\$FromAddr     E-mail address of sender\n\t\$DateText     Date: header\n\t\$Date         Date: header in GSDL format (eg: 19990924)
    784777
    785778EmailPlugin.no_attachments:Do not save message attachments.
     
    795788GAPlugin.desc:Processes Greenstone Archive XML documents. Note that this plugin does no syntax checking (though the XML::Parser module tests for well-formedness). It's assumed that the Greenstone Archive files conform to their DTD.
    796789
     790GISExtractor.desc:Helper extractor plugin for extracting placenames from text. Requires GIS extension to Greenstone.
     791
    797792GISExtractor.extract_placenames:Extract placenames from within text and set as metadata. Requires GIS extension to Greenstone.
    798793
     
    803798GMLPlugin.desc:Plugin which processes a GML format document. It assumes that gml tags are all in lower-case.
    804799
    805 HBPlugin.desc:Plugin which processes an HTML book directory. This plugin is used by the Humanity Library collections and does not handle input encodings other than ascii or extended ascii. This code is kind of ugly and could no doubt be made to run faster, by leaving it in this state I hope to encourage people to make their collections use HBSPlug instead ;-)\n\nUse HBSPlug if creating a new collection and marking up files like the Humanity Library collections. HBSPlug accepts all input encodings but expects the marked up files to be cleaner than those used by the Humanity Library collections
     800HBPlugin.desc:Plugin which processes an HTML book directory. This plugin is used by the Humanity Library collections and does not handle input encodings other than ascii or extended ascii. This code is kind of ugly and could no doubt be made to run faster, by leaving it in this state I hope to encourage people to make their collections use BookPlugin instead ;-)\n\nUse BookPlugin if creating a new collection and marking up files like the Humanity Library collections. BookPlugin accepts all input encodings but expects the marked up files to be cleaner than those used by the Humanity Library collections
    806801
    807802HTMLPlugin.assoc_files:Perl regular expression of file extensions to associate with html documents.
     
    839834HTMLPlugin.old_style_HDL:To mark whether the file in this collection used the old HDL document's tags style.
    840835
     836ImageConverter.converttotype:Convert main image to format 's'.
     837
     838ImageConverter.create_screenview:If set to true, create a screen sized image, and set Screen, ScreenType, screenicon, ScreenWidth, ScreenHeight metadata.
     839
     840ImageConverter.create_thumbnail:If set to true, create a thumbnail version of each image, and add Thumb, ThumbType, thumbicon, ThumbWidth, ThumbHeight metadata.
     841ImageConverter.desc:Helper plugin for image conversion using ImageMagick.
     842
     843ImageConverter.imagemagicknotinstalled: Image Magick not installed
     844
     845ImageConverter.minimumsize:Ignore images smaller than n bytes.
     846
    841847ImageConverter.noconversionavailable:Image conversion not available
    842 ImageConverter.imagemagicknotinstalled: Image Magick not installed
     848
     849ImageConverter.noscaleup:Don't scale up small images when making thumbnails.
     850
     851ImageConverter.screenviewsize:Make screenview images of size nxn.
     852
     853ImageConverter.screenviewtype:Make screenview images in format 's'.
     854
     855ImageConverter.thumbnailsize:Make thumbnails of size nxn.
     856
     857ImageConverter.thumbnailtype:Make thumbnails in format 's'.
     858
    843859ImageConverter.win95notsupported: Image Magick not supported on Win95/98
    844 ImagePlugin.converttotype:Convert main image to format 's'.
    845860
    846861ImagePlugin.desc:This plugin processes images, adding basic metadata
    847862
    848 ImagePlugin.minimumsize:Ignore images smaller than n bytes.
    849 
    850 ImagePlugin.noscaleup:Don't scale up small images when making thumbnails.
    851 
    852 ImagePlugin.generatescreenview:Used to control whether screeen-view resolution images are generated or not.
    853 
    854 ImagePlugin.screenviewsize:If set, makes an image of size n for screen display and sets Screen, ScreenSize, ScreenWidth and ScreenHeight metadata.  By default it is not set.
    855 
    856 ImagePlugin.screenviewtype:If -screenviewsize is set, this sets the screen display image type.
    857 
    858 ImagePlugin.generatethumbnail:Used to control whether thumbnails are generated or not.
    859 
    860 ImagePlugin.thumbnailsize:Make thumbnails of size nxn.
    861 
    862 ImagePlugin.thumbnailtype:Make thumbnails in format 's'.
    863 
    864863IndexPlugin.desc:This recursive plugin processes an index.txt file. The index.txt file should contain the list of files to be included in the collection followed by any extra metadata to be associated with each file.\n\nThe index.txt file should be formatted as follows: The first line may be a key (beginning with key:) to name the metadata fields (e.g. key: Subject Organization Date). The following lines will contain a filename followed by the value that metadata entry is to be set to. (e.g. 'irma/iw097e 3.2 unesco 1993' will associate the metadata Subject=3.2, Organization=unesco, and Date=1993 with the file irma/iw097e if the above key line was used)\n\nNote that if any of the metadata fields use the Hierarchy classifier plugin then the value they're set to should correspond to the first field (the descriptor) in the appropriate classification file.\n\nMetadata values may be named separately using a tag (e.g. <Subject>3.2) and this will override any name given to them by the key line. If there's no key line any unnamed metadata value will be named 'Subject'.
    865864
     
    870869ISISPlugin.entry_separator:The string used to separate multiple values for single metadata fields in CDS/ISIS database records.
    871870
     871KeyphraseExtractor.desc:Helper extractor plugin for generating keyphrases from text. Uses Kea keyphrase extraction system.
     872
     873KeyphraseExtractor.extract_keyphrases:Extract keyphrases automatically with Kea (default settings).
     874
     875KeyphraseExtractor.extract_keyphrases_kea4:Extract keyphrases automatically with Kea 4.0 (default settings). Kea 4.0 is a new version of Kea that has been developed for controlled indexing of documents in the domain of agriculture.
     876
     877KeyphraseExtractor.extract_keyphrase_options:Options for keyphrase extraction with Kea. For example: mALIWEB - use ALIWEB extraction model; n5 - extract 5 keyphrase;, eGBK - use GBK encoding.
     878
     879KeyphraseExtractor.keyphrases:keyphrases
     880
     881KeyphraseExtractor.missing_kea:Error: The Kea software could not be found at %s. Please download Kea %s from http://www.nzdl.org/Kea and install it in this directory.
     882
    872883LaTeXPlugin.desc:Plugin for LaTeX documents.
    873884
     
    875886
    876887LOMPlugin.root_tag:The DocType of the XML file (or a regular expression that matches the root element).
     888
    877889LOMPlugin.check_timestamp:Check timestamps of previously downloaded files, and only download again if source file is newer.
     890
    878891LOMPlugin.download_srcdocs:Download the source document if one is specified (in general^identifier^entry or technical^location). This option should specify a regular expression to match filenames against before downloading. Note, this currently doesn't work for documents outside a firewall.
    879892
     
    900913MediaWikiPlugin.delete_searchbox:Delete the searchbox section. Needs to specify a Perl regular expression in searchbox_div_exp below.
    901914
    902 MediaWikiPlugin.searchbox_div_id:A Perl regular expression to match the searchbox section. The default value matches common MediaWiki web pages.
     915MediaWikiPlugin.searchbox_div_exp:A Perl regular expression to match the searchbox section. The default value matches common MediaWiki web pages.
    903916
    904917MediaWikiPlugin.remove_title_suffix_exp:A Perl regular expression to trim the extracted title. For example, \\s-(.+) will trim title contents after "-".
     
    910923METSPlugin.desc:Process Greenstone-style METS documents
    911924
    912 GISExtractor.desc: On-the-side base class to BasPlug that supports GIS capabilities
     925MP3Plugin.desc:Plugin for processing MP3 files.
     926
     927MP3Plugin.assoc_images:Use Google image search to locate images related to MP3 file based on ID3 Title and Artist metadata.
     928
     929MP3Plugin.applet_metadata:Used to store [applet] metadata for each document that contains the necessary HTML for an MP3 audio player applet to play that file.
     930
     931MP3Plugin.metadata_fields:Comma separated list of metadata fields to extract (assuming present) in an MP3 file. Use \"*\" to extract all the fields.
    913932
    914933NulPlugin.desc:Dummy (.nul) file plugin. Used with the files produced by exploding metadata database files.
     
    930949
    931950PagedImagePlugin.documenttype:Set the document type (used for display)
     951
    932952PagedImagePlugin.documenttype.paged:Paged documents have next and previous arrows and a 'go to page X' box
     953
    933954PagedImagePlugin.documenttype.hierarchy:Hierarchical documents have a table of contents
    934955
    935956PagedImagePlugin.headerpage:Add a top level header page (that contains no image) to each document.
    936957
    937 PagedImagePlugin.screenview:Produce a screenview image for each image, and set Screen, ScreenSize, ScreenWidth and ScreenHeight metadata.
    938 
    939 PagedImagePlugin.screenviewsize:Make screenview images of size nxn.
    940 
    941 PagedImagePlugin.screenviewtype:Make screenview images in format 's'.
    942 
    943 PagedImagePlugin.thumbnail:Produce a thumbnail for each image
    944 
    945958PDFPlugin.allowimagesonly:Allow PDF files with no extractable text. Avoids the need to have -complex set. Only useful with convert_to html.
     959
    946960PDFPlugin.complex:Create more complex output. With this option set the output html will look much more like the original PDF file. For this to function properly you Ghostscript installed (for *nix gs should be on your path while for windows you must have gswin32c.exe on your path).
    947961
     
    960974PPTPlugin.windows_scripting:Use MicroSoft Windows scripting technology (Visual Basic for Applications) to get PPT to convert document to various image types (e.g. JPEG,PNG,GIF) rather than rely on the open source package ppttohtml.
    961975
     976PrintInfo.bad_general_option:The %s plugin uses an incorrect option. Check your collect.cfg configuration file.
     977
    962978ProCitePlugin.desc:A plugin for (exported) ProCite databases
    963979
     
    970986PSPlugin.extract_title:Extract title from PS header.
    971987
     988ReadTextFile.could_not_extract_encoding:WARNING: encoding could not be extracted from %s - defaulting to %s
     989
     990ReadTextFile.could_not_extract_language:WARNING: language could not be extracted from %s - defaulting to %s
     991
     992ReadTextFile.could_not_open_for_reading:could not open %s for reading
     993
     994ReadTextFile.default_encoding:Use this encoding if -input_encoding is set to 'auto' and the text categorization algorithm fails to extract the encoding or extracts an encoding unsupported by Greenstone. This option can take the same values as -input_encoding.
     995
     996ReadTextFile.default_language:If Greenstone fails to work out what language a document is the 'Language' metadata element will be set to this value. The default is 'en' (ISO 639 language symbols are used: en = English). Note that if -input_encoding is not set to 'auto' and -extract_language is not set, all documents will have their 'Language' metadata set to this value.
     997
     998ReadTextFile.desc:Base plugin for files that are plain text.
     999
     1000ReadTextFile.empty_file:file contains no text
     1001
     1002ReadTextFile.extract_language:Identify the language of each document and set 'Language' metadata. Note that this will be done automatically if -input_encoding is 'auto'.
     1003
     1004ReadTextFile.file_has_no_text:ERROR: %s contains no text
     1005
     1006ReadTextFile.input_encoding:The encoding of the source documents. Documents will be converted from these encodings and stored internally as utf8.
     1007
     1008ReadTextFile.input_encoding.auto:Use text categorization algorithm to automatically identify the encoding of each source document. This will be slower than explicitly setting the encoding but will work where more than one encoding is used within the same collection.
     1009
     1010ReadTextFile.read_denied:Read permission denied for %s
     1011
     1012ReadTextFile.separate_cjk:Insert spaces between Chinese/Japanese/Korean characters to make each character a word. Use if text is not segmented.
     1013
     1014ReadTextFile.unsupported_encoding:WARNING: %s appears to be encoded in an unsupported encoding (%s) - using %s
     1015
     1016ReadTextFile.wrong_encoding:WARNING: %s was read using %s encoding but appears to be encoded as %s.
     1017
     1018ReadXMLFile.desc:Base class for XML plugins.
     1019
     1020ReadXMLFile.xslt:Transform a matching input document with the XSLT in the named file.  A relative filename is assumed to be in the collection's file area, for instance etc/mods2dc.xsl.
     1021
    9721022RealMediaPlugin.desc:A plugin for processing Real Media files.
    9731023
    974 DirectoryPlugin.desc:RecPlug is a plugin which recurses through directories processing each file it finds.
    975 
    976 DirectoryPlugin.recheck_directories:After the files in an import directory have been processed, re-read the directory to discover any new files created.
    977 
    978 DirectoryPlugin.use_metadata_files:(DEPRECATED - Add MetadataXMLPlug to the list of plugins instead) Read metadata from metadata XML files.
    979 
    9801024ReferPlugin.desc:ReferPlug reads bibliography files in Refer format.
    9811025
    982 ReferPlugin.longdesc:ReferPlug reads bibliography files in Refer format.\nBy Gordon W. Paynter (gwp\@cs.waikato.ac.nz), November 2000\n\nLoosely based on hcibib2Plug by Steve Jones (stevej\@cs.waikato.ac.nz). Which was based on EMAILPlug by Gordon Paynter (gwp\@cs.waikato.ac.nz). Which was based on old versions of HTMLplug and HCIBIBPlugby by Stefan Boddie and others -- it's hard to tell what came from where, now.\n\nReferPlug creates a document object for every reference in the file. It is a subclass of SplitPlug, so if there are multiple records, all are read.\n\nDocument text:\n\tThe document text consists of the reference in Refer format.\nMetadata:\n\t\$Creator  \%A Author name\n\t\$Title      \%T Title of article of book\n\t\$Journal   \%J Title of Journal\n\t\$Booktitle \%B Title of book containing the publication\n\t\$Report        \%R Type of Report, paper or thesis\n\t\$Volume     \%V Volume Number of Journal\n\t\$Number        \%N Number of Journal within Volume\n\t\$Editor     \%E Editor name\n\t\$Pages      \%P Page Number of article\n\t\$Publisher   \%I Name of Publisher\n\t\$Publisheraddr    \%C Publisher's address\n\t\$Date       \%D Date of publication\n\t\$Keywords   \%K Keywords associated with publication\n\t\$Abstract  \%X Abstract of publication\n\t\$Copyright\t\%* Copyright information for the article
    983 
    9841026RogPlugin.desc:Creates simple single-level documents from .rog or .mdb files.
    9851027
     
    9901032SourceCodePlugin.remove_prefix:Remove this leading pattern from the filename (eg -remove_prefix /tmp/XX/src/). The default is to remove the whole path from the filename.
    9911033
    992 SplitPlugin.desc:SplitPlug is a plugin for splitting input files into segments that will then be individually processed. This plugin should not be called directly.  Instead, if you need to process input files that contain several documents, you should write a plugin with a process function that will handle one of those documents and have it inherit from SplitPlug.  See ReferPlug for an example.
    993 
    994 SplitPlugin.split_exp:A perl regular expression to split input files into segments.
     1034SplitTextFile.desc:SplitPlug is a plugin for splitting input files into segments that will then be individually processed. This plugin should not be called directly.  Instead, if you need to process input files that contain several documents, you should write a plugin with a process function that will handle one of those documents and have it inherit from SplitPlug.  See ReferPlug for an example.
     1035
     1036SplitTextFile.split_exp:A perl regular expression to split input files into segments.
    9951037
    9961038StructuredHTMLPlugin.desc:A plugin to process structured HTML documents, splitting them into sections based on style information.
     
    10231065
    10241066UnknownPlugin.srcicon:Specify a macro name (without underscores) to use as srcicon metadata.
    1025 
    1026 MP3Plugin.desc:Plugin for processing MP3 files.
    1027 
    1028 MP3Plugin.assoc_images:Use Google image search to locate images related to MP3 file based on ID3 Title and Artist metadata.
    1029 
    1030 MP3Plugin.applet_metadata:Used to store [applet] metadata for each document that contains the necessary HTML for an MP3 audio player applet to play that file.
    1031 
    1032 MP3Plugin.metadata_fields:Comma separated list of metadata fields to extract (assuming present) in an MP3 file. Use \"*\" to extract all the fields.
    10331067
    10341068W3ImagePlugin.aggressiveness:Range of related text extraction techniques to use.
     
    10791113WordPlugin.metadata_fields: This is to retrieve metadata from the HTML document converted by VB scripting. It allows users to define comma separated list of metadata fields to attempt to extract. Use 'tag<tagname>' to have the contents of the first <tagname> pair put in a metadata element called 'tagname'. Capitalise this as you want the metadata capitalised in Greenstone, since the tag extraction is case insensitive
    10801114
    1081 ReadXMLFile.desc:Base class for XML plugins.
    1082 
    1083 ReadXMLFile.xslt:Transform a matching input document with the XSLT in the named file.  A relative filename is assumed to be in the collection's file area, for instance etc/mods2dc.xsl.
    10841115ZIPPlugin.desc:Plugin which handles compressed and/or archived input formats currently handled formats and file extensions are:\ngzip (.gz, .z, .tgz, .taz)\nbzip (.bz)\nbzip2 (.bz2)\nzip (.zip .jar)\ntar (.tar)\n\nThis plugin relies on the following utilities being present (if trying to process the corresponding formats):\ngunzip (for gzip)\nbunzip (for bzip)\nbunzip2 \nunzip (for zip)\ntar (for tar)
    10851116
Note: See TracChangeset for help on using the changeset viewer.