source: main/trunk/greenstone2/perllib/plugins

Diff Rev Age Author Log Message
(edit) @23484   13 years ak19 Further improvements by Dr Bainbridge to pretty-printing.
(edit) @23472   13 years ak19 Erroneous fourth argument (a filename) left over from an earlier time, …
(edit) @23465   13 years ak19 Dr Bainbridge fixed the change to gs.filenameEncoding (previously: …
(edit) @23463   13 years ak19 Previously, when reverting back to an earlier RE match for …
(edit) @23461   13 years kjdon set_Source_metadata now takes an optional section argument so that we …
(edit) @23460   13 years kjdon pass in the section to set_Source_metadata as we may be processing a …
(edit) @23458   13 years kjdon added a check that deduced_filename_encoding is defined before testing …
(edit) @23457   13 years kjdon reindented the file in emacs
(edit) @23452   13 years kjdon use filename_cat when generating the full path for blocking. If the …
(edit) @23419   13 years max Setting the values to store as block files is now done through an API …
(edit) @23418   13 years davidb A few further additions to help windows keep track of c\... and C:\... …
(edit) @23415   13 years davidb More careful handling of filenames going into 'block' hash. On …
(edit) @23392   13 years sjm84 Reverted a regular expression designed to locate various tags inside …
(edit) @23387   13 years davidb Further changes to deal with documents that use different filename …
(edit) @23377   13 years ak19 Perl syntax error fixed: referring to uninitialised variable metadata …
(edit) @23371   13 years davidb Further refinement of code to support HTML linking between documents …
(edit) @23364   13 years sjm84 C and Posix added to the possible locales as well as removing the …
(edit) @23363   13 years davidb Plugin code upgrade to support Greenstone working with filenames under …
(edit) @23355   13 years kjdon use unicode for mp3 data. patch thanks to Dan Wright
(edit) @23353   13 years davidb Modifications to code to support filename encoding issues when tested …
(edit) @23352   13 years davidb Modifications to code to support filename encoding issues when tested …
(edit) @23349   13 years davidb More careful use of encoding parameter to $self->set_Source_metadata …
(edit) @23348   13 years davidb Added extra parameter to call to deduce_filename_encoding()
(edit) @23347   13 years davidb Tidy up of debugging statements for handling filename encodings, plus …
(edit) @23335   13 years davidb Work done on improving handling of filenames when the actual filename …
(edit) @23280   13 years kjdon fixed this plugin up for incremental import. need to set …
(edit) @23279   13 years kjdon in extra_metadata, new special case for gsdlzipfilename metadata - if …
(edit) @23277   13 years kjdon removed a commented out line
(edit) @23261   13 years kjdon ZIPPlugin needs to do a block pass on the extracted folder so we don't …
(edit) @23248   14 years ak19 Bugfix: file called mimetype (among the files extracted from an Open …
(edit) @23212   14 years kjdon metadata_read no longer takes maxdocs args - metadata_read must …
(edit) @23171   14 years kjdon if infodbtype is gdbm-txtgz, we need to use gdbm for all archives dbs
(edit) @23167   14 years davidb GreenstoneXMLPlugin used to (or at least in theory used to) be able …
(edit) @22953   14 years davidb Further code tweaks to correctly support Unicode aware strings in our …
(edit) @22951   14 years davidb Encode::decode cannot be applied to all characters returned by …
(edit) @22900   14 years kjdon getting this to work properly
(edit) @22896   14 years kjdon fixed an odd bug. If had a metadata file directly in import folder, …
(edit) @22894   14 years kjdon added wpd (word perfect) extension into the list that can be processed …
(edit) @22887   14 years kjdon use new util::get_timestamped_dir, and clean_up_after_doc_processing …
(edit) @22882   14 years kjdon set up convert_to list for the case when windows_scripting and …
(edit) @22880   14 years kjdon implemented the read method for when using open office to convert to …
(edit) @22879   14 years kjdon now have an html_multi option to convert_to (for PowerPointPlugin)
(edit) @22874   14 years kjdon no longer use filename_extension, as we should be using the original …
(edit) @22871   14 years kjdon added code to generate an item file if asked for pagedimg output with …
(edit) @22865   14 years kjdon forgot to set openoffice_available so that get_default_process_exp works
(edit) @22864   14 years kjdon needed use ConvertBinaryFile
(edit) @22862   14 years kjdon changed a comment
(edit) @22861   14 years kjdon now uses new AutoLoadConverters instead of AutoloadConverterScripting. …
(edit) @22860   14 years kjdon changed a line
(edit) @22859   14 years kjdon this plugin inherits from others
(edit) @22858   14 years kjdon I have written a new version of AutoloadConverterScripting, called …
(edit) @22857   14 years davidb Further adjustments to our reading in of text files/data to be Unicode …
(edit) @22853   14 years kjdon print parse errors to failhandle and GLI xml as well as to outhandle
(edit) @22852   14 years kjdon now prints errors to outhandle, failhandle and gli xml instead of just …
(edit) @22844   14 years davidb More explicit use of utf8 for input and output file handling. Relies …
(edit) @22842   14 years davidb Minor tidy up of code
(edit) @22841   14 years davidb More explicit use of utf8 for input and output file handling. Relies …
(edit) @22840   14 years davidb More explicit use of utf8 for input and output file handling. Relies …
(edit) @22814   14 years kjdon removes tidy_item_file from store_block_files as it makes the file new …
(edit) @22709   14 years davidb Fixed up -process_exp so it now dynamically configures itself …
(edit) @22705   14 years davidb Use of AutoloadConverterScripting expanded to encompass PowerPoint
(edit) @22702   14 years davidb Introduction of new plugin AutoloadConverterScripting to replace …
(edit) @22689   14 years mdewsnip Trac ticket #634: change so "ftp://" is used instead of "http://" in …
(edit) @22675   14 years sjm84 Modified PDFPlugin to use PDFBox if it is available
(edit) @22674   14 years sjm84 Added a version of ConvertBinaryFile for PDFBox
(edit) @22673   14 years sjm84 Dr. Bainbridge added a begin method to OOConvertBinaryFile
(edit) @22666   14 years davidb Commented out debugging statement
(edit) @22664   14 years mdewsnip Minor comment change.
(edit) @22663   14 years mdewsnip Changed "srclink_file" metadata to always contain the filename, …
(edit) @22658   14 years mdewsnip Changed "srcicon" values in ImageConverter.pm and ImagePlugin.pm to …
(edit) @22656   14 years mdewsnip Changed to add "srclink_file" metadata instead of the deprecated …
(edit) @22655   14 years mdewsnip Removed some old (commented out) "[srclink]" code, as part of tidying …
(edit) @22654   14 years mdewsnip Removed some old (commented out) "[srclink]" code, as part of tidying …
(edit) @22652   14 years mdewsnip Removed call to ghtml::dmsafe() from …
(edit) @22641   14 years kjdon now inherits from OOConvertBinaryFile. still a couple of things to iron out
(edit) @22640   14 years kjdon now uses new OOConvertBinaryFile super class
(edit) @22639   14 years kjdon now uses new OOConvertBinaryFile as super class
(edit) @22638   14 years kjdon new ConvertBinaryFile plugin that will include OpenOfficeConverter if …
(edit) @22636   14 years davidb Using -utf8 as options to html-tidy leads to wrong encoding for HTML …
(edit) @22632   14 years mdewsnip Changed "use textcat" to a "require textcat", so it is only loaded if …
(edit) @22612   14 years kjdon made the default process exp a bit nicer to read
(edit) @22611   14 years kjdon now uses OpenOfficeConverter that is not ConvertBinaryFile
(edit) @22597   14 years kjdon code tidy up. rearranged how convertbinaryfile plugins set up their …
(edit) @22594   14 years kjdon committing for davidb. pipe output of tidy to /dev/null if verbosity is low
(edit) @22565   14 years kjdon removed block exp. now it scans the item file to work out which files …
(edit) @22552   14 years kjdon by default we want this to process all files, so changed default …
(edit) @22515   14 years kjdon added open office support for PowerPoint and Excel plugins. followed …
(edit) @22514   14 years kjdon small tidyings
(edit) @22507   14 years kjdon some moving around and tidying up of code
(edit) @22505   14 years kjdon added the openoffice_scripting arg here instead of in OpenOfficeConverter
(edit) @22504   14 years kjdon for WordPlugin, if openoffice_scripting is set, need to use …
(edit) @22503   14 years kjdon StructuredHTMLPlugin needs -description tags if office_scripting is …
(edit) @22462   14 years ak19 Commented out useful (but not needed in svn committed version) print …
(edit) @22451   14 years kjdon added metadata_field_separator option. if set eg to ;, will split all …
(edit) @22450   14 years kjdon missing argument to autorun_general_cmd. oh the trouble that caused, …
(edit) @22448   14 years kjdon metadata values might be array type - add each individual item as a …
(edit) @22431   14 years davidb Correction to caching technique to work with input file rather than …
(edit) @22428   14 years davidb Restructuring of WordPlugin to dynamically inherit from …
(edit) @22427   14 years davidb Adjustment of whitespace
(edit) @22412   14 years davidb More accurate comment added