source: main/trunk/greenstone2/perllib/plugins

Diff Rev Age Author Log Message
(edit) @23261   14 years kjdon ZIPPlugin needs to do a block pass on the extracted folder so we don't …
(edit) @23248   14 years ak19 Bugfix: file called mimetype (among the files extracted from an Open …
(edit) @23212   14 years kjdon metadata_read no longer takes maxdocs args - metadata_read must …
(edit) @23171   14 years kjdon if infodbtype is gdbm-txtgz, we need to use gdbm for all archives dbs
(edit) @23167   14 years davidb GreenstoneXMLPlugin used to (or at least in theory used to) be able …
(edit) @22953   14 years davidb Further code tweaks to correctly support Unicode aware strings in our …
(edit) @22951   14 years davidb Encode::decode cannot be applied to all characters returned by …
(edit) @22900   14 years kjdon getting this to work properly
(edit) @22896   14 years kjdon fixed an odd bug. If there was a metadata file directly in import folder, …
(edit) @22894   14 years kjdon added wpd (word perfect) extension into the list that can be processed …
(edit) @22887   14 years kjdon use new util::get_timestamped_dir, and clean_up_after_doc_processing …
(edit) @22882   14 years kjdon set up convert_to list for the case when windows_scripting and …
(edit) @22880   14 years kjdon implemented the read method for when using open office to convert to …
(edit) @22879   14 years kjdon now have an html_multi option to convert_to (for PowerPointPlugin)
(edit) @22874   14 years kjdon no longer use filename_extension, as we should be using the original …
(edit) @22871   14 years kjdon added code to generate an item file if asked for pagedimg output with …
(edit) @22865   14 years kjdon forgot to set openoffice_available so that get_default_process_exp works
(edit) @22864   14 years kjdon needed use ConvertBinaryFile
(edit) @22862   14 years kjdon changed a comment
(edit) @22861   14 years kjdon now uses new AutoLoadConverters instead of AutoloadConverterScripting. …
(edit) @22860   14 years kjdon changed a line
(edit) @22859   14 years kjdon this plugin inherits from others
(edit) @22858   14 years kjdon I have written a new version of AutoloadConverterScripting, called …
(edit) @22857   14 years davidb Further adjustments to our reading in of text files/data to be Unicode …
(edit) @22853   14 years kjdon print parse errors to failhandle and GLI xml as well as to outhandle
(edit) @22852   14 years kjdon now prints errors to outhandle, failhandle and gli xml instead of just …
(edit) @22844   14 years davidb More explicit use of utf8 for input and output file handling. Relies …
(edit) @22842   14 years davidb Minor tidy up of code
(edit) @22841   14 years davidb More explicit use of utf8 for input and output file handling. Relies …
(edit) @22840   14 years davidb More explicit use of utf8 for input and output file handling. Relies …
(edit) @22814   14 years kjdon removes tidy_item_file from store_block_files as it makes the file new …
(edit) @22709   14 years davidb Fixed up -process_exp so it now dynamically configures itself …
(edit) @22705   14 years davidb Use of AutoloadConverterScripting expanded to encompass PowerPoint
(edit) @22702   14 years davidb Introduction of new plugin AutoloadConverterScripting to replace …
(edit) @22689   14 years mdewsnip Trac ticket #634: change so "ftp://" is used instead of "http://" in …
(edit) @22675   14 years sjm84 Modified PDFPlugin to use PDFBox if it is available
(edit) @22674   14 years sjm84 Added a version of ConvertBinaryFile for PDFBox
(edit) @22673   14 years sjm84 Dr. Bainbridge added a begin method to OOConvertBinaryFile
(edit) @22666   14 years davidb Commented out debugging statement
(edit) @22664   14 years mdewsnip Minor comment change.
(edit) @22663   14 years mdewsnip Changed "srclink_file" metadata to always contain the filename, …
(edit) @22658   14 years mdewsnip Changed "srcicon" values in ImageConverter.pm and ImagePlugin.pm to …
(edit) @22656   14 years mdewsnip Changed to add "srclink_file" metadata instead of the deprecated …
(edit) @22655   14 years mdewsnip Removed some old (commented out) "[srclink]" code, as part of tidying …
(edit) @22654   14 years mdewsnip Removed some old (commented out) "[srclink]" code, as part of tidying …
(edit) @22652   14 years mdewsnip Removed call to ghtml::dmsafe() from …
(edit) @22641   14 years kjdon now inherits from OOConvertBinaryFile. still a couple of things to iron out
(edit) @22640   14 years kjdon now uses new OOConvertBinaryFile super class
(edit) @22639   14 years kjdon now uses new OOConvertBinaryFile as super class
(edit) @22638   14 years kjdon new ConvertBinaryFile plugin that will include OpenOfficeConverter if …
(edit) @22636   14 years davidb Using -utf8 as options to html-tidy leads to wrong encoding for HTML …
(edit) @22632   14 years mdewsnip Changed "use textcat" to a "require textcat", so it is only loaded if …
(edit) @22612   14 years kjdon made the default process exp a bit nicer to read
(edit) @22611   14 years kjdon now uses OpenOfficeConverter that is not ConvertBinaryFile
(edit) @22597   14 years kjdon code tidy up. rearranged how convertbinaryfile plugins set up their …
(edit) @22594   14 years kjdon committing for davidb. pipe output of tidy to /dev/null if verbosity is low
(edit) @22565   14 years kjdon removed block exp. now it scans the item file to work out which files …
(edit) @22552   14 years kjdon by default we want this to process all files, so changed default …
(edit) @22515   14 years kjdon added open office support for PowerPoint and Excel plugins. followed …
(edit) @22514   14 years kjdon small tidyings
(edit) @22507   14 years kjdon some moving around and tidying up of code
(edit) @22505   14 years kjdon added the openoffice_scripting arg here instead of in OpenOfficeConverter
(edit) @22504   14 years kjdon for WordPlugin, if openoffice_scripting is set, need to use …
(edit) @22503   14 years kjdon StructuredHTMLPlugin needs -description tags if office_scripting is …
(edit) @22462   14 years ak19 Commented out useful (but not needed in svn committed version) print …
(edit) @22451   14 years kjdon added metadata_field_separator option. if set eg to ;, will split all …
(edit) @22450   14 years kjdon missing argument to autorun_general_cmd. oh the trouble that caused, …
(edit) @22448   14 years kjdon metadata values might be array type - add each individual item as a …
(edit) @22431   14 years davidb Correction to caching technique to work with input file rather than …
(edit) @22428   14 years davidb Restructuring of WordPlugin to dynamically inherit from …
(edit) @22427   14 years davidb Adjustment of whitespace
(edit) @22412   14 years davidb More accurate comment added
(edit) @22401   14 years kjdon tidied up a bit. Removed options that (I think) will always be set the …
(edit) @22364   14 years ak19 MediainfoOGVPlugin now includes the final changes Arnaud made to his …
(edit) @22363   14 years ak19 Adding in the adjustments to the mediainfoogv plugin that were mailed …
(edit) @22362   14 years ak19 Committing Arnaud Yvan's MediainfoOGVPlugin.pm for the next release of …
(edit) @22355   14 years kjdon previously, when use_realistic_book was set, all files listed in …
(edit) @22351   14 years davidb White-space tidy up
(edit) @22349   14 years kjdon if metadata extracted from item file has a namespace, then prefix it …
(edit) @22348   14 years kjdon store any extracted metadata that has a namespace as ex.ns.meta
(edit) @22330   14 years kjdon we want to store the original file name not the tidy filename as the …
(edit) @22329   14 years kjdon changed mp3:meta to ex.id3.meta. apparently id3 is a better name for …
(edit) @22316   14 years kjdon store extracted namespaced metadata as ex.metadata, eg ex.dc.Title, …
(edit) @22293   14 years kjdon extracted metadata is now going to be added as ex.meta, then GLI will …
(edit) @22267   14 years kjdon fixed a mistake in a method name
(edit) @22232   14 years mdewsnip New OAIMetadataXMLPlugin.pm for extracting information from OAI …
(edit) @22215   14 years kjdon added store_original_file option, used for eg Text, HTML plugins to …
(edit) @22074   14 years kjdon extrametadata needs the filename with no subfolder as that is added in …
(edit) @21981   14 years kjdon fix for ticket #676. Conversion of pdf to html where two pdfs had the …
(edit) @21958   14 years kjdon ppthtml and xslhtml don't seem to output utf8, so remove the …
(edit) @21916   14 years kjdon made this work with a user specified process_exp so that your metadata …
(edit) @21905   14 years mdewsnip Changes made by Jeffrey Ke at DL Consulting Ltd to remove the global …
(edit) @21866   14 years kjdon added some code for if identify returns size in mb.
(edit) @21803   14 years kjdon set file_id to null if ID doesn't match FILE.* (previously it was …
(edit) @21801   14 years kjdon extended HTMLPlugin's metadata_field_separator option to Word and …
(edit) @21800   14 years kjdon added a new option, metadata_field_separator, which specifies what to …
(edit) @21764   14 years kjdon fixed up all my copy and paste errors. doh.
(edit) @21763   14 years kjdon don't modify document_field if info_only - doesn't appear to be …
(edit) @21760   14 years kjdon srclink now generated dynamically at runtime. instead of storing …
(edit) @21759   14 years kjdon changed a comment