source: main/trunk/greenstone2/perllib

Revision Log Mode:


Legend:

Added
Modified
Copied or renamed
Diff Rev Age Author Log Message
(edit) @22641   14 years kjdon now inherits from OOConvertBinaryFile. still a couple of things to iron out
(edit) @22640   14 years kjdon now uses new OOConvertBInaryFile super class
(edit) @22639   14 years kjdon now uses new OOConvertBinaryFile as super class
(edit) @22638   14 years kjdon new ConvertBinaryFile plugin that will include OpenOfficeConverter if …
(edit) @22636   14 years davidb Using -utf8 as options to html-tidy leads to wrong encoding for HTML …
(edit) @22632   14 years mdewsnip Changed "use textcat" to a "require textcat", so it is only loaded if …
(edit) @22612   14 years kjdon made the default process exp a bit nicer to read
(edit) @22611   14 years kjdon now uses OpenOfficeConverter that is not ConvertBinaryFile
(edit) @22597   14 years kjdon code tidy up. rearranged how convertbinaryfile plugins set up their …
(edit) @22594   14 years kjdon committing for davidb. pipe output of tidy to /dev/null if verbosity is low
(edit) @22590   14 years kjdon perl told me to replace \1 and \2 with $1 and $2
(edit) @22567   14 years kjdon the last step in mark_docs_for_deletion, only do it in delete mode. …
(edit) @22565   14 years kjdon removed block exp. now it scans the item file to work out which files …
(edit) @22553   14 years ak19 To turn the sort-using-unicode-collation option of the generic List …
(edit) @22552   14 years kjdon by default we want this to process all files, so changed default …
(edit) @22536   14 years max Fix the bug where metadata containing back slashes would endlessly be …
(edit) @22535   14 years max Minor cleanup
(edit) @22515   14 years kjdon added open office support for PowerPoint and Excel plugins. followed …
(edit) @22514   14 years kjdon small tidyings
(edit) @22507   14 years kjdon some moving around and tidying up of code
(edit) @22505   14 years kjdon added the openoffice_scripting arg here instead of in OpenOfficeConverter
(edit) @22504   14 years kjdon for WordPLugin, if openoffice_scripting is set, need to use …
(edit) @22503   14 years kjdon StructuredHTMLPlugin needs -description tags if office_scripting is …
(edit) @22485   14 years ak19 1. Dr Bainbridge fixed the database perl modules to all have the …
(edit) @22469   14 years davidb Debugging statement had been left in. Now commented out
(edit) @22466   14 years ak19 Set archivedir to "" in the first instance
(edit) @22465   14 years ak19 Improved checking of environment variables for Fedora values in BEGIN block
(edit) @22464   14 years ak19 Added in missing routine that had forgotten to be brought over from …
(edit) @22462   14 years ak19 Commented out useful (but no needed in svn commited version) print …
(edit) @22460   14 years ak19 Reverted to using -exportdir with export.pl rather than 'merged' …
(edit) @22456   14 years sjb48 Modified collConfigxml.pm to read and configure build options for …
(edit) @22451   14 years kjdon added metadata_field_separator option. if set eg to ;, will split all …
(edit) @22450   14 years kjdon missing argument to autorun_general_cmd. oh the trouble that caused, …
(edit) @22448   14 years kjdon metadata values mught be array type - add each individual item as a …
(edit) @22445   14 years davidb Incorporation of top-level parallel_import functionality into code base.
(edit) @22438   14 years davidb Additional flexibility added in to how get_tmp_filename works with …
(edit) @22431   14 years davidb Correction to caching technique to work with input file rather than …
(edit) @22428   14 years davidb Restructuring of WordPlugin to dynamically inherit from …
(edit) @22427   14 years davidb Adjustment of whitespace
(edit) @22421   14 years davidb Continued work on refactoring code to have better shared support for …
(edit) @22413   14 years davidb Initial pass at getting the main code to import.pl (and the very …
(edit) @22412   14 years davidb More accurate comment added
(edit) @22402   14 years kjdon added a couple of strings for MediainfoOGVPlugin
(edit) @22401   14 years kjdon tidied up a bit. Removed options that (I think) will always be set the …
(edit) @22386   14 years davidb Extra check added to avoid appending/prepending an environment …
(edit) @22382   14 years kjdon if language is ar for phind, then we set textorientation parameter to rtl.
(edit) @22372   14 years ak19 None of Fedora, the GS3 democlient and Greenstone seem to require the …
(edit) @22364   14 years ak19 MediainfoOGVPlugin now includes the final changes Arnaud made to his …
(edit) @22363   14 years ak19 Adding in the adjustments to the mediainfoogv plugin that were mailed …
(edit) @22362   14 years ak19 Committing Arnaud Yvan's MediainfoOGVPlugin.pm for the next release of …
(edit) @22355   14 years kjdon previously, when use_realistic_book was set, all files listed in …
(edit) @22352   14 years kjdon remove ex. when generating index lists. Don't want any ex. in …
(edit) @22351   14 years davidb White-space tidy up
(edit) @22350   14 years davidb Testing for newer version of cached file had the 'sign' incorrect. …
(edit) @22349   14 years kjdon if metadata extracted from item file has a namespace, then prefix it …
(edit) @22348   14 years kjdon store any extracted metadata that has a namespace as ex.ns.meta
(edit) @22338   14 years ak19 Image URLs are adjusted to refer to their location in Fedora.
(edit) @22331   14 years ak19 Goes with the changes made in revision 21822: fixed several additional …
(edit) @22330   14 years kjdon we want to store the original file name not the tidy filename as the …
(edit) @22329   14 years kjdon changed mp3:meta to ex.id3.meta. apparently id3 isa better name for …
(edit) @22328   14 years kjdon store the path to doc.xml with unix / slashes so that the collection …
(edit) @22327   14 years kjdon if a file and all its associated files is deleted from import, then …
(edit) @22316   14 years kjdon store extracted namespaced metadata as ex.metadata, eg ex.dc.Title, …
(edit) @22296   14 years kjdon remove ex. from any metadata before writing it to gdbm file. …
(edit) @22294   14 years kjdon when asking for a piece of metadata, want to remove ex. from the ones …
(edit) @22293   14 years kjdon extracted metadata is now going to be added as ex.meta, then GLI will …
(edit) @22267   14 years kjdon fixed a mistake in a method name
(edit) @22264   14 years kjdon greenstone doesn't like digit only macros when they appear inside …
(edit) @22232   14 years mdewsnip New OAIMetadataXMLPlugin.pm for extracting information from OAI …
(edit) @22222   14 years kjdon half way through the process of tidying up incremental stuff. When …
(edit) @22221   14 years kjdon added commented quote to make emacs colouring proper
(edit) @22220   14 years kjdon in classify, don't readd the document if edit mode is update - all we …
(edit) @22216   14 years kjdon added BasePlugin.store_original_file
(edit) @22215   14 years kjdon added store_original_file option, used for eg Text, HTML plugins to …
(edit) @22175   14 years kjdon remove prefix and suffix are defined per metadata group, not per …
(edit) @22121   14 years max Test added to insure that there is a valid string to convert into the …
(edit) @22120   14 years max Extra check added to allow for documents that do not have any …
(edit) @22119   14 years max Don't check for NTFS anymore because link should take care of saying …
(edit) @22087   14 years sjm84 Corrected an error with escaping
(edit) @22086   14 years sjm84 Corrected a small error in the leading comment of OAIDownload.pm: …
(edit) @22076   14 years sjm84 Added error messages to these files that are printed when a write …
(edit) @22074   14 years kjdon extrametadata needs the filename with no subfolder as that is added in …
(edit) @22038   14 years davidb Tweaked print statment to be more consistent with other classifier …
(edit) @22037   14 years davidb Manifest file processing upgraded to support OIDs. The code then uses …
(edit) @22032   14 years davidb Options supported by mkcol.pl expanding to include -buildtype and …
(edit) @22010   14 years davidb GDBM reader/writer conflict resulted from swithcing from the (now …
(edit) @21981   14 years kjdon fix for ticket #676. Conversion of pdf to html where two pdfs had the …
(edit) @21969   14 years kjdon store the offsets in add_vlist for all the cases, not just for when …
(edit) @21958   14 years kjdon ppthtml and xslhtml don't seem to output utf8, so remove the …
(edit) @21916   14 years kjdon made this work with a user specified process_exp so that your metadata …
(edit) @21905   14 years mdewsnip Changes made by Jeffrey Ke at DL Consulting Ltd to remove the global …
(edit) @21876   14 years kjdon only process into english clauses if english is the only language, not …
(edit) @21872   14 years davidb sqlite3 can take an sql statement as an extra parameter. Better to do …
(edit) @21866   14 years kjdon added some code for if identify returns size in mb.
(edit) @21862   14 years davidb Documents now have 'lastmodifieddate' added as metadata in yyyymmdd format
(edit) @21857   14 years davidb Removal of GDBM specific code in favour of write_infodb_rawentry() …
(edit) @21856   14 years davidb read_infodb_entry() implemented for sqlite. new …
(edit) @21822   14 years ak19 Dr Bainbridge has fixed several perl files that depended on perl 5.8 …
(edit) @21803   14 years kjdon set file_id to null if ID doesn't match FILE.* (previously it was …
(edit) @21801   14 years kjdon extended HTMLPlugin's metadata_field_separator option to Word and …
Note: See TracRevisionLog for help on using the revision log.