source: trunk/gsdl/perllib/plugins

Diff Rev Age Author Log Message
(edit) @11090   18 years kjdon made all plugins that implement read() call read_block to check …
(edit) @11089   18 years kjdon removed a couple of unnecessary bits of code like repeated arguments, …
(edit) @11072   18 years mdewsnip Removed this from CVS because it is a bit too specific to be in the …
(edit) @11071   18 years mdewsnip Undid my previous change. This is going to be removed from CVS and put …
(edit) @11069   18 years mdewsnip Added an option to use Kea 4.0 -- this isn't included with Greenstone, …
(edit) @11044   18 years mdewsnip The "-extract_keyphrase" and "-extract_keyphrase_options" arguments …
(edit) @11043   18 years mdewsnip No idea what this plugin does or is for, but it shouldn't be blocking …
(edit) @11008   18 years mdewsnip Added an option to run the "fribidi" Unicode Bidirectional Algorithm …
(edit) @10997   18 years kjdon new OpenDocument plugin written by Reuben Evans as a 517 project
(edit) @10994   18 years kjdon commented out a line which was incrementing self->num_not_processed. …
(edit) @10985   18 years kjdon added a new option srcicon - can specify a different icon instead of …
(edit) @10978   18 years kjdon added assoc_field option to NULPlug
(edit) @10956   18 years jrm21 now catch and exit if we got an error while parsing/evaling any …
(edit) @10923   18 years jrm21 1) allow a 2nd sql query for 'priming' the db 2) add a space when …
(edit) @10890   18 years kjdon changed convert_to default to auto (was html), got rid of findType …
(edit) @10889   18 years kjdon added a description to metadata_fields arg, also retabbed the argument list
(edit) @10888   18 years kjdon PS can't convert to html, but the default for convert_to was html. so …
(edit) @10839   18 years jrm21 better match when looking at sub-part types so we don't match …
(edit) @10835   18 years kjdon made the -input_encoding=utf8 always be set for htmlplug secondary plugin
(edit) @10834   18 years jrm21 moved utf8 checking code into separate function. (maybe it should be …
(edit) @10833   18 years jrm21 store the names of files we've already checked when looking for a …
(edit) @10827   18 years jrm21 1) include %xx bits when making hrefs out of urls 2) test if text is …
(edit) @10769   19 years mdewsnip When processing Word documents in an 8-bit encoding wvWare would …
(edit) @10725   19 years chi For some reasons, to change the date format to "yyymmdd" used "date" …
(edit) @10724   19 years chi Add an option-metadata_fields to allow user-specified metadata fields …
(edit) @10723   19 years chi Change the option of extracted_word_metadata_fields to metadata_fields.
(edit) @10620   19 years kjdon now prints out some gli tags when bad args are encountered for plugins …
(edit) @10613   19 years kjdon modified the item file metadata regex so that space is allowed (and …
(edit) @10609   19 years kjdon if convert doesn't work, should return -1 (tried and failed) not 0 …
(edit) @10606   19 years kjdon I hadn't actually tested the last fix, so this is the correct fix
(edit) @10605   19 years kjdon make pagedimgplug simple format version use the OID_type option
(edit) @10600   19 years chi Modifications to deal with the document title (as the first H1 heading) …
(edit) @10595   19 years chi Modification of level header regular expression.
(edit) @10594   19 years kjdon mime type added as MimeType metadata
(edit) @10592   19 years kjdon in read, call title_fallback to make sure that we have a title - pdf …
(edit) @10582   19 years kjdon added in cover image handling into read()
(edit) @10580   19 years kjdon if created from pluginfo.pl (self->info_only == 1)then don't load up …
(edit) @10579   19 years kjdon copied classify.pm and BasClas.pm, added -gsdlinfo flag - if this is …
(edit) @10549   19 years chi Modifications to deal with the "dc value" without qualifier.
(edit) @10537   19 years chi Set up the auto conversion type of PSPlug to text.
(edit) @10536   19 years chi Modification of adding pagedimg types of conversion for PS documents. …
(edit) @10514   19 years kjdon added in description_tags option, as it wasn't valid cos no longer …
(edit) @10513   19 years mdewsnip Absolute image tags, like <img src="/image.gif"> were being …
(edit) @10504   19 years kjdon fixed a bug with -convert_to auto handling
(edit) @10503   19 years kjdon added some handling of auto convert to type when windows scripting is on
(edit) @10501   19 years kjdon had to add set_keepold to these cos they are loaded like plugins but …
(edit) @10496   19 years kjdon added some sanity checks, renamed the checkout_toc option to delete_toc
(edit) @10491   19 years kjdon fixed a typo
(edit) @10478   19 years kjdon arcPlug now knows about keepold, and if it's not set, it won't try to do …
(edit) @10466   19 years chi convert_to pagedimg_(gif|jpg|png) will only be shown in the PPTPlug …
(edit) @10465   19 years chi Add convert_post_process() to handle some encoding problems for now.
(edit) @10463   19 years mdewsnip Removing the collection "tmp" directory is now only done when …
(edit) @10453   19 years kjdon fixed up some mistakes from previous merging of davids new code and …
(edit) @10452   19 years kjdon added in allowimagesonly option for use with convert_to html (thanks …
(edit) @10450   19 years kjdon changed from a dos file to unix file (no hat Ms)
(edit) @10446   19 years chi Modifications for converting windows-1252 to windows_1252.
(edit) @10443   19 years chi Modifications to check different StructuredHTML formatting conditions.
(edit) @10442   19 years chi To retrieve encoding information for the HTML file generated from …
(edit) @10441   19 years chi Modifications for pushing required option and argument lists to …
(edit) @10434   19 years chi Tidy up the item file to convert the "&" sign in the metadata to "&amp;".
(edit) @10430   19 years chi Allow removal of the soft_link.
(edit) @10429   19 years chi Modification of the way passing argument and option lists for the …
(edit) @10428   19 years chi Modification of the way passing argument and option lists for the …
(edit) @10427   19 years chi Modification of the way passing argument and option list for the …
(edit) @10426   19 years chi Add an option -extracted_word_metadata to extract metadata based on …
(edit) @10425   19 years chi Modification of the way passing argument and option lists for the …
(edit) @10424   19 years chi Modification of the way passing argument and options list for the …
(edit) @10423   19 years chi Modify the structure of pushing argument and option lists to secondary …
(edit) @10419   19 years kjdon is_incremental renamed is_incremental_capable
(edit) @10406   19 years chi If the -windows_scripting is on in WordPlug, the secondary plugin will …
(edit) @10405   19 years chi Adding structured HTML formatting arguments here.
(edit) @10404   19 years chi remove the plugin arguments to WordPlug.
(edit) @10403   19 years chi Modifications for tidying up the item file generated through pdftoimg.pl.
(edit) @10395   19 years mdewsnip A plugin for RealMedia files. By Xin Gao for the 517 Digital Libraries …
(edit) @10356   19 years chi tidy up the code.
(edit) @10355   19 years chi Remove heading_title options to StructuredHTMLPlug.
(edit) @10354   19 years chi Add an argument "title_sub" here in PagedImgPlug
(edit) @10353   19 years chi Modification for allowing PDF document being converted to various …
(edit) @10352   19 years chi Change the pagedimg_png,jpg,gif (hyphen to underscore) setting in …
(edit) @10347   19 years kjdon removed the unneeded 'use parsargv'
(edit) @10344   19 years kjdon if there was no Title, add PageNum as a Title
(edit) @10329   19 years mdewsnip Changed the default_language string to be of type "string", since …
(edit) @10313   19 years mdewsnip Fixed undefined doc_oid variable problem.
(edit) @10305   19 years davidb newly introduced is_incremental() used to help determine if file needs …
(edit) @10280   19 years chi Some major changes to allow secondary plugin setting.
(edit) @10279   19 years chi A modification to allow a secondary plugin setting through ConvertToPlug
(edit) @10278   19 years chi A major modification to allow a secondary-plugin setting. With this …
(edit) @10277   19 years chi tidy up the filename in add_file().
(edit) @10276   19 years chi Add a read_into_doc_obj() for enabling the secondary-plugin function. …
(edit) @10275   19 years chi A modification to allow a secondary plugin setting through ConvertToPlug
(edit) @10274   19 years chi A modification to allow a secondary plug setting through ConvertToPlug.
(edit) @10273   19 years chi A modification to allow a secondary-plugin setting through ConvertToPlug.
(edit) @10272   19 years chi A modification to allow a secondary-plugin setting.
(edit) @10271   19 years chi A new program to demonstrate HTML document (converted from other …
(edit) @10270   19 years chi The modification to allow the secondary-plugin setting.
(edit) @10254   19 years kjdon added 'use strict' to all plugins, and made modifications (mostly …
(edit) @10229   19 years kjdon fixed up some stuff for printing args (pluginfo.pl, classinfo.pl)
(edit) @10218   19 years kjdon Jeffrey's new parsing modifications, committed approx 6 July, 15.16
(edit) @10170   19 years kjdon made our, added two parse methods - if you want to do xml parsing …
(edit) @10168   19 years kjdon modified this to use a new xml format. it should work as before on the …