source: trunk/gsdl/perllib

Revision Log Mode:


Legend:

Added
Modified
Copied or renamed
Diff Rev Age Author Log Message
(edit) @11090   18 years kjdon made all plugins that implement read() call read_block to check …
(edit) @11089   18 years kjdon removed a couple of unnecessary bits of code like repeated arguments, …
(edit) @11072   18 years mdewsnip Removed this from CVS because it is a bit too specific to be in the …
(edit) @11071   18 years mdewsnip Undid my previous change. This is going to be removed from CVS and put …
(edit) @11070   18 years mdewsnip A much tidier Kea.pm that now also works on Windows.
(edit) @11069   18 years mdewsnip Added an option to use Kea 4.0 -- this isn't included with Greenstone, …
(edit) @11044   18 years mdewsnip The "-extract_keyphrase" and "-extract_keyphrase_options" arguments …
(edit) @11043   18 years mdewsnip No idea what this plugin does or is for, but it shouldn't be blocking …
(edit) @11030   18 years mdewsnip Now removes extra spaces around the '|' (separator) characters.
(edit) @11008   18 years mdewsnip Added an option to run the "fribidi" Unicode Bidirectional Algorithm …
(edit) @10998   18 years kjdon added OpenDocumentPlug.desc
(edit) @10997   18 years kjdon new OpenDocument plugin written by Reuben Evans as a 517 project
(edit) @10994   18 years kjdon commented out a line which was incrementing self->num_not_processed. …
(edit) @10993   18 years kjdon fixed a spelling mistake
(edit) @10986   18 years kjdon added UnknownPlug.srcicon
(edit) @10985   18 years kjdon added a new option srcicon - can specify a different icon instead of …
(edit) @10984   18 years jrm21 move the terminal detection/pager redirection stuff from pluginfo.pl …
(edit) @10983   18 years jrm21 better error message when we can't load an encoding
(edit) @10982   18 years jrm21 better handle a missing strings.rb bundle file, and print a warning to …
(edit) @10980   18 years kjdon in get_filename_for_hashing, if its been processed by NULPlug, return …
(edit) @10979   18 years kjdon DateList can now take a comma separated list of metadata names.
(edit) @10978   18 years kjdon added assoc_field option to NULPlug
(edit) @10977   18 years kjdon extended the match for author metadata - can now have Authors, and …
(edit) @10961   18 years kjdon modified mgpp/lucene indexes description so that you can have multiple …
(edit) @10956   18 years jrm21 now catch and exit if we got an error while parsing/evaling any …
(edit) @10923   18 years jrm21 1) allow a 2nd sql query for 'priming' the db 2) add a space when …
(edit) @10891   18 years kjdon when parsing options, once something has been set in , then don't …
(edit) @10890   18 years kjdon changed convetr_to default to auto (was html), got rid of findType …
(edit) @10889   18 years kjdon added a description to metadata_fields arg, also retabbed the argument list
(edit) @10888   18 years kjdon PS can't convert to html, but the default for convert_to was html. so …
(edit) @10839   18 years jrm21 better match when looking at sub-part types so we don't match …
(edit) @10835   18 years kjdon made the -input_encoding=utf8 always be set for htmlplug secondary plugin
(edit) @10834   18 years jrm21 moved utf8 checking code into separate function. (maybe it should be …
(edit) @10833   18 years jrm21 store the names of files we've already checked when looking for a …
(edit) @10827   18 years jrm21 1) include %xx bits when making hrefs out of urls 2) test if text is …
(edit) @10769   19 years mdewsnip When processing Word documents in an 8-bit encoding wvWare would …
(edit) @10725   19 years chi For some reasons, to change the date format to "yyymmdd" used "date" …
(edit) @10724   19 years chi Add an option-metadata_fields to allow user-specified metadata fields …
(edit) @10723   19 years chi Change the option of extracted_word_metadata_fields to metadata_fields.
(edit) @10664   19 years kjdon added Hierarchy.documents_last
(edit) @10663   19 years kjdon added documents_last - display document nodes after classifier nodes …
(edit) @10631   19 years kjdon BasClas.no_metadata_formatting
(edit) @10630   19 years kjdon added -no_metadata_formatting option to avoid the …
(edit) @10622   19 years kjdon Hierarchy.suppressfirstlevel
(edit) @10621   19 years kjdon added a suppressfirstlevel - use with e.g. gsdlsourcefilename to …
(edit) @10620   19 years kjdon now prints out some gli tags when bad args are encountered for plugins …
(edit) @10613   19 years kjdon modified the item file metadata regex so that space is allowed (and …
(edit) @10609   19 years kjdon if convert doesn't work, should return -1 (tried and failed) not 0 …
(edit) @10608   19 years kjdon added a check for type being defined before using it in set_OIDtype
(edit) @10606   19 years kjdon I hadn't actually tested teh last fix, so this is the correct fix
(edit) @10605   19 years kjdon make pagedimgplug simple format version use the OID_type option
(edit) @10600   19 years chi modifications for deal with document title (as the first H1 heading) …
(edit) @10595   19 years chi Modification of level header regular expression.
(edit) @10594   19 years kjdon mime type added as MimeType metadata
(edit) @10592   19 years kjdon in read, call title_fallback to make sure that we have a title - pdf …
(edit) @10582   19 years kjdon added in cover image handling into read()
(edit) @10580   19 years kjdon if created from pluginfo.pl (self->info_only == 1)then don't load up …
(edit) @10579   19 years kjdon copied classify.pm and BasClas.pm, added -gsdlinfo flag - if this is …
(edit) @10549   19 years chi Modifications to deal with the "dc value" without qualifier.
(edit) @10538   19 years kjdon add assocfilepath metadata always, even if no associated files. with …
(edit) @10537   19 years chi Set up the auto conversion type of PSPlug to text.
(edit) @10536   19 years chi Modification of adding pagedimg types of conversion for PS documents. …
(edit) @10514   19 years kjdon added in description_tags option, as it wasn't valid cos no longer …
(edit) @10513   19 years mdewsnip Absolute image tags, like <img src="/image.gif"> were being …
(edit) @10505   19 years mdewsnip Fixed a bug in the new "-use_hlist_for" stuff. Hope I haven't messed …
(edit) @10504   19 years kjdon fixed a bug with -convert_to auto handling
(edit) @10503   19 years kjdon added some handling of auto convert to type when windows scripting is on
(edit) @10502   19 years mdewsnip GenericList is not an abstract classifier.
(edit) @10501   19 years kjdon had to add set_keepold to these cos they are loaded like plugins but …
(edit) @10499   19 years mdewsnip Changed the way values in arguments are separated :-)
(edit) @10498   19 years mdewsnip Renamed the GenericList arguments to be more descriptive.
(edit) @10497   19 years kjdon renamed WordPlug.checkout_toc to delete_toc
(edit) @10496   19 years kjdon added some sanity checks, renamed the checkout_toc option to delete_toc
(edit) @10491   19 years kjdon fixed a typo
(edit) @10488   19 years kjdon removed BasClas.metadata.deft, added Collage.buttonname, changed the …
(edit) @10486   19 years kjdon removed BasClas.metadata.deft, added Collage.buttonname
(edit) @10485   19 years kjdon change buttonname desc to Collage.buttonname
(edit) @10484   19 years mdewsnip Somehow my GenericList strings got lost, so I'm adding them again.
(edit) @10483   19 years kjdon removed {BasClas.metadata.deft} in deft values
(edit) @10482   19 years kjdon removed debug statement
(edit) @10478   19 years kjdon arcPlug now knows about keepold, and if its not set, it wont try to do …
(edit) @10477   19 years kjdon added a mod to output_collection_meta
(edit) @10476   19 years kjdon added allowimagesonly
(edit) @10475   19 years kjdon added sections_index_document_metadata
(edit) @10474   19 years kjdon implemented sections_index_document_metadata
(edit) @10473   19 years kjdon declared , line 512
(edit) @10469   19 years kjdon added sections_index_document_metadata variable
(edit) @10468   19 years kjdon made a base builder class, adn moved lots of the code to it. hoe I …
(edit) @10467   19 years kjdon added some sections_index_document_metadata strings
(edit) @10466   19 years chi convert_to pagedimg_(gif|jpg|png) will only be shown in the PPTPlug …
(edit) @10465   19 years chi To add the convert_post_process() to handle some encoding problems for now.
(edit) @10463   19 years mdewsnip Removing the collection "tmp" directory is now only done when …
(edit) @10458   19 years mdewsnip Fixed a silly copy and paste mistake.
(edit) @10453   19 years kjdon fixed up some mistakes from previous merging of davids new code and …
(edit) @10452   19 years kjdon added in allowimagesonly option for use with convert_to html (thanks …
(edit) @10450   19 years kjdon changed from a dos file to unix file (no hat Ms)
(edit) @10446   19 years chi Modifications for converting windows-1252 to windows_1252.
(edit) @10444   19 years chi Add the description for extracted_word_metadata_fields option.
(edit) @10443   19 years chi Modifications to check different StructuredHTML formating conditions.
(edit) @10442   19 years chi To retrieve encoding information for the HTML file generated from …
Note: See TracRevisionLog for help on using the revision log.