source: trunk/gsdl/perllib/plugins

Revision Log Mode:


Legend:

Added
Modified
Copied or renamed
Diff Rev Age Author Log Message
(edit) @12270   18 years kjdon set_OIDtype now takes two arguments, the type and the metadata (used …
(edit) @12269   18 years kjdon fixed a typo in a comment
(edit) @12169   18 years mdewsnip Tidied up that horrible long line in the new() function of every plugin.
(edit) @11995   18 years davidb Improved support for incremental addition: instead of having to run …
(edit) @11970   18 years mdewsnip Removed, as the version of the translator it was used for is long gone.
(edit) @11966   18 years mdewsnip (Profiling) Creating new textcat objects (one for each plugin) is …
(edit) @11920   18 years mdewsnip Removed the code that converts the filename into a regular expression, …
(edit) @11919   18 years mdewsnip Fixed some metadata_read code that was obviously not tested on Windows.
(edit) @11918   18 years mdewsnip Simple plugin for reading metadata from comma-separated value files.
(edit) @11893   18 years kjdon deleted some stuff to do with original Word documents - this plugin …
(edit) @11884   18 years kjdon removed tof_header option - can just specify this in the toc_header option
(edit) @11881   18 years kjdon added a test for image file in case there isn't one, in process_item
(edit) @11880   18 years kjdon added a #" to line 1100 so that emacs colouring is not stuffed up
(edit) @11851   18 years kjdon added modegli=3 to delete_toc option so that it only shows up if the …
(edit) @11849   18 years kjdon added the header etc args into StructuredHTMLPlug args otherwise it …
(edit) @11834   18 years mdewsnip Replaced all "_httpcollection_" in metadata (especially srclink) with …
(edit) @11833   18 years kjdon the test for windows scripting now tests if its true, rather than defined
(edit) @11827   18 years kjdon moved the add FileFormat metadata bit outside of the loop so that it …
(edit) @11681   18 years kjdon print_xml_usage and print_xml_header now take arguments
(edit) @11680   18 years kjdon changed how it loads secondary plugins - now uses eval. did this cos …
(edit) @11679   18 years kjdon changed Yes to yes for inherits
(edit) @11677   18 years kjdon added a few more print methods so that pluginfo will work properly
(edit) @11676   18 years kjdon added adstract tag or changed Yes to yes for inherits
(edit) @11669   18 years kjdon need to pass a parameter to print_xml_header so it knows which DTD to …
(edit) @11661   18 years kjdon changed COLLECTDIR to GSDLCOLLECTDIR cos thats what everyone else uses
(edit) @11546   18 years mdewsnip Added a comment regarding my last change.
(edit) @11545   18 years mdewsnip Added support for repeatable fields using .
(edit) @11538   18 years cvs_anon make HTMLPlug recognize the @ character in file extensions
(edit) @11527   18 years kjdon added URL properly to the list of fields
(edit) @11506   18 years kjdon fixed the bug where space outside Metadata tags was getting added into …
(edit) @11493   18 years kjdon only add [img1], [img2] to the dummy text if assoc_images is on
(edit) @11468   18 years mdewsnip Ooops... fix to code I just committed to deal with < and > characters.
(edit) @11467   18 years mdewsnip Changed my mind about the previous change. Now the nicely formatted …
(edit) @11466   18 years mdewsnip Improved the formatting of the ISISRecordHTML table slightly.
(edit) @11465   18 years mdewsnip Now adds ISISRecordHTML metadata also, as a shortcut towards a nicely …
(edit) @11430   18 years mdewsnip Fix for removing nasty carriage returns/line feeds from the end of …
(edit) @11414   18 years kjdon added URL to the list of fields
(edit) @11390   18 years mdewsnip Added process_exp arguments to these plugins so the GLI knows what …
(edit) @11389   18 years jrm21 try to get the encoding from a '<meta http-equiv' tag if HTML. make …
(edit) @11380   18 years kjdon davids fixes for a couple of warnings
(edit) @11379   18 years mdewsnip Bug fix for subfield change.
(edit) @11368   18 years kjdon For some reason smart_block was hidden for gli, so made it visible
(edit) @11353   18 years mdewsnip Added a bit of code to prevent infinite looping on bad data.
(edit) @11335   18 years mdewsnip Now sends messages to the GLI telling the user the CDS/ISIS file is …
(edit) @11334   18 years mdewsnip Errors reading the .fdt or .xrf files are now reported back to the GLI …
(edit) @11333   18 years mdewsnip Now consistently sets $self->{'gli'} in plugin::begin.
(edit) @11332   18 years mdewsnip Added a mechanism for plugins to do tidying up after exploding. …
(edit) @11329   18 years mdewsnip Made the block_exp option hidden from the GLI, as it is non-negiotable.
(edit) @11305   18 years kjdon forgot to change the mp3 icon name
(edit) @11300   18 years mdewsnip Now also removes '#' characters from field names.
(edit) @11299   18 years mdewsnip Now removes more punctuation from field names, and removes the …
(edit) @11298   18 years mdewsnip Fairly substantial changes in order to deal with …
(edit) @11295   18 years mdewsnip Hide the "split_exp" option from the GLI, since it should never need …
(edit) @11286   18 years kjdon Richards (DLC) fixes for allowing single quotes to surround href, src …
(edit) @11262   18 years mdewsnip The FDT file wasn't being read using the encoding specified, so things …
(edit) @11249   18 years kjdon added some checks that the image and text files exist before we try to …
(edit) @11193   18 years mdewsnip Changed ogg.png to iogg.gif.
(edit) @11123   18 years davidb Filename stored as URL metadata was missing from this plugin and …
(edit) @11122   18 years davidb Introduction of -associate_tail_re option to BasPlug. This is a …
(edit) @11090   18 years kjdon made all plugins that implement read() call read_block to check …
(edit) @11089   18 years kjdon removed a couple of unnecessary bits of code like repeated arguments, …
(edit) @11072   18 years mdewsnip Removed this from CVS because it is a bit too specific to be in the …
(edit) @11071   18 years mdewsnip Undid my previous change. This is going to be removed from CVS and put …
(edit) @11069   18 years mdewsnip Added an option to use Kea 4.0 -- this isn't included with Greenstone, …
(edit) @11044   18 years mdewsnip The "-extract_keyphrase" and "-extract_keyphrase_options" arguments …
(edit) @11043   18 years mdewsnip No idea what this plugin does or is for, but it shouldn't be blocking …
(edit) @11008   18 years mdewsnip Added an option to run the "fribidi" Unicode Bidirectional Algorithm …
(edit) @10997   18 years kjdon new OpenDocument plugin written by Reuben Evans as a 517 project
(edit) @10994   18 years kjdon commented out a line which was incrementing self->num_not_processed. …
(edit) @10985   18 years kjdon added a new option srcicon - can specify a different icon instead of …
(edit) @10978   18 years kjdon added assoc_field option to NULPlug
(edit) @10956   18 years jrm21 now catch and exit if we got an error while parsing/evaling any …
(edit) @10923   18 years jrm21 1) allow a 2nd sql query for 'priming' the db 2) add a space when …
(edit) @10890   18 years kjdon changed convetr_to default to auto (was html), got rid of findType …
(edit) @10889   18 years kjdon added a description to metadata_fields arg, also retabbed the argument list
(edit) @10888   18 years kjdon PS can't convert to html, but the default for convert_to was html. so …
(edit) @10839   19 years jrm21 better match when looking at sub-part types so we don't match …
(edit) @10835   19 years kjdon made the -input_encoding=utf8 always be set for htmlplug secondary plugin
(edit) @10834   19 years jrm21 moved utf8 checking code into separate function. (maybe it should be …
(edit) @10833   19 years jrm21 store the names of files we've already checked when looking for a …
(edit) @10827   19 years jrm21 1) include %xx bits when making hrefs out of urls 2) test if text is …
(edit) @10769   19 years mdewsnip When processing Word documents in an 8-bit encoding wvWare would …
(edit) @10725   19 years chi For some reasons, to change the date format to "yyymmdd" used "date" …
(edit) @10724   19 years chi Add an option-metadata_fields to allow user-specified metadata fields …
(edit) @10723   19 years chi Change the option of extracted_word_metadata_fields to metadata_fields.
(edit) @10620   19 years kjdon now prints out some gli tags when bad args are encountered for plugins …
(edit) @10613   19 years kjdon modified the item file metadata regex so that space is allowed (and …
(edit) @10609   19 years kjdon if convert doesn't work, should return -1 (tried and failed) not 0 …
(edit) @10606   19 years kjdon I hadn't actually tested teh last fix, so this is the correct fix
(edit) @10605   19 years kjdon make pagedimgplug simple format version use the OID_type option
(edit) @10600   19 years chi modifications for deal with document title (as the first H1 heading) …
(edit) @10595   19 years chi Modification of level header regular expression.
(edit) @10594   19 years kjdon mime type added as MimeType metadata
(edit) @10592   19 years kjdon in read, call title_fallback to make sure that we have a title - pdf …
(edit) @10582   19 years kjdon added in cover image handling into read()
(edit) @10580   19 years kjdon if created from pluginfo.pl (self->info_only == 1)then don't load up …
(edit) @10579   19 years kjdon copied classify.pm and BasClas.pm, added -gsdlinfo flag - if this is …
(edit) @10549   19 years chi Modifications to deal with the "dc value" without qualifier.
(edit) @10537   19 years chi Set up the auto conversion type of PSPlug to text.
(edit) @10536   19 years chi Modification of adding pagedimg types of conversion for PS documents. …
Note: See TracRevisionLog for help on using the revision log.