source: trunk/gsdl/perllib/plugins

Revision Log Mode:


Legend:

Added
Modified
Copied or renamed
Diff Rev Age Author Log Message
(edit) @1482   24 years davidb Small modification so Index files can be in subdirectories of an …
(edit) @1448   24 years paynter Changed regular expressions for extracting metadata from META tags …
(edit) @1446   24 years paynter Major overhauls; works with the new gsConvert.pl instead of …
(edit) @1436   24 years davidb Due to rearrangement of ConvertTo hierarchy, this file is now redundant.
(edit) @1435   24 years davidb Rearrangement of ConvertTo inheritence so HTMLPlug and TextPlug do not …
(edit) @1431   24 years sjboddie Made a few minor adjustments to perl building code for use with …
(edit) @1424   24 years sjboddie Added a -out option to most of the perl building scripts to allow …
(edit) @1420   24 years davidb Moved read_file and read from ConvertToBasPlug to ConvertToPlug.
(edit) @1418   24 years davidb Small modification to improve handling of file names with spaces in.
(edit) @1417   24 years davidb Additions so ConvertPlug etc. can handle filenames with spaces in them.
(edit) @1415   24 years davidb Removed some diagnostic print statements.
(edit) @1411   24 years dmm9 added the options for the date extractor
(edit) @1410   24 years davidb Introduction of "ConvertTo" family of plugins. This establishes a new …
(edit) @1403   24 years say1 taught HTMLPlug about shtml, asp, cgi, php and html query files …
(edit) @1401   24 years davidb Fixed small problem with associated files.
(edit) @1400   24 years davidb General tidying of code.
(edit) @1396   24 years say1 changed initialisation code for acronyms
(edit) @1393   24 years say1 acronym markup functionality
(edit) @1384   24 years paynter Changed language extraction to ignoer encoding information, so that …
(edit) @1379   24 years paynter Fixed bug that gave gsdlsourcedocument metadata relative path instead …
(edit) @1360   24 years say1 clarified status messages
(edit) @1358   24 years nzdl Fixed bug I recently introduced into HTMLPlug (<pre> tags were being …
(edit) @1335   24 years say1 many acronym changes
(edit) @1317   24 years paynter Added -extract_language option, which uses the textcat language …
(edit) @1312   24 years sjboddie fixed a bug in the HTML plugin that showed up under windows
(edit) @1269   24 years sjboddie Added ZIPPlug plugin for handling input documents that have been …
(edit) @1245   24 years sjboddie Fixed a bug that davidb found in a couple of regular expressions
(edit) @1244   24 years sjboddie Caught up most general plugins (that's the ones in …
(edit) @1243   24 years sjboddie Caught HTMLPlug up with BasPlug. A few minor changes to some …
(edit) @1242   24 years sjboddie Added Stuart Yeate's acronym extraction code and made it a standard …
(edit) @1235   24 years nzdl * empty log message *
(edit) @1231   24 years gwp Bug fix on the H1 metadata option: if the file has no <H1> tag, …
(edit) @1230   24 years gwp Added an additional H1 metadata field that extracts the text between …
(edit) @1229   24 years sjboddie fixed bug in options
(edit) @1227   24 years sjboddie Modified the perl code for importing arabic encoded documents. Plugins …
(edit) @1221   24 years sjboddie Added a new HBSPlug which is kind of a generalisation of HBPlug …
(edit) @1220   24 years sjboddie Caught HTMLPlug up with the changes I made to BasPlug. HTMLPlug now …
(edit) @1219   24 years sjboddie Made BasPlug take options (these options are available to all plugins …
(edit) @1206   24 years gwp A thorough rewrite; some of the metadata was flawed in such a way that …
(edit) @1190   24 years gwp The first 200 chars of body text can now be extracted as metadata by …
(edit) @1020   24 years sjboddie changed paths to collection images (again!)
(edit) @1010   24 years sjboddie renamed old html module ghtml -- it clashed with builtin html module …
(edit) @1006   24 years sjboddie fixed but in previous changes
(edit) @973   24 years sjboddie new path to images
(edit) @965   24 years sjboddie fixed bug - added assoc_files option
(edit) @918   24 years kjm18 fixed bug where it was creating two doc_obj per file instead of just one.
(edit) @900   24 years sjboddie tweaked the way associated files are handled at build time - some …
(edit) @897   24 years sjboddie lots of stuff
(edit) @863   24 years sjboddie fixed a couple of bugs that I introduced when including Davids stuff
(edit) @862   24 years sjboddie fixed a couple of bugs that were preventing muliple document gml files …
(edit) @850   24 years sjboddie added use strict - tidied a few things up etc.
(edit) @849   24 years sjboddie Fixed a bit of a bug
(edit) @847   24 years sjboddie fixed CVS burp
(edit) @840   24 years davidb Optimisations to make plugin go faster
(edit) @839   24 years davidb added extra_metadata function
(edit) @809   24 years sjboddie plugins now take options, maxdocs is always defined
(edit) @808   24 years sjboddie New html plugin with options
(edit) @796   24 years sjboddie semi-colon;;;;
(edit) @734   25 years sjboddie removed old out of date comments
(edit) @733   25 years sjboddie just minor changes to book cover image stuff
(edit) @732   25 years sjboddie prevent from overriding Title metadata that may have been passed in …
(edit) @721   25 years davidb Support functions to help with the generation of webpages from Perl …
(edit) @709   25 years sjboddie no longer need classifytype metadata added from plugin
(edit) @707   25 years sjboddie fixed a windows specific bug
(edit) @678   25 years sjboddie added bookcover
(edit) @640   25 years sjboddie fixed an error that was causing a 'key: ' line to be sent to plugins
(edit) @639   25 years sjboddie added the <pre> tags …
(edit) @638   25 years sjboddie Gordon's new email plugin thingy
(edit) @620   25 years sjboddie functionality extended so a list of directories to ignore can be …
(edit) @617   25 years sjboddie a few fixes
(edit) @616   25 years sjboddie some new gb plugins
(edit) @594   25 years sjboddie Fixed some silly problems
(edit) @593   25 years sjboddie Fixed bug causing everything to die if it tripped over an unreadable …
(edit) @592   25 years sjboddie new plugin for GB encoded text
(edit) @589   25 years sjboddie fixed bug in regular expression
(edit) @585   25 years sjboddie new plugin
(edit) @537   25 years sjboddie added GPL headers
(edit) @433   25 years sjboddie added gzip option to import.pl
(edit) @339   25 years sjboddie Fixed a bug that I created last time
(edit) @321   25 years sjboddie Fixed a couple of small bugs
(edit) @318   25 years sjboddie fixed a bug causing documents without <body> tags to have no text
(edit) @317   25 years sjboddie Added maxdocs option
(edit) @288   25 years sjboddie Fixed up a few things to allow collections to be built directly …
(edit) @286   25 years sjboddie few changes mostly to get howto and organization classifications …
(edit) @285   25 years sjboddie Did some hacking around here while trying to build subsets of existing …
(edit) @250   25 years sjboddie just changed a comment to stop confusing myself
(edit) @245   25 years sjboddie Small changes to allow metadata to be passed to plugins from …
(edit) @233   25 years sjboddie Altered TOCPlug to work with new gdbm building code. It now reads a …
(edit) @230   25 years sjboddie fixed typo I introduced last time
(edit) @229   25 years sjboddie Removed old sorting function
(edit) @168   25 years rjmcnab Initial revision.
(edit) @139   25 years sjboddie Got building stuff to handle subcollections and language subcollections
(edit) @136   25 years rjmcnab Fixed small bug.
(edit) @88   25 years rjmcnab Special characters are now escaped in GML files.
(edit) @87   25 years rjmcnab Fixed a bug in the sorting and then removed the sorting as its too …
(edit) @75   25 years rjmcnab Added support for the new version of doc.pl (which uses the UTF-8 …
(edit) @70   25 years sjboddie Put sorting back in for now as removing it caused some problems for …
(edit) @68   25 years sjboddie Don't need to be sorting files at build time anymore
(add) @4   25 years sjboddie Initial revision
Note: See TracRevisionLog for help on using the revision log.