source: trunk/gsdl/perllib/plugins

Diff Rev Age Author Log Message
(edit) @1741   24 years sjboddie Fixed a little bug that was causing pluginfo.pl to print some dodgy …
(edit) @1740   24 years jrm21 We now escape underscores so that any macros in source code (wrt to …
(edit) @1735   24 years say1 fixed about a billion little Image things.
(edit) @1733   24 years say1 new plugin for images
(edit) @1731   24 years jrm21 New and improved! Now gets #include information from std C files as …
(edit) @1730   24 years jrm21 removed a debugging statement left in accidentally…
(edit) @1729   24 years jrm21 title regexp should have started "\s*", not "\s+" - it's optional …
(edit) @1728   24 years jrm21 Minor change so that leading whitespace is skipped when grabbing the …
(edit) @1720   24 years dmm9 Added information to the usage text about date extraction option
(edit) @1719   24 years dmm9 Added information to the usage text about date extraction option
(edit) @1718   24 years dmm9 Added information to the usage text about date extraction option
(edit) @1712   24 years say1 cleaned up metadata extraction.
(edit) @1711   24 years say1 fixed minor spelling mistake
(edit) @1710   24 years say1 RecPlug now skips CVS directories.
(edit) @1707   24 years jrm21 Plugin for source code (primarily for putting Greenstone src into a …
(edit) @1706   24 years say1 cleaned up the Title code to strip away standard prefixes inserted by …
(edit) @1705   24 years say1 fixed to handle filenames with multiple dots.
(edit) @1700   24 years say1 changed PSPlug to extract CreationDate, Title and Pages info.
(edit) @1699   24 years say1 fixed the bug in HTML plug which broke images for Dave
(edit) @1691   24 years jrm21 return "" instead of exit 1 on error. This means that if 1 file …
(edit) @1686   24 years jrm21 HTMLPlug no longer blocks .pdf files. (also updated reference to this …
(edit) @1685   24 years jrm21 PSPlug based heavily on PDFPlug…
(edit) @1677   24 years paynter Added the BibTex entry type as metadata.
(edit) @1676   24 years paynter Plugins for processing files of bibliography records in BibTex and …
(edit) @1658   24 years paynter Fixed a bug reading the headers that confused "To" with "In-Reply-To".
(edit) @1653   24 years paynter Fixed a few bugs where incorrect variable names were used.
(edit) @1609   24 years say1 fixed print_usage
(edit) @1605   24 years say1 fixed some of my earlier mistakes. sorry Stefan
(edit) @1602   24 years say1 metadata extraction work. (email addresses, generalised HTML tags, …
(edit) @1503   24 years davidb A bit of extra error checking.
(edit) @1482   24 years davidb Small modification so Index files can be in subdirectories of an …
(edit) @1448   24 years paynter Changed regular expressions for extracting metadata from META tags …
(edit) @1446   24 years paynter Major overhauls; works with the new gsConvert.pl instead of …
(edit) @1436   24 years davidb Due to rearrangement of ConvertTo hierarchy, this file is now redundant.
(edit) @1435   24 years davidb Rearrangement of ConvertTo inheritance so HTMLPlug and TextPlug do not …
(edit) @1431   24 years sjboddie Made a few minor adjustments to perl building code for use with …
(edit) @1424   24 years sjboddie Added a -out option to most of the perl building scripts to allow …
(edit) @1420   24 years davidb Moved read_file and read from ConvertToBasPlug to ConvertToPlug.
(edit) @1418   24 years davidb Small modification to improve handling of file names with spaces in.
(edit) @1417   24 years davidb Additions so ConvertPlug etc. can handle filenames with spaces in them.
(edit) @1415   24 years davidb Removed some diagnostic print statements.
(edit) @1411   24 years dmm9 added the options for the date extractor
(edit) @1410   24 years davidb Introduction of "ConvertTo" family of plugins. This establishes a new …
(edit) @1403   24 years say1 taught HTMLPlug about shtml, asp, cgi, php and html query files …
(edit) @1401   24 years davidb Fixed small problem with associated files.
(edit) @1400   24 years davidb General tidying of code.
(edit) @1396   24 years say1 changed initialisation code for acronyms
(edit) @1393   24 years say1 acronym markup functionality
(edit) @1384   24 years paynter Changed language extraction to ignore encoding information, so that …
(edit) @1379   24 years paynter Fixed bug that gave gsdlsourcedocument metadata relative path instead …
(edit) @1360   24 years say1 clarified status messages
(edit) @1358   24 years nzdl Fixed bug I recently introduced into HTMLPlug (<pre> tags were being …
(edit) @1335   24 years say1 many acronym changes
(edit) @1317   24 years paynter Added -extract_language option, which uses the textcat language …
(edit) @1312   24 years sjboddie fixed a bug in the HTML plugin that showed up under windows
(edit) @1269   24 years sjboddie Added ZIPPlug plugin for handling input documents that have been …
(edit) @1245   24 years sjboddie Fixed a bug that davidb found in a couple of regular expressions
(edit) @1244   24 years sjboddie Caught up most general plugins (that's the ones in …
(edit) @1243   24 years sjboddie Caught HTMLPlug up with BasPlug. A few minor changes to some …
(edit) @1242   24 years sjboddie Added Stuart Yeate's acronym extraction code and made it a standard …
(edit) @1235   24 years nzdl * empty log message *
(edit) @1231   24 years gwp Bug fix on the H1 metadata option: if the file has no <H1> tag, …
(edit) @1230   24 years gwp Added an additional H1 metadata field that extracts the text between …
(edit) @1229   24 years sjboddie fixed bug in options
(edit) @1227   24 years sjboddie Modified the perl code for importing arabic encoded documents. Plugins …
(edit) @1221   24 years sjboddie Added a new HBSPlug which is kind of a generalisation of HBPlug …
(edit) @1220   24 years sjboddie Caught HTMLPlug up with the changes I made to BasPlug. HTMLPlug now …
(edit) @1219   24 years sjboddie Made BasPlug take options (these options are available to all plugins …
(edit) @1206   24 years gwp A thorough rewrite; some of the metadata was flawed in such a way that …
(edit) @1190   24 years gwp The first 200 chars of body text can now be extracted as metadata by …
(edit) @1020   24 years sjboddie changed paths to collection images (again!)
(edit) @1010   24 years sjboddie renamed old html module ghtml -- it clashed with builtin html module …
(edit) @1006   24 years sjboddie fixed bug in previous changes
(edit) @973   24 years sjboddie new path to images
(edit) @965   24 years sjboddie fixed bug - added assoc_files option
(edit) @918   24 years kjm18 fixed bug where it was creating two doc_obj per file instead of just one.
(edit) @900   24 years sjboddie tweaked the way associated files are handled at build time - some …
(edit) @897   24 years sjboddie lots of stuff
(edit) @863   24 years sjboddie fixed a couple of bugs that I introduced when including Davids stuff
(edit) @862   24 years sjboddie fixed a couple of bugs that were preventing multiple document gml files …
(edit) @850   24 years sjboddie added use strict - tidied a few things up etc.
(edit) @849   24 years sjboddie Fixed a bit of a bug
(edit) @847   24 years sjboddie fixed CVS burp
(edit) @840   24 years davidb Optimisations to make plugin go faster
(edit) @839   24 years davidb added extra_metadata function
(edit) @809   25 years sjboddie plugins now take options, maxdocs is always defined
(edit) @808   25 years sjboddie New html plugin with options
(edit) @796   25 years sjboddie semi-colon;;;;
(edit) @734   25 years sjboddie removed old out of date comments
(edit) @733   25 years sjboddie just minor changes to book cover image stuff
(edit) @732   25 years sjboddie prevent from overriding Title metadata that may have been passed in …
(edit) @721   25 years davidb Support functions to help with the generation of webpages from Perl …
(edit) @709   25 years sjboddie no longer need classifytype metadata added from plugin
(edit) @707   25 years sjboddie fixed a windows specific bug
(edit) @678   25 years sjboddie added bookcover
(edit) @640   25 years sjboddie fixed an error that was causing a 'key: ' line to be sent to plugins
(edit) @639   25 years sjboddie added the <pre> tags …
(edit) @638   25 years sjboddie Gordon's new email plugin thingy
(edit) @620   25 years sjboddie functionality extended so a list of directories to ignore can be …
(edit) @617   25 years sjboddie a few fixes