source: trunk/gsdl/perllib/plugins

Revision Log Mode:


Legend:

Added
Modified
Copied or renamed
Diff Rev Age Author Log Message
(edit) @2327   23 years sjboddie * empty log message *
(edit) @2326   23 years sjboddie fixed a small bug in the new XML gml code that caused metadata tags …
(edit) @2267   23 years davidb GML file syntax altered to be XML compliant. This basically meant …
(edit) @2241   23 years sjboddie Tidied up the ConvertToPlug stuff to get it working on Windows 95/98
(edit) @2235   23 years sjboddie Hacked the textcat package about so that it only reads all the …
(edit) @2230   23 years paynter User can erquest a "Screen" image - essentially a second thumbnail. …
(edit) @2228   23 years paynter The -use_metadata_files option tells RecPlug to read any metadata XML …
(edit) @2226   23 years paynter Image size metadata fixed, dummy text added, Image filename fixed.
(edit) @2219   23 years sjboddie Had another go at suppressing the "subroutine redefined" warnings as …
(edit) @2209   23 years sjboddie Suppressed some annoying perl warnings
(edit) @2207   23 years paynter Bugfixes: read returns number of files instead of file type, and …
(edit) @2096   23 years jrm21 Minor changes to regexs, so that header fields have to be at start of …
(edit) @2086   23 years jrm21 We create a copy of any args to new() because parsargs might modify …
(edit) @2085   23 years jrm21 When importing, we need to escape any escape codes otherwise mg(?) …
(edit) @2084   23 years jrm21 usage message is now formatted to fit within 80 columns.
(edit) @2082   23 years jrm21 added bzip2 support (untested).
(edit) @2041   23 years jrm21 don't strip all whitespace from tmp filename, only from base name. …
(edit) @2036   23 years jrm21 don't use strict; anymore, as we want to be able to write error msgs …
(edit) @2029   23 years jrm21 Return 0 instead of "" on error in read() so that RecPlug can continue.
(edit) @2027   23 years jrm21 read() is now completely independent of BasPlug::read(), as the latter …
(edit) @2007   23 years sjboddie * empty log message *
(edit) @1999   23 years sjboddie Fixed a small problem with language detection code.
(edit) @1974   23 years cs025 Fixed omission of encoding from parameters in read_file
(edit) @1954   23 years jmt14 * empty log message *
(edit) @1929   23 years dg5 Modified: ConvertToPlug and HTMLPlug to handle files in binary mode to …
(edit) @1903   23 years sjboddie We now use textcats best guess if it returns 3 or less possibilities …
(edit) @1895   23 years jrm21 Email plug now uses SplitPlug for mbox mail files. Hopefully this …
(edit) @1894   23 years jrm21 updated by copying BasPlug's new language/encoding stuff over for the …
(edit) @1891   23 years paynter Named characters like é and ì are translated to UTF8 …
(edit) @1874   23 years sjboddie * empty log message *
(edit) @1870   23 years sjboddie Tidied up language support stuff.
(edit) @1869   23 years paynter Regular expression fix.
(edit) @1868   23 years sjboddie Made a bunch of changes to the building code to support lots of new …
(edit) @1857   23 years dmm9 date extraction options documented
(edit) @1855   23 years paynter Trivial change to warning message.
(edit) @1846   23 years sjboddie Removed a call to a function that I removed in my previous changes - oops
(edit) @1845   23 years paynter Changed a "!=" to a "ne".
(edit) @1844   23 years sjboddie Added an 'auto' argument to BasPlug's '-input_encoding' option ('auto' …
(edit) @1838   23 years sjboddie Added support for Cyrillic languages (windows codepage 1251) - yet to …
(edit) @1812   23 years sjboddie ZIPPlug is now disabled under windows
(edit) @1810   23 years sjboddie Fixed a bug that showed up when using Perl 5.6 on windows
(edit) @1787   23 years jrm21 "allow_extra_options" missing, to get inherited options
(edit) @1758   23 years say1 added minimum image size and a few bug fixes
(edit) @1757   23 years say1 tightened the criteria for email files to avoid matching all dynamic …
(edit) @1756   23 years say1 added detection and handling of unreadable files
(edit) @1755   23 years say1 added better cycle detection (but still not perfect)
(edit) @1754   23 years say1 added support for jar files (which are actually just fancy zip files)
(edit) @1744   23 years say1 about a billion changes to ImagePlug
(edit) @1742   23 years jrm21 Added a comment to the usage stuff about PRESCRIPT.
(edit) @1741   23 years sjboddie Fixed a little bug that was causing pluginfo.pl to print some dodgy …
(edit) @1740   23 years jrm21 We now escape underscores so that any macros in source code (wrt to …
(edit) @1735   23 years say1 fixed about a billion little Image things.
(edit) @1733   23 years say1 new plugin for images
(edit) @1731   23 years jrm21 New and improved! Now gets #include information from std C files as …
(edit) @1730   23 years jrm21 removed a debugging statement left in accidentally…
(edit) @1729   23 years jrm21 title regexp should have started "\s*", not "\s+" - it's optional …
(edit) @1728   23 years jrm21 Minor change so that leading whitespace is skipped when grabbing the …
(edit) @1720   23 years dmm9 Added information to the usage text about date extraction option
(edit) @1719   23 years dmm9 Added information to the usage text about date extraction option
(edit) @1718   23 years dmm9 Added information to the usage text about date extraction option
(edit) @1712   23 years say1 cleaned up metadata extraction.
(edit) @1711   23 years say1 fixed minor spelling mistake
(edit) @1710   23 years say1 RecPlug now skips CVS directories.
(edit) @1707   23 years jrm21 Plugin for source code (primarily for putting Greenstone src into a …
(edit) @1706   23 years say1 cleaned up the Title code to strip away standard prefixes inserted by …
(edit) @1705   23 years say1 fixed to handle filenames with multiple dots.
(edit) @1700   23 years say1 changed PSPlug to extract CreationDate, Title and Pages info.
(edit) @1699   23 years say1 fixed the bug in HTML plug which broke images for Dave
(edit) @1691   24 years jrm21 return "" instead of exit 1 on error. This means that if 1 file …
(edit) @1686   24 years jrm21 HTMLPlug no longer blocks .pdf files. (also updated reference to this …
(edit) @1685   24 years jrm21 PSPlug based heavily on PDFPlug…
(edit) @1677   24 years paynter Added teh BibTex entry type as metadata.
(edit) @1676   24 years paynter Plugins for processing files of bibliography records in BibTex and …
(edit) @1658   24 years paynter Fixed a bug reading the headers that confused "To" with "In-Reply-To".
(edit) @1653   24 years paynter Fixed a few bugs where incorrect variable names were used.
(edit) @1609   24 years say1 fixed print_uage
(edit) @1605   24 years say1 fixed some of my earlier mistakes. sorry Stefan
(edit) @1602   24 years say1 metadata extraction work. (email addresses, generalised HTML tags, …
(edit) @1503   24 years davidb A bit of extra error checking.
(edit) @1482   24 years davidb Small modification so Index files can be in subdirectories of an …
(edit) @1448   24 years paynter Changed regular expressions for extracting metadata from META tags …
(edit) @1446   24 years paynter Major overhauls; works with the new gsConvert.pl instead of …
(edit) @1436   24 years davidb Due to rearrangement of ConvertTo hierarchy, this file is now redundant.
(edit) @1435   24 years davidb Rearrangement of ConvertTo inheritence so HTMLPlug and TextPlug do not …
(edit) @1431   24 years sjboddie Made a few minor adjustments to perl building code for use with …
(edit) @1424   24 years sjboddie Added a -out option to most of the perl building scripts to allow …
(edit) @1420   24 years davidb Moved read_file and read from ConvertToBasPlug to ConvertToPlug.
(edit) @1418   24 years davidb Small modification to improve handling of file names with spaces in.
(edit) @1417   24 years davidb Additions so ConvertPlug etc. can handle filenames with spaces in them.
(edit) @1415   24 years davidb Removed some diagnostic print statements.
(edit) @1411   24 years dmm9 added the options for the date extractor
(edit) @1410   24 years davidb Introduction of "ConvertTo" family of plugins. This establishes a new …
(edit) @1403   24 years say1 taught HTMLPlug about shtml, asp, cgi, php and html query files …
(edit) @1401   24 years davidb Fixed small problem with associated files.
(edit) @1400   24 years davidb General tidying of code.
(edit) @1396   24 years say1 changed initialisation code for acronyms
(edit) @1393   24 years say1 acronym markup functionality
(edit) @1384   24 years paynter Changed language extraction to ignoer encoding information, so that …
(edit) @1379   24 years paynter Fixed bug that gave gsdlsourcedocument metadata relative path instead …
(edit) @1360   24 years say1 clarified status messages
Note: See TracRevisionLog for help on using the revision log.