source: trunk/gsdl/perllib/plugins/BasPlug.pm

Revision Log Mode:


Legend:

Added
Modified
Copied or renamed
Diff Rev Age Author Log Message
(edit) @1954   23 years jmt14 * empty log message *
(edit) @1903   23 years sjboddie We now use textcats best guess if it returns 3 or less possibilities …
(edit) @1874   23 years sjboddie * empty log message *
(edit) @1870   23 years sjboddie Tidied up language support stuff.
(edit) @1868   23 years sjboddie Made a bunch of changes to the building code to support lots of new …
(edit) @1857   23 years dmm9 date extraction options documented
(edit) @1855   23 years paynter Trivial change to warning message.
(edit) @1846   23 years sjboddie Removed a call to a function that I removed in my previous changes - oops
(edit) @1845   23 years paynter Changed a "!=" to a "ne".
(edit) @1844   23 years sjboddie Added an 'auto' argument to BasPlug's '-input_encoding' option ('auto' …
(edit) @1838   23 years sjboddie Added support for Cyrillic languages (windows codepage 1251) - yet to …
(edit) @1756   23 years say1 added detection and handling of unreadable files
(edit) @1720   23 years dmm9 Added information to the usage text about date extraction option
(edit) @1719   23 years dmm9 Added information to the usage text about date extraction option
(edit) @1718   23 years dmm9 Added information to the usage text about date extraction option
(edit) @1686   23 years jrm21 HTMLPlug no longer blocks .pdf files. (also updated reference to this …
(edit) @1605   24 years say1 fixed some of my earlier mistakes. sorry Stefan
(edit) @1602   24 years say1 metadata extraction work. (email addresses, generalised HTML tags, …
(edit) @1424   24 years sjboddie Added a -out option to most of the perl building scripts to allow …
(edit) @1411   24 years dmm9 added the options for the date extractor
(edit) @1396   24 years say1 changed initialisation code for acronyms
(edit) @1393   24 years say1 acronym markup functionality
(edit) @1384   24 years paynter Changed language extraction to ignoer encoding information, so that …
(edit) @1379   24 years paynter Fixed bug that gave gsdlsourcedocument metadata relative path instead …
(edit) @1360   24 years say1 clarified status messages
(edit) @1335   24 years say1 many acronym changes
(edit) @1317   24 years paynter Added -extract_language option, which uses the textcat language …
(edit) @1244   24 years sjboddie Caught up most general plugins (that's the ones in …
(edit) @1242   24 years sjboddie Added Stuart Yeate's acronym extraction code and made it a standard …
(edit) @1229   24 years sjboddie fixed bug in options
(edit) @1227   24 years sjboddie Modified the perl code for importing arabic encoded documents. Plugins …
(edit) @1219   24 years sjboddie Made BasPlug take options (these options are available to all plugins …
(edit) @839   24 years davidb added extra_metadata function
(edit) @537   25 years sjboddie added GPL headers
(edit) @317   25 years sjboddie Added maxdocs option
(add) @4   25 years sjboddie Initial revision
Note: See TracRevisionLog for help on using the revision log.