source: trunk/gsdl/perllib

Revision Log Mode:


Legend:

Added
Modified
Copied or renamed
Diff Rev Age Author Log Message
(edit) @2503   23 years sjboddie fixed a small bug in the datelist classifier that caused year ranges …
(edit) @2500   23 years sjboddie Removed test for phindcgi from phind classifier as it is no longer used
(edit) @2493   23 years paynter Changed at the request of Marcio - see mailing list.
(edit) @2492   23 years paynter Fixed trivial bug in the new set_OID function.
(edit) @2489   23 years dmm9 adding the browse interface as a classifier option
(edit) @2487   23 years sjboddie Changes to get phind working under windows
(edit) @2484   23 years say1 Changed SplitPlug to allow control over the OID. Changed BibTexPlug to …
(edit) @2483   23 years say1 added a "if" to catch the case where someone tries to convert an …
(edit) @2481   23 years kjm18 changed mgpp system calls to use the new executable names
(edit) @2480   23 years kjm18 added the store_text option as done in mgbuildproc.pm
(edit) @2479   23 years kjm18 added indexmap and indexfieldmap to build.cfg fields
(edit) @2478   23 years kjm18 brought it in line with changes to buildcol.pl, mgbuilder.pm now uses …
(edit) @2453   23 years jrm21 Slightly smarter title extraction from body's text.
(edit) @2452   23 years jrm21 -title_sub works now -- previously had a leading "--" argument, which …
(edit) @2451   23 years jrm21 PSPlug now uses the -title_sub option to TEXTPlug, to remove any …
(edit) @2450   23 years jrm21 now accepts the "-title_sub" option, a regexp to remove when …
(edit) @2432   23 years say1 switched the order of removing the symbolic link and checking for …
(edit) @2412   23 years sjboddie Added a tar archive of all the perl modules required to make ping.pl work
(edit) @2364   23 years jrm21 turn "\" into " " so that we don't lose backslashes along the way…
(edit) @2363   23 years jrm21 fixed nasty bug where </srclink></a><srclink> was being matched …
(edit) @2359   23 years sjboddie Altered the help text a little for mkcol.pl, import.pl, buildcol.pl, …
(edit) @2356   23 years sjboddie Renamed HBSPlug BookPlug in the hope that it's a little less crytic
(edit) @2355   23 years sjboddie All options to import.pl and buildcol.pl may now be specified from …
(edit) @2342   23 years sjboddie renamed HTMLPlug's w3mir option to file_is_url
(edit) @2336   23 years sjboddie added a -no_text option to buildcol.pl to allow collections to be …
(edit) @2333   23 years kjm18 closed all filehandles that had remained open, to fix the bug that was …
(edit) @2327   23 years sjboddie * empty log message *
(edit) @2326   23 years sjboddie fixed a small bug in the new XML gml code that caused metadata tags …
(edit) @2267   23 years davidb GML file syntax altered to be XML compliant. This basically meant …
(edit) @2241   23 years sjboddie Tidied up the ConvertToPlug stuff to get it working on Windows 95/98
(edit) @2237   23 years sjboddie Added a unicode2koi8r function to unicode.pm (because I needed one). …
(edit) @2235   23 years sjboddie Hacked the textcat package about so that it only reads all the …
(edit) @2230   23 years paynter User can erquest a "Screen" image - essentially a second thumbnail. …
(edit) @2228   23 years paynter The -use_metadata_files option tells RecPlug to read any metadata XML …
(edit) @2226   23 years paynter Image size metadata fixed, dummy text added, Image filename fixed.
(edit) @2224   23 years paynter When the document has associated files, a metadata element …
(edit) @2219   23 years sjboddie Had another go at suppressing the "subroutine redefined" warnings as …
(edit) @2209   23 years sjboddie Suppressed some annoying perl warnings
(edit) @2207   23 years paynter Bugfixes: read returns number of files instead of file type, and …
(edit) @2206   23 years paynter Annoying bug.
(edit) @2193   23 years sjboddie soft_link function now acts as a simple "copy" function on windows
(edit) @2096   23 years jrm21 Minor changes to regexs, so that header fields have to be at start of …
(edit) @2086   23 years jrm21 We create a copy of any args to new() because parsargs might modify …
(edit) @2085   23 years jrm21 When importing, we need to escape any escape codes otherwise mg(?) …
(edit) @2084   23 years jrm21 usage message is now formatted to fit within 80 columns.
(edit) @2083   23 years paynter Fixed a stupid mistake that I know I've fixed before.
(edit) @2082   23 years jrm21 added bzip2 support (untested).
(edit) @2080   23 years jrm21 When creating nodes, now need to pass -buttonname instead of -title.
(edit) @2079   23 years paynter Added a new binary field to the savephrases output that indicates …
(edit) @2064   23 years paynter Sort thesaurus phrases by frequency then type.
(edit) @2048   23 years sjboddie * empty log message *
(edit) @2041   23 years jrm21 don't strip all whitespace from tmp filename, only from base name. …
(edit) @2040   23 years sjboddie * empty log message *
(edit) @2039   23 years jrm21 do eval{symlink()} because platforms that don't support symlink …
(edit) @2036   23 years jrm21 don't use strict; anymore, as we want to be able to write error msgs …
(edit) @2029   23 years jrm21 Return 0 instead of "" on error in read() so that RecPlug can continue.
(edit) @2027   23 years jrm21 read() is now completely independent of BasPlug::read(), as the latter …
(edit) @2025   23 years paynter You can now have several phind classifiers on one collection. This …
(edit) @2024   23 years paynter Store classifier-specific parameters in gdbm file if required. …
(edit) @2022   23 years sjboddie Caught some of the classifiers up with the documentation (finally). …
(edit) @2018   23 years jrm21 removed "use BasPlug" lines from metadata extractors, as they …
(edit) @2008   23 years paynter Marginally better support for non-English documents.
(edit) @2007   23 years sjboddie * empty log message *
(edit) @2001   23 years sjboddie Added a hack that mysteriously converts iso639 language codes …
(edit) @2000   23 years sjboddie Re-added iso639.pm
(edit) @1999   23 years sjboddie Fixed a small problem with language detection code.
(edit) @1995   23 years jmt14 * empty log message *
(edit) @1989   23 years jmt14 * empty log message *
(edit) @1974   23 years cs025 Fixed omission of encoding from parameters in read_file
(edit) @1973   23 years kjm18 fixed up language stuff
(edit) @1972   23 years jmt14 * empty log message *
(edit) @1954   23 years jmt14 * empty log message *
(edit) @1949   23 years paynter Fixed bug that prevented tokeniser from distinguishing between languages.
(edit) @1948   23 years jrm21 Updated to now pass arguments using the new parsargv list format, …
(edit) @1947   23 years dmm9 updated documentation
(edit) @1929   23 years dg5 Modified: ConvertToPlug and HTMLPlug to handle files in binary mode to …
(edit) @1920   23 years sjboddie * empty log message *
(edit) @1919   23 years sjboddie * empty log message *
(edit) @1917   23 years kjm18 minor changes
(edit) @1905   23 years sjboddie * empty log message *
(edit) @1904   23 years sjboddie Added support for a couple more encodings that I'm told are in common …
(edit) @1903   23 years sjboddie We now use textcats best guess if it returns 3 or less possibilities …
(edit) @1901   23 years sjboddie * empty log message *
(edit) @1897   23 years paynter Convert_gml_into_tokens function a little more language tolerant, and …
(edit) @1895   23 years jrm21 Email plug now uses SplitPlug for mbox mail files. Hopefully this …
(edit) @1894   23 years jrm21 updated by copying BasPlug's new language/encoding stuff over for the …
(edit) @1891   23 years paynter Named characters like &eacute; and &igrave; are translated to UTF8 …
(edit) @1890   23 years paynter When multiple metadata fields have multiple values, get them all. …
(edit) @1885   23 years paynter Added a classinfo.pl script, analogous to pluginfo.pl, that provides …
(edit) @1884   23 years paynter Added some documentation.
(edit) @1883   23 years paynter Supports new parameters of suffix program and new stopword file …
(edit) @1874   23 years sjboddie * empty log message *
(edit) @1871   23 years paynter Use two-letter codes for language names, updated docs.
(edit) @1870   23 years sjboddie Tidied up language support stuff.
(edit) @1869   23 years paynter Regular expression fix.
(edit) @1868   23 years sjboddie Made a bunch of changes to the building code to support lots of new …
(edit) @1857   23 years dmm9 date extraction options documented
(edit) @1855   23 years paynter Trivial change to warning message.
(edit) @1852   23 years kjm18 heaps of changes
(edit) @1851   23 years kjm18 added levels and buildtype for mgpp collections
Note: See TracRevisionLog for help on using the revision log.