|
|
@2503
|
23 years |
sjboddie |
fixed a small bug in the datelist classifier that caused year ranges …
|
|
|
@2500
|
23 years |
sjboddie |
Removed test for phindcgi from phind classifier as it is no longer used
|
|
|
@2493
|
23 years |
paynter |
Changed at the request of Marcio - see mailing list.
|
|
|
@2492
|
23 years |
paynter |
Fixed trivial bug in the new set_OID function.
|
|
|
@2489
|
23 years |
dmm9 |
adding the browse interface as a classifier option
|
|
|
@2487
|
23 years |
sjboddie |
Changes to get phind working under windows
|
|
|
@2484
|
23 years |
say1 |
Changed SplitPlug to allow control over the OID. Changed BibTexPlug to …
|
|
|
@2483
|
23 years |
say1 |
added a "if" to catch the case where someone tries to convert an …
|
|
|
@2481
|
23 years |
kjm18 |
changed mgpp system calls to use the new executable names
|
|
|
@2480
|
23 years |
kjm18 |
added the store_text option as done in mgbuildproc.pm
|
|
|
@2479
|
23 years |
kjm18 |
added indexmap and indexfieldmap to build.cfg fields
|
|
|
@2478
|
23 years |
kjm18 |
brought it in line with changes to buildcol.pl, mgbuilder.pm
now uses …
|
|
|
@2453
|
23 years |
jrm21 |
Slightly smarter title extraction from body's text.
|
|
|
@2452
|
23 years |
jrm21 |
-title_sub works now -- previously had a leading "--" argument, which …
|
|
|
@2451
|
23 years |
jrm21 |
PSPlug now uses the -title_sub option to TEXTPlug, to remove any …
|
|
|
@2450
|
23 years |
jrm21 |
now accepts the "-title_sub" option, a regexp to remove when …
|
|
|
@2432
|
23 years |
say1 |
switched the order of removing the symbolic link and checking for …
|
|
|
@2412
|
23 years |
sjboddie |
Added a tar archive of all the perl modules required to make ping.pl work
|
|
|
@2364
|
23 years |
jrm21 |
turn "\" into " " so that we don't lose backslashes along the way…
|
|
|
@2363
|
23 years |
jrm21 |
fixed nasty bug where </srclink></a><srclink> was being matched …
|
|
|
@2359
|
23 years |
sjboddie |
Altered the help text a little for mkcol.pl, import.pl, buildcol.pl, …
|
|
|
@2356
|
23 years |
sjboddie |
Renamed HBSPlug BookPlug in the hope that it's a little less crytic
|
|
|
@2355
|
23 years |
sjboddie |
All options to import.pl and buildcol.pl may now be specified from …
|
|
|
@2342
|
23 years |
sjboddie |
renamed HTMLPlug's w3mir option to file_is_url
|
|
|
@2336
|
23 years |
sjboddie |
added a -no_text option to buildcol.pl to allow collections to be …
|
|
|
@2333
|
23 years |
kjm18 |
closed all filehandles that had remained open, to fix the bug that was …
|
|
|
@2327
|
23 years |
sjboddie |
* empty log message *
|
|
|
@2326
|
23 years |
sjboddie |
fixed a small bug in the new XML gml code that caused metadata tags …
|
|
|
@2267
|
23 years |
davidb |
GML file syntax altered to be XML compliant. This basically meant …
|
|
|
@2241
|
23 years |
sjboddie |
Tidied up the ConvertToPlug stuff to get it working on Windows 95/98
|
|
|
@2237
|
23 years |
sjboddie |
Added a unicode2koi8r function to unicode.pm (because I needed one). …
|
|
|
@2235
|
23 years |
sjboddie |
Hacked the textcat package about so that it only reads all the …
|
|
|
@2230
|
23 years |
paynter |
User can erquest a "Screen" image - essentially a second thumbnail. …
|
|
|
@2228
|
23 years |
paynter |
The -use_metadata_files option tells RecPlug to read any metadata XML …
|
|
|
@2226
|
23 years |
paynter |
Image size metadata fixed, dummy text added, Image filename fixed.
|
|
|
@2224
|
23 years |
paynter |
When the document has associated files, a metadata element …
|
|
|
@2219
|
23 years |
sjboddie |
Had another go at suppressing the "subroutine redefined" warnings as …
|
|
|
@2209
|
23 years |
sjboddie |
Suppressed some annoying perl warnings
|
|
|
@2207
|
23 years |
paynter |
Bugfixes: read returns number of files instead of file type, and …
|
|
|
@2206
|
23 years |
paynter |
Annoying bug.
|
|
|
@2193
|
23 years |
sjboddie |
soft_link function now acts as a simple "copy" function on windows
|
|
|
@2096
|
23 years |
jrm21 |
Minor changes to regexs, so that header fields have to be at start of …
|
|
|
@2086
|
23 years |
jrm21 |
We create a copy of any args to new() because parsargs might modify …
|
|
|
@2085
|
23 years |
jrm21 |
When importing, we need to escape any escape codes otherwise mg(?) …
|
|
|
@2084
|
23 years |
jrm21 |
usage message is now formatted to fit within 80 columns.
|
|
|
@2083
|
23 years |
paynter |
Fixed a stupid mistake that I know I've fixed before.
|
|
|
@2082
|
23 years |
jrm21 |
added bzip2 support (untested).
|
|
|
@2080
|
23 years |
jrm21 |
When creating nodes, now need to pass -buttonname instead of -title.
|
|
|
@2079
|
23 years |
paynter |
Added a new binary field to the savephrases output that indicates …
|
|
|
@2064
|
23 years |
paynter |
Sort thesaurus phrases by frequency then type.
|
|
|
@2048
|
23 years |
sjboddie |
* empty log message *
|
|
|
@2041
|
23 years |
jrm21 |
don't strip all whitespace from tmp filename, only from base name. …
|
|
|
@2040
|
23 years |
sjboddie |
* empty log message *
|
|
|
@2039
|
23 years |
jrm21 |
do eval{symlink()} because platforms that don't support symlink …
|
|
|
@2036
|
23 years |
jrm21 |
don't use strict; anymore, as we want to be able to write error msgs …
|
|
|
@2029
|
23 years |
jrm21 |
Return 0 instead of "" on error in read() so that RecPlug can continue.
|
|
|
@2027
|
23 years |
jrm21 |
read() is now completely independent of BasPlug::read(), as the latter …
|
|
|
@2025
|
23 years |
paynter |
You can now have several phind classifiers on one collection. This …
|
|
|
@2024
|
23 years |
paynter |
Store classifier-specific parameters in gdbm file if required. …
|
|
|
@2022
|
23 years |
sjboddie |
Caught some of the classifiers up with the documentation (finally). …
|
|
|
@2018
|
23 years |
jrm21 |
removed "use BasPlug" lines from metadata extractors, as they …
|
|
|
@2008
|
23 years |
paynter |
Marginally better support for non-English documents.
|
|
|
@2007
|
23 years |
sjboddie |
* empty log message *
|
|
|
@2001
|
23 years |
sjboddie |
Added a hack that mysteriously converts iso639 language codes …
|
|
|
@2000
|
23 years |
sjboddie |
Re-added iso639.pm
|
|
|
@1999
|
23 years |
sjboddie |
Fixed a small problem with language detection code.
|
|
|
@1995
|
23 years |
jmt14 |
* empty log message *
|
|
|
@1989
|
23 years |
jmt14 |
* empty log message *
|
|
|
@1974
|
23 years |
cs025 |
Fixed omission of encoding from parameters in read_file
|
|
|
@1973
|
23 years |
kjm18 |
fixed up language stuff
|
|
|
@1972
|
23 years |
jmt14 |
* empty log message *
|
|
|
@1954
|
23 years |
jmt14 |
* empty log message *
|
|
|
@1949
|
23 years |
paynter |
Fixed bug that prevented tokeniser from distinguishing between languages.
|
|
|
@1948
|
23 years |
jrm21 |
Updated to now pass arguments using the new parsargv list format, …
|
|
|
@1947
|
23 years |
dmm9 |
updated documentation
|
|
|
@1929
|
23 years |
dg5 |
Modified: ConvertToPlug and HTMLPlug to handle files in binary mode to …
|
|
|
@1920
|
23 years |
sjboddie |
* empty log message *
|
|
|
@1919
|
23 years |
sjboddie |
* empty log message *
|
|
|
@1917
|
23 years |
kjm18 |
minor changes
|
|
|
@1905
|
23 years |
sjboddie |
* empty log message *
|
|
|
@1904
|
23 years |
sjboddie |
Added support for a couple more encodings that I'm told are in common …
|
|
|
@1903
|
23 years |
sjboddie |
We now use textcats best guess if it returns 3 or less possibilities …
|
|
|
@1901
|
23 years |
sjboddie |
* empty log message *
|
|
|
@1897
|
23 years |
paynter |
Convert_gml_into_tokens function a little more language tolerant,
and …
|
|
|
@1895
|
23 years |
jrm21 |
Email plug now uses SplitPlug for mbox mail files. Hopefully this …
|
|
|
@1894
|
23 years |
jrm21 |
updated by copying BasPlug's new language/encoding stuff over for the …
|
|
|
@1891
|
23 years |
paynter |
Named characters like é and ì are translated
to UTF8 …
|
|
|
@1890
|
23 years |
paynter |
When multiple metadata fields have multiple values, get them all. …
|
|
|
@1885
|
23 years |
paynter |
Added a classinfo.pl script, analogous to pluginfo.pl, that provides …
|
|
|
@1884
|
23 years |
paynter |
Added some documentation.
|
|
|
@1883
|
23 years |
paynter |
Supports new parameters of suffix program and new stopword file …
|
|
|
@1874
|
23 years |
sjboddie |
* empty log message *
|
|
|
@1871
|
23 years |
paynter |
Use two-letter codes for language names, updated docs.
|
|
|
@1870
|
23 years |
sjboddie |
Tidied up language support stuff.
|
|
|
@1869
|
23 years |
paynter |
Regular expression fix.
|
|
|
@1868
|
23 years |
sjboddie |
Made a bunch of changes to the building code to support lots of new …
|
|
|
@1857
|
23 years |
dmm9 |
date extraction options documented
|
|
|
@1855
|
23 years |
paynter |
Trivial change to warning message.
|
|
|
@1852
|
23 years |
kjm18 |
heaps of changes
|
|
|
@1851
|
23 years |
kjm18 |
added levels and buildtype for mgpp collections
|
|
|