source: trunk/gsdl/perllib/classify

Revision Log Mode:


Legend:

Added
Modified
Copied or renamed
Diff Rev Age Author Log Message
(edit) @5646   21 years mdewsnip Added fix to prevent bad "W-P" type hlist entries, many thanks to Don …
(edit) @5645   21 years mdewsnip Moved classifier descriptions into the resource bundle …
(edit) @5615   21 years davidb Recopt (used in nested classifier hierarchies) added back in.
(edit) @5532   21 years kjdon added -nogroupyears, -sortmeta and -datemeta options. see comment in …
(edit) @4873   21 years mdewsnip Further work on standardising option descriptions. Specifically, in …
(edit) @4840   21 years davidb -removesuffix option added to provide similar funtionality to -removeprefix
(edit) @4786   21 years mdewsnip Commented out print_usage functions - classifiers should now call …
(edit) @4778   21 years mdewsnip Modified the code for generating the usage texts to use the methods in …
(edit) @4761   21 years mdewsnip Copied (from BasPlug.pm) a generic print usage function which works …
(edit) @4759   21 years mdewsnip Tidied up 'options' and 'arguments' structures (representing the …
(edit) @4225   21 years jrm21 if the "removeprefix" option makes the metadata value empty, we skip …
(edit) @3728   21 years sjboddie Prevent "undefined variable" warnings for documents that have no …
(edit) @3727   21 years sjboddie Reverted back last change to AZList.pm. Attempting to make it work …
(edit) @3719   21 years sjboddie Prevent dodgy format_string_english() functions from destroying any …
(edit) @3639   21 years kjdon modified the default sorting element, and fixed a bug.
(edit) @3540   22 years kjdon added John T's changes into CVS - added info to enable retrieval of …
(edit) @3536   22 years jrm21 set doc title to "" if it is undefined
(edit) @3529   22 years jrm21 fixed oversight where alpha_numeric_cmp was no longer being called …
(edit) @3510   22 years jrm21 need to check that remove_prefix is defined before checking its length
(edit) @3472   22 years kjdon renamed phind.pm to Phind.pm in keeping with the names of the other …
(edit) @3433   22 years jrm21 If a metadata value becomes empty (because of the removeprefix option) …
(edit) @3413   22 years jrm21 Added "\" to the characters we need to escape for classifying.
(edit) @3306   22 years davidb Removed some debugging print statements
(edit) @3303   22 years davidb Classifier extented to support frequency sort option through -freqsort …
(edit) @3302   22 years davidb Classifier modified so it does not include A-Z letters at top of page …
(edit) @3244   22 years jrm21 we no longer exit with an error if the suffix program failed to create …
(edit) @3181   22 years sjboddie Altered the getcharequiv() function so it now converts entities to raw …
(edit) @3109   22 years jrm21 When getting first char for classification, s/(.).*$/$1/g isn't good …
(edit) @2973   22 years sjboddie Fixed a bug in the Hierarchy classifier
(edit) @2956   22 years jrm21 Added Don Gourley's changes for getting Sections to work properly.
(edit) @2955   22 years jrm21 Added removeprefix option. Added better usage information of the options.
(edit) @2954   22 years jrm21 added a remove_prefix option to strip from metadata before sorting for …
(edit) @2916   22 years jrm21 Tidied up the usage output.
(edit) @2897   22 years sjboddie Added AZCompactSectionList which was contributed by Don Gourley …
(edit) @2889   22 years jrm21 Need to define $outhandle before using it in reclassify.
(edit) @2837   23 years sjboddie added hlist_at_top option to Hierarchy classifier
(edit) @2803   23 years sjboddie * empty log message *
(edit) @2685   23 years jrm21 Improved regex for when the last category is too small, and we need to …
(edit) @2666   23 years jrm21 Modified phind classifier so that special delimiters are always …
(edit) @2658   23 years jrm21 fixed a typo
(edit) @2632   23 years jrm21 added an option "-bymonth=1", to group by (eg) 2000-January, …
(edit) @2576   23 years sjboddie Moved phind's stopword directory from etc to etc/packages/phind
(edit) @2539   23 years sjboddie * empty log message *
(edit) @2529   23 years sjboddie added quoting to system calls in phind classifier - needed when …
(edit) @2510   23 years sjboddie renamed phind's stopwords directory and contents to use Win3.1 …
(edit) @2509   23 years sjboddie Fixed (bypassed really) a problem with the phind classifier on windows …
(edit) @2507   23 years sjboddie Tidied up the phind client a little more. It now belongs to the …
(edit) @2503   23 years sjboddie fixed a small bug in the datelist classifier that caused year ranges …
(edit) @2500   23 years sjboddie Removed test for phindcgi from phind classifier as it is no longer used
(edit) @2489   23 years dmm9 adding the browse interface as a classifier option
(edit) @2487   23 years sjboddie Changes to get phind working under windows
(edit) @2481   23 years kjm18 changed mgpp system calls to use the new executable names
(edit) @2333   23 years kjm18 closed all filehandles that had remained open, to fix the bug that was …
(edit) @2206   23 years paynter Annoying bug.
(edit) @2083   23 years paynter Fixed a stupid mistake that I know I've fixed before.
(edit) @2080   23 years jrm21 When creating nodes, now need to pass -buttonname instead of -title.
(edit) @2079   23 years paynter Added a new binary field to the savephrases output that indicates …
(edit) @2064   23 years paynter Sort thesaurus phrases by frequency then type.
(edit) @2025   23 years paynter You can now have several phind classifiers on one collection. This …
(edit) @2022   23 years sjboddie Caught some of the classifiers up with the documentation (finally). …
(edit) @2008   23 years paynter Marginally better support for non-English documents.
(edit) @2001   23 years sjboddie Added a hack that mysteriously converts iso639 language codes …
(edit) @1949   23 years paynter Fixed bug that prevented tokeniser from distinguishing between languages.
(edit) @1948   23 years jrm21 Updated to now pass arguments using the new parsargv list format, …
(edit) @1947   23 years dmm9 updated documentation
(edit) @1897   23 years paynter Convert_gml_into_tokens function a little more language tolerant, and …
(edit) @1890   23 years paynter When multiple metadata fields have multiple values, get them all. …
(edit) @1885   23 years paynter Added a classinfo.pl script, analogous to pluginfo.pl, that provides …
(edit) @1884   23 years paynter Added some documentation.
(edit) @1883   23 years paynter Supports new parameters of suffix program and new stopword file …
(edit) @1871   23 years paynter Use two-letter codes for language names, updated docs.
(edit) @1840   23 years paynter Changed default suffix size, clean up phrases.3 file
(edit) @1839   23 years paynter Updated classifiers to use the parsearg library instead of ad-hoc …
(edit) @1829   23 years paynter Accept a "thesaurus=name" option that identifies a thesaurus in a …
(edit) @1808   23 years paynter Option to save the phind phrases to a text file.
(edit) @1803   23 years paynter Moved the phind classifier's data directory into the index directory. …
(edit) @1716   23 years jrm21 minor change to allow the -title option to display correctly on HTML page.
(edit) @1646   24 years paynter Arguments for setting suffix program parameters.
(edit) @1645   24 years paynter Output less verbose & more consistant with buildcol.pl
(edit) @1643   24 years paynter The phind phrase browsing interface is now a Greenstone classifier. …
(edit) @1611   24 years sjboddie Minor bug fix
(edit) @1608   24 years nzdl Inserted an ugly hack into the Hierarchy classifier to mask a bug …
(edit) @1483   24 years sjboddie added -out option to classifiers
(edit) @1362   24 years say1 removed use statement so other files could be compiled with use strict …
(edit) @1313   24 years sjboddie Added Davids version of AZCompactList which handles multiple value metadata
(edit) @1250   24 years sjboddie Tidied up the classfiers slightly, made them a little more object …
(edit) @1086   24 years sjboddie Added AZCompactList.pm to distribution (and altered List.pm slightly …
(edit) @838   24 years davidb added options passed into 'new' subroutine
(edit) @837   24 years davidb added alpha_numeric search
(edit) @779   25 years sjboddie fixed bug in title option
(edit) @741   25 years sjboddie fixed up a bit of a bug - should fix this properly some time
(edit) @717   25 years sjboddie caught HTML classifier up with new browsing structure
(edit) @708   25 years sjboddie fixed problem with titles beginning with tags or html elements
(edit) @677   25 years sjboddie changes for new browsing support
(edit) @652   25 years sjboddie redesigned browsing support
(edit) @641   25 years sjboddie Fixed a harmless sort of an error that was causing uninitialized …
(edit) @618   25 years sjboddie removed a couple of instances where =~ /\w/ was used to test for an …
(edit) @545   25 years sjboddie changed sorting slightly
(edit) @537   25 years sjboddie added GPL headers
(edit) @449   25 years sjboddie new classifier
Note: See TracRevisionLog for help on using the revision log.