Timestamp:
2004-01-22T14:17:30+13:00
Author:
kjdon
Message:

Fiddled around with segmenting for Chinese text. Haven't changed how the
segmentation is done, or what character ranges are used.
But when it's done is now controlled by collect.cfg. There is a new
option, separate_cjk, with values true or false (default false). Segmentation
is only done if this is set to true. The import.pl script passes it as a
global option to all plugins, so the user just needs to add it once to the
config file, not as an option to every plugin.
The queryaction uses this option too, to determine whether or not to segment
the query.
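
To illustrate the option handling described above, here is a minimal sketch of turning a `separate_cjk` line into a boolean flag. Only the option name, its true/false values, the false default, and the `isSegmented` field come from this changeset. The `CollectInfo` struct, the `applyConfigLine` helper, and the whitespace-separated "key value" line format are assumptions made for the example, not the actual collectserver code.

```cpp
#include <sstream>
#include <string>

// Hypothetical stand-in for the collection info structure; only the
// isSegmented field name is taken from the changeset.
struct CollectInfo {
  bool isSegmented = false;  // mirrors the documented default of false
};

// Hypothetical helper: applies one "key value" config line (assumed format).
// Any value other than the exact string "true" switches segmentation off,
// and unrelated keys leave the flag untouched.
void applyConfigLine(const std::string& line, CollectInfo& info) {
  std::istringstream in(line);
  std::string key, value;
  in >> key >> value;
  if (key == "separate_cjk") {
    info.isSegmented = (value == "true");
  }
}
```

With this sketch, applying the line `separate_cjk true` sets `isSegmented`, while omitting the line leaves segmentation off.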

File:
1 edited

  • trunk/gsdl/src/colservr/collectserver.cpp

    r5868  r6584
    101    101        collectinfo.searchTypes = cfgline;
    102    102      }
    103
           103      else if (key == "separate_cjk") {
           104        if (value == "true") collectinfo.isSegmented = true;
           105        else collectinfo.isSegmented = false;
           106      }
    104    107      // What have we set in our collect.cfg file :  document or collection ?
    105    108      else if (key == "authenticate") collectinfo.authenticate = value;