Opened 13 years ago

Last modified 12 years ago

#359 new enhancement

CJK character segmentation

Reported by: kjdon Owned by: nobody
Priority: low Milestone: Greenstone 2 wishlist
Component: Greenstone2 Runtime Severity: enhancement
Keywords: Cc:

Description

Need to implement handling for high number ranges. code values are in the code, but commented out. perllib/cnsseg.pm runtime-src/src/recpt/querytools.cpp

text_t can't handle numbers > 0xffff (unsigned short).

Change History (2)

comment:1 by kjdon, 13 years ago

Also, currently the text going to the compressed text and to the index is segmented. Would be nice to only segment the text going to the index. Easy to do, but if we do this search term highlighting doesn't work.

So, need to fix up the search term highlighting, then fix this.

comment:2 by kjdon, 12 years ago

Milestone: Release 2.82Release 2.83
Note: See TracTickets for help on using tickets.