Ticket #359 (new enhancement)

Opened 12 years ago

Last modified 11 years ago

CJK character segmentation

Reported by: kjdon Owned by: nobody
Priority: low Milestone: Greenstone 2 wishlist
Component: Greenstone2 Runtime Severity: enhancement
Keywords: Cc:


Need to implement handling for high number ranges. code values are in the code, but commented out. perllib/cnsseg.pm runtime-src/src/recpt/querytools.cpp

text_t can't handle numbers > 0xffff (unsigned short).

Change History

Changed 12 years ago by kjdon

Also, currently the text going to the compressed text and to the index is segmented. Would be nice to only segment the text going to the index. Easy to do, but if we do this search term highlighting doesn't work.

So, need to fix up the search term highlighting, then fix this.

Changed 11 years ago by kjdon

  • milestone changed from Release 2.82 to Release 2.83
Note: See TracTickets for help on using tickets.