Changeset 11245


Timestamp:
2006-02-14T15:36:29+13:00 (15 years ago)
Author:
kjdon
Message:

By default, the Lucene indexer indexes only the first 10,000 words of a document, to avoid out-of-memory errors. I have set the max doc length to the maximum integer value. Hope this is OK.

Location:
trunk
Files:
2 edited

  • trunk/gsdl/src/java/org/nzdl/gsdl/LuceneWrap/Indexer.java

    r10164  r11245
    52      52
    53      53          writer_ = new IndexWriter(index_dir.getPath(), new StandardAnalyzer(), create);
            54          // by default, will only index 10,000 words per document
            55          // Can throw out_of_memory errors
            56          writer_.maxFieldLength = Integer.MAX_VALUE;
    54      57          if (create) {
    55      58          writer_.optimize();
  • trunk/indexers/lucene-gs/src/org/greenstone/LuceneWrapper/Indexer.java

    r10164  r11245
    52      52
    53      53          writer_ = new IndexWriter(index_dir.getPath(), new StandardAnalyzer(), create);
            54          // by default, will only index 10,000 words per document
            55          // Can throw out_of_memory errors
            56          writer_.maxFieldLength = Integer.MAX_VALUE;
    54      57          if (create) {
    55      58          writer_.optimize();
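
The effect of the maxFieldLength change can be illustrated without Lucene itself. The sketch below is a stand-in, not Lucene's code: the class FieldLimitDemo and its methods countIndexedTerms and buildDocument are hypothetical names, and the whitespace tokenizer is an assumption standing in for StandardAnalyzer. It only shows how a term cap of 10,000 silently drops the tail of a long document, and how a cap of Integer.MAX_VALUE keeps everything.

```java
import java.util.StringTokenizer;

// Hypothetical stand-in for the indexer's term cap; this is not Lucene code.
class FieldLimitDemo {

    // Counts how many whitespace-separated terms are kept when indexing
    // stops after maxFieldLength terms.
    static int countIndexedTerms(String text, int maxFieldLength) {
        StringTokenizer tokens = new StringTokenizer(text);
        int kept = 0;
        while (tokens.hasMoreTokens() && kept < maxFieldLength) {
            tokens.nextToken();
            kept++;
        }
        return kept;
    }

    // Builds a synthetic document containing the given number of words.
    static String buildDocument(int words) {
        StringBuilder sb = new StringBuilder();
        for (int i = 0; i < words; i++) {
            sb.append("word").append(i).append(' ');
        }
        return sb.toString();
    }

    public static void main(String[] args) {
        String doc = buildDocument(25000);
        // With the default-style cap, only the first 10,000 terms are seen.
        System.out.println(countIndexedTerms(doc, 10000));
        // With the cap raised to Integer.MAX_VALUE, every term is seen.
        System.out.println(countIndexedTerms(doc, Integer.MAX_VALUE));
    }
}
```

With the cap removed, memory use grows with the length of the longest document, which is the trade-off the commit message raises.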