Changeset 31252 for other-projects/hathitrust/wcsa/extracted-features-solr/trunk/solr-ingest/src/main/java/org/hathitrust/extractedfeatures/PerVolumeWordStreamFlatmap.java
- Timestamp:
- 2016-12-20T14:15:05+13:00 (7 years ago)
- File:
-
- 1 edited
Legend:
- Unmodified
- Added
- Removed
-
other-projects/hathitrust/wcsa/extracted-features-solr/trunk/solr-ingest/src/main/java/org/hathitrust/extractedfeatures/PerVolumeWordStreamFlatmap.java
r31242 r31252 20 20 protected double _progress_step; 21 21 22 boolean _icu_tokenize; 22 23 boolean _strict_file_io; 23 24 24 25 public PerVolumeWordStreamFlatmap(String input_dir, int verbosity, 25 26 DoubleAccumulator progress_accum, double progress_step, 27 boolean icu_tokenize, 26 28 boolean strict_file_io) 27 29 { … … 32 34 _progress_step = progress_step; 33 35 36 _icu_tokenize = icu_tokenize; 34 37 _strict_file_io = strict_file_io; 35 38 } … … 87 90 if (ef_page != null) { 88 91 89 ArrayList<String> page_word_list = SolrDocJSON.generateTokenPosCountText(volume_id, page_id, ef_page );92 ArrayList<String> page_word_list = SolrDocJSON.generateTokenPosCountText(volume_id, page_id, ef_page, _icu_tokenize); 90 93 all_word_list.addAll(page_word_list); 91 94 }
Note:
See TracChangeset
for help on using the changeset viewer.