Changeset 31252 for other-projects/hathitrust/wcsa/extracted-features-solr/trunk/solr-ingest/src/main/java/org/hathitrust/extractedfeatures/ProcessForWhitelist.java
- Timestamp:
- 2016-12-20T14:15:05+13:00 (7 years ago)
- File:
-
- 1 edited
Legend:
- Unmodified
- Added
- Removed
-
other-projects/hathitrust/wcsa/extracted-features-solr/trunk/solr-ingest/src/main/java/org/hathitrust/extractedfeatures/ProcessForWhitelist.java
r31251 r31252 72 72 73 73 boolean strict_file_io = Boolean.getBoolean("wcsa-ef-ingest.strict-file-io"); 74 74 boolean icu_tokenize = Boolean.getBoolean("wcsa-ef-ingest.icu-tokenize"); 75 75 76 PerVolumeWordStreamFlatmap paged_solr_wordfreq_flatmap 76 77 = new PerVolumeWordStreamFlatmap(_input_dir,_verbosity, 77 78 per_vol_progress_accum,per_vol, 79 icu_tokenize, 78 80 strict_file_io); 79 81 JavaRDD<String> words = json_list_data.flatMap(paged_solr_wordfreq_flatmap);
Note:
See TracChangeset
for help on using the changeset viewer.