Changeset 31269 for other-projects/hathitrust/wcsa/extracted-features-solr/trunk/solr-ingest/src/main/java/org/hathitrust/extractedfeatures/ProcessForSolrIngest.java
- Timestamp:
- 2016-12-28T10:30:08+13:00 (7 years ago)
- File:
-
- 1 edited
Legend:
- Unmodified
- Added
- Removed
-
other-projects/hathitrust/wcsa/extracted-features-solr/trunk/solr-ingest/src/main/java/org/hathitrust/extractedfeatures/ProcessForSolrIngest.java
r31266 r31269 113 113 DoubleAccumulator progress_accum = jsc.sc().doubleAccumulator("Progress Percent"); 114 114 115 System.err.println();116 System.err.println();117 System.err.println();118 System.err.println("****##### _input_dir = " + _input_dir);119 System.err.println();120 System.err.println();121 System.err.println();122 123 115 boolean icu_tokenize = Boolean.getBoolean("wcsa-ef-ingest.icu-tokenize"); 124 116 boolean strict_file_io = Boolean.getBoolean("wcsa-ef-ingest.strict-file-io"); … … 130 122 //json_list_data_rp.foreach(per_vol_json); 131 123 JavaRDD<String> per_page_ids = json_list_data_rp.flatMap(per_vol_json); 132 long num_page_ids = per_page_ids.count(); 133 134 long num_ids = num_volumes;124 long num_page_ids = per_page_ids.count(); // trigger lazy eval of: flatmap:per-vol 125 126 //long num_ids = num_volumes; 135 127 136 128 System.out.println("");
Note:
See TracChangeset
for help on using the changeset viewer.