Changeset 30949
- Timestamp: 2016-10-26T17:40:49+13:00
- Location: other-projects/hathitrust/solr-extracted-features/trunk/src/main/java/org/hathitrust
- Files: 2 edited
other-projects/hathitrust/solr-extracted-features/trunk/src/main/java/org/hathitrust/PagedJSON.java
--- r30947
+++ r30949
@@ -100,5 +100,5 @@
 // write it out
 
-String output_json_bz2 = page_json_dir +"/" + "pages/" +formatted_i + ".json.bz2";
+String output_json_bz2 = page_json_dir +"/" + formatted_i + ".json.bz2";
 
 ids.add(output_json_bz2);
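As a rough illustration of the PagedJSON.java change, the sketch below builds the output path the way each revision does, so the dropped "pages/" component is easy to see. The class name `PagePathSketch`, the two helper methods, and the sample directory and page values are hypothetical. Only the two concatenation expressions come from the diff, and the diff does not show how `formatted_i` is actually formatted.

```java
// Hypothetical sketch: only the two concatenation expressions are taken from the diff.
public class PagePathSketch {

    // Layout before r30949: page files went under a "pages/" subdirectory.
    public static String oldPath(String page_json_dir, String formatted_i) {
        return page_json_dir + "/" + "pages/" + formatted_i + ".json.bz2";
    }

    // Layout after r30949: page files go directly under page_json_dir.
    public static String newPath(String page_json_dir, String formatted_i) {
        return page_json_dir + "/" + formatted_i + ".json.bz2";
    }

    public static void main(String[] args) {
        String dir = "vol-dir";  // assumed sample directory name
        String page = "000001";  // assumed sample value for formatted_i

        System.out.println(oldPath(dir, page)); // vol-dir/pages/000001.json.bz2
        System.out.println(newPath(dir, page)); // vol-dir/000001.json.bz2
    }
}
```

Any code that reads these files back would presumably need to look in `page_json_dir` itself rather than in a `pages/` subdirectory after this revision.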
other-projects/hathitrust/solr-extracted-features/trunk/src/main/java/org/hathitrust/PrepareForIngest.java
--- r30945
+++ r30949
@@ -45,5 +45,5 @@
 JavaRDD<String> json_ids = json_list_data.flatMap(paged_json).cache();
 
-json_ids.saveAsTextFile(" foo");
+json_ids.saveAsTextFile("rdd-solr-json-page-files");
 
 long num_ids = json_ids.count();