|
|
@30975
|
8 years |
davidb |
Introduction of new solr-url command line argument, leading to some …
|
|
|
@30974
|
8 years |
davidb |
update/add/doc JSON structure needed
|
|
|
@30973
|
8 years |
davidb |
Changed to saving Solr JSON file for debugging purposes
|
|
|
@30971
|
8 years |
davidb |
Adding in post to Solr cloud. Changed text_t to _text_
|
|
|
@30970
|
8 years |
davidb |
Added in mapping of EF-JSON to Solr 'add' JSON format
|
|
|
@30953
|
8 years |
davidb |
Need to specify _output_dir as part of output JSON filename
|
|
|
@30951
|
8 years |
davidb |
Save a JSONObject as a file in the output directory
|
|
|
@30949
|
8 years |
davidb |
Use better name than 'foo'. Further fix to JSON name generated
|
|
|
@30947
|
8 years |
davidb |
Correction to 'pages-' part of JSON.bz2 output filename used
|
|
|
@30946
|
8 years |
davidb |
Correction to output JSON.bz2 name generated
|
|
|
@30945
|
8 years |
davidb |
Getting closer to writing out JSON files
|
|
|
@30944
|
8 years |
davidb |
Forcer higher partition (6) than default, which seems to be 2
|
|
|
@30943
|
8 years |
davidb |
Extra debug info
|
|
|
@30942
|
8 years |
davidb |
Improved output printing for slave node
|
|
|
@30941
|
8 years |
davidb |
Moved to getFileSystemInstance() method to play nice on cluster
|
|
|
@30940
|
8 years |
davidb |
Change to using URI not fileIn directly
|
|
|
@30938
|
8 years |
davidb |
Experiment with using Hadoop's FileSystem class for local file:// access
|
|
|
@30937
|
8 years |
davidb |
Expanded set of ClusterFileIO methods
|
|
|
@30934
|
8 years |
davidb |
Providing json-filelist now a compulsory argument, rather than an option
|
|
|
@30933
|
8 years |
davidb |
More careful parsing of file prefix
|
|
|
@30932
|
8 years |
davidb |
Support both file:// and hdfs://
|
|
|
@30924
|
8 years |
davidb |
Tidy up of code. Removed commented out code
|
|
|
@30921
|
8 years |
davidb |
Code change to read in JSON file over HDFS
|
|
|
@30918
|
8 years |
davidb |
More flexible command-line args
|
|
|
@30898
|
8 years |
davidb |
Scripts for downloading sample JSON data from public domain extracted …
|