|
|
@31106
|
6 years |
davidb |
Scripts to help run an rsync'd copy of gslistcluster1 …
|
|
|
@31105
|
6 years |
davidb |
Additional scripts to help with running solr locally out of /tmp area
|
|
|
@31104
|
6 years |
davidb |
now configurable to be run from local disk (/tmp)
|
|
|
@31103
|
6 years |
davidb |
Changes made after testing with 20 solr nodes
|
|
|
@31102
|
6 years |
davidb |
Command line way of running a Solr test query
|
|
|
@31101
|
6 years |
davidb |
Correction to collection name
|
|
|
@31100
|
6 years |
davidb |
Change to using solr-cloud-nodes that include port number
|
|
|
@31099
|
6 years |
davidb |
Changes resulting from test runs to get Zookeeper and Solr running on …
|
|
|
@31098
|
6 years |
davidb |
Changes resulting from test runs to get Zookeeper and Solr running on …
|
|
|
@31097
|
6 years |
davidb |
Changed to .in style namne
|
|
|
@31096
|
6 years |
davidb |
Only need to create a volume's pages output directory is _output_dir …
|
|
|
@31095
|
6 years |
davidb |
Introduced num-partitions property
|
|
|
@31094
|
6 years |
davidb |
Changes triggered by running on gsliscluster1
|
|
|
@31093
|
6 years |
davidb |
Changes triggered by running on gsliscluster1
|
|
|
@31092
|
6 years |
davidb |
Minor tweak to spark/hadoop combination downloaded
|
|
|
@31091
|
6 years |
davidb |
Change of number of core for 'gsliscluster1' machine; commmented out …
|
|
|
@31090
|
6 years |
davidb |
Memory monitor debugging code, commented out
|
|
|
@31089
|
6 years |
davidb |
Change in way the JSON file is read in. Motivation was a …
|
|
|
@31088
|
6 years |
davidb |
Shift to newIstance for FileSystem due to StackOverflow page …
|
|
|
@31082
|
6 years |
davidb |
Changes in response to testing on gchead
|
|
|
@31081
|
6 years |
davidb |
Going live with generation of spark slaves file
|
|
|
@31080
|
6 years |
davidb |
echo formatting tidy up. Fixed some typos
|
|
|
@31079
|
6 years |
davidb |
Useful get started scripts
|
|
|
@31078
|
6 years |
davidb |
Some setup files and scripts to make running Spark and Solr easier on …
|
|
|
@31077
|
6 years |
davidb |
Move up to JDK1.8. Tidy up of Vagrant machine names. Support for YARN. …
|
|
|
@31065
|
6 years |
davidb |
Additional echo output
|
|
|
@31062
|
6 years |
davidb |
Added in -W option so check-sum calculation is skipped
|
|
|
@31058
|
6 years |
davidb |
echo for additional information added
|
|
|
@31057
|
6 years |
davidb |
Tweak to jps output formatting
|
|
|
@31053
|
6 years |
davidb |
Addition of second argument, optional, for where to save the files
|
|
|
@31051
|
6 years |
davidb |
Added in JDK to list of possible packages needed
|
|
|
@31046
|
6 years |
davidb |
Added property to control how severe a JSON IO problem is
|
|
|
@31045
|
6 years |
davidb |
More careful treatment of what to do when a JSON file isn't there
|
|
|
@31044
|
6 years |
davidb |
Fixed up error when output_dir is empty
|
|
|
@31043
|
6 years |
davidb |
Version for processing full EF set
|
|
|
@31042
|
6 years |
davidb |
Name changes, preparing the way for FULL-RUN versions
|
|
|
@31041
|
6 years |
davidb |
Test needs to be more careful if -read-only specified
|
|
|
@31036
|
6 years |
davidb |
Renaming to prepare way for YARN version of script
|
|
|
@31035
|
6 years |
davidb |
Changes after testing scripts
|
|
|
@31034
|
6 years |
davidb |
Development of scripts for working with Full EF dataset
|
|
|
@31033
|
6 years |
davidb |
Development of scripts for working with Full EF dataset
|
|
|
@31030
|
6 years |
davidb |
Tweak to some verbosity level 2 printing
|
|
|
@31029
|
6 years |
davidb |
Newline at end of file added
|
|
|
@31028
|
6 years |
davidb |
Support for randonly choosing Solr endpoints added in
|
|
|
@31027
|
6 years |
davidb |
Mixed typo in property name used
|
|
|
@31026
|
6 years |
davidb |
Corrected flag setting
|
|
|
@31025
|
6 years |
davidb |
Use property process-json-mode to determine which sort of Spark …
|
|
|
@31024
|
6 years |
davidb |
Support for Java properties file
|
|
|
@31022
|
6 years |
davidb |
No longer used
|
|
|
@31021
|
6 years |
davidb |
Folder restructure to remove 'trunk' part
|
|
|
@31020
|
6 years |
davidb |
No longer used
|
|
|
@31019
|
6 years |
davidb |
Part 2 or two-step folder restructure
|
|
|
@31018
|
6 years |
davidb |
Part 1 or two-step folder restructure
|
|
|
@31017
|
6 years |
davidb |
Moved to correct position
|
|
|
@31016
|
6 years |
davidb |
No longer used
|
|
|
@31015
|
6 years |
davidb |
Restructuring of projects into one
|
|
|
@31013
|
6 years |
davidb |
Accumulator for PerPageMap
|
|
|
@31011
|
6 years |
davidb |
Further RDD flatMap/map restructuring and refactoring, for per-page
|
|
|
@31010
|
6 years |
davidb |
Tidy up on generating Spark App name
|
|
|
@31009
|
6 years |
davidb |
Adjustments after latest fresh 'vagrant up' trial
|
|
|
@31008
|
6 years |
davidb |
Additional detail added into Spark app name
|
|
|
@31007
|
6 years |
davidb |
Class name refactoring
|
|
|
@31006
|
6 years |
davidb |
Further reversal of Base class. Switch to PerPage
|
|
|
@31005
|
6 years |
davidb |
Reversal of Base class in PerVolumeJSON
|
|
|
@31004
|
6 years |
davidb |
added debug
|
|
|
@31003
|
6 years |
davidb |
Explicity default constructors added
|
|
|
@31002
|
6 years |
davidb |
Need to separate flatMap and foreach calls in PagedJSON
|
|
|
@31001
|
6 years |
davidb |
Code to work per-volume and per-page
|
|
|
@31000
|
6 years |
davidb |
Class name refactoring
|
|
|
@30999
|
6 years |
davidb |
Class name refactoring
|
|
|
@30998
|
6 years |
davidb |
Class name refactoring
|
|
|
@30997
|
6 years |
davidb |
Verbosity control over printing
|
|
|
@30996
|
6 years |
davidb |
Code refactoring
|
|
|
@30995
|
6 years |
davidb |
Adjustment of NUM_PARTITIONS to be based on Spark recommended calculation
|
|
|
@30994
|
6 years |
davidb |
Additional useful links. Links open in new tab
|
|
|
@30993
|
6 years |
davidb |
Placeholder page to provide useful links to hadoop and solr cluster …
|
|
|
@30992
|
6 years |
davidb |
Additional adjustments after test run on cluster
|
|
|
@30991
|
6 years |
davidb |
Inital cut at README notes, and supporting links
|
|
|
@30990
|
6 years |
davidb |
opt name change
|
|
|
@30989
|
6 years |
davidb |
Changes to better suit EF set used with solr
|
|
|
@30988
|
6 years |
davidb |
Changed flag to 'read-only' and changed the filed name full text saved …
|
|
|
@30986
|
6 years |
davidb |
Debugging for double accumulator added
|
|
|
@30985
|
6 years |
davidb |
Changed to run main processing method as action rather than transform. …
|
|
|
@30984
|
6 years |
davidb |
Introduction of Spark accumulator to measure progress. Output of POST …
|
|
|
@30983
|
6 years |
davidb |
Useful helper script
|
|
|
@30982
|
6 years |
davidb |
Fixed to host_name for solr2 and solr3
|
|
|
@30981
|
6 years |
davidb |
Useful folder for 'on-the-side' packages
|
|
|
@30980
|
6 years |
davidb |
Code added to read response
|
|
|
@30979
|
6 years |
davidb |
_solr_url needs to be stored in class!
|
|
|
@30978
|
6 years |
davidb |
Additional debug statements
|
|
|
@30977
|
6 years |
davidb |
Only have RDD if an output directory was specified on the command-line …
|
|
|
@30976
|
6 years |
davidb |
Change to reflect changed order of command-line arguments
|
|
|
@30975
|
6 years |
davidb |
Introduction of new solr-url command line argument, leading to some …
|
|
|
@30974
|
6 years |
davidb |
update/add/doc JSON structure needed
|
|
|
@30973
|
6 years |
davidb |
Changed to saving Solr JSON file for debugging purposes
|
|
|
@30972
|
6 years |
davidb |
addition of useful command needed before re-running
|
|
|
@30971
|
6 years |
davidb |
Adding in post to Solr cloud. Changed text_t to _text_
|
|
|
@30970
|
6 years |
davidb |
Added in mapping of EF-JSON to Solr 'add' JSON format
|
|
|
@30969
|
6 years |
davidb |
Fine tuning resulting from testing the cloud/cluster
|
|
|
@30962
|
6 years |
davidb |
Corrections and improvements made after initial testing between …
|
|
|