|
|
@31215
|
7 years |
davidb |
Changed back to Guava 20 API, now mvn shading allows me to have this …
|
|
|
@31214
|
7 years |
davidb |
Not needed now using mvn shading
|
|
|
@31213
|
7 years |
davidb |
Tidy up
|
|
|
@31212
|
7 years |
davidb |
Changed from mvn assemblhy to shadowing, which has more control
|
|
|
@31211
|
7 years |
davidb |
Changing back to regular Guava classes. Looking to use maven shading …
|
|
|
@31209
|
7 years |
davidb |
checkArgument added in
|
|
|
@31207
|
7 years |
davidb |
And some more tweaking
|
|
|
@31206
|
7 years |
davidb |
More tweaking of Guava cloned code
|
|
|
@31205
|
7 years |
davidb |
Next added in part of new Guava code
|
|
|
@31204
|
7 years |
davidb |
Splicing in Guava verion 20 of BloomFilter into code as own class (now …
|
|
|
@31203
|
7 years |
davidb |
Use class provided stringFunnel
|
|
|
@31202
|
7 years |
davidb |
Turns out Spark uses Guava 14.0 not 20.0. Additional code to fill in …
|
|
|
@31201
|
7 years |
davidb |
Trigger serialization of whitelist in main program
|
|
|
@31200
|
7 years |
davidb |
Better output statement
|
|
|
@31199
|
7 years |
davidb |
Renaming of classname to reflect filename rename
|
|
|
@31198
|
7 years |
davidb |
File renaming to make way for newer version of classes needed in the …
|
|
|
@31197
|
7 years |
davidb |
File renaming to make way for newer version of classes needed in the …
|
|
|
@31196
|
7 years |
davidb |
File renaming to make way for newer version of classes needed in the …
|
|
|
@31195
|
7 years |
davidb |
File renaming to make way for newer version of classes needed in the …
|
|
|
@31194
|
7 years |
davidb |
Serialize in and out methods added
|
|
|
@31193
|
7 years |
davidb |
Peter's white-list file
|
|
|
@31184
|
7 years |
davidb |
New provision to run different main classes in _RUN.sh; New top-level …
|
|
|
@31183
|
7 years |
davidb |
Bump up to project using Java 1.8
|
|
|
@31177
|
7 years |
davidb |
Adding in Google jar that supports Bloom filters
|
|
|
@31176
|
7 years |
davidb |
Support added for producing whitelist word count
|
|
|
@31175
|
7 years |
davidb |
Trial to find memory difference betwen Hashmap and Bloom filters
|
|
|
@31174
|
7 years |
davidb |
One of the last scripts developed for getting ef dataset into HDFS
|
|
|
@31173
|
7 years |
davidb |
individual file sizes per top-level folder
|
|
|
@31172
|
7 years |
davidb |
to help track down missing files in HDFS copy
|
|
|
@31171
|
7 years |
davidb |
Util to help find where missing files are
|
|
|
@31170
|
7 years |
davidb |
Targetted sub-dir copy
|
|
|
@31169
|
7 years |
davidb |
Improved logic
|
|
|
@31161
|
7 years |
davidb |
Comparison of local disk version with HDFS version
|
|
|
@31152
|
7 years |
davidb |
Development of script
|
|
|
@31151
|
7 years |
davidb |
More nuanced version to help finish off the 'big put'
|
|
|
@31128
|
7 years |
davidb |
Some scripts to help with pushing and monitoring the progress of the …
|
|
|
@31112
|
7 years |
davidb |
To move out shards saved in /tmp on gsliscluter1 nodes to nema
|
|
|
@31106
|
7 years |
davidb |
Scripts to help run an rsync'd copy of gslistcluster1 …
|
|
|
@31105
|
7 years |
davidb |
Additional scripts to help with running solr locally out of /tmp area
|
|
|
@31104
|
7 years |
davidb |
now configurable to be run from local disk (/tmp)
|
|
|
@31103
|
7 years |
davidb |
Changes made after testing with 20 solr nodes
|
|
|
@31102
|
7 years |
davidb |
Command line way of running a Solr test query
|
|
|
@31101
|
7 years |
davidb |
Correction to collection name
|
|
|
@31100
|
7 years |
davidb |
Change to using solr-cloud-nodes that include port number
|
|
|
@31099
|
7 years |
davidb |
Changes resulting from test runs to get Zookeeper and Solr running on …
|
|
|
@31098
|
7 years |
davidb |
Changes resulting from test runs to get Zookeeper and Solr running on …
|
|
|
@31097
|
7 years |
davidb |
Changed to .in style namne
|
|
|
@31096
|
7 years |
davidb |
Only need to create a volume's pages output directory is _output_dir …
|
|
|
@31095
|
7 years |
davidb |
Introduced num-partitions property
|
|
|
@31094
|
7 years |
davidb |
Changes triggered by running on gsliscluster1
|
|
|
@31093
|
7 years |
davidb |
Changes triggered by running on gsliscluster1
|
|
|
@31092
|
7 years |
davidb |
Minor tweak to spark/hadoop combination downloaded
|
|
|
@31091
|
7 years |
davidb |
Change of number of core for 'gsliscluster1' machine; commmented out …
|
|
|
@31090
|
7 years |
davidb |
Memory monitor debugging code, commented out
|
|
|
@31089
|
7 years |
davidb |
Change in way the JSON file is read in. Motivation was a …
|
|
|
@31088
|
7 years |
davidb |
Shift to newIstance for FileSystem due to StackOverflow page …
|
|
|
@31082
|
7 years |
davidb |
Changes in response to testing on gchead
|
|
|
@31081
|
7 years |
davidb |
Going live with generation of spark slaves file
|
|
|
@31080
|
7 years |
davidb |
echo formatting tidy up. Fixed some typos
|
|
|
@31079
|
7 years |
davidb |
Useful get started scripts
|
|
|
@31078
|
7 years |
davidb |
Some setup files and scripts to make running Spark and Solr easier on …
|
|
|
@31077
|
7 years |
davidb |
Move up to JDK1.8. Tidy up of Vagrant machine names. Support for YARN. …
|
|
|
@31065
|
7 years |
davidb |
Additional echo output
|
|
|
@31062
|
7 years |
davidb |
Added in -W option so check-sum calculation is skipped
|
|
|
@31058
|
7 years |
davidb |
echo for additional information added
|
|
|
@31057
|
7 years |
davidb |
Tweak to jps output formatting
|
|
|
@31053
|
7 years |
davidb |
Addition of second argument, optional, for where to save the files
|
|
|
@31051
|
7 years |
davidb |
Added in JDK to list of possible packages needed
|
|
|
@31046
|
7 years |
davidb |
Added property to control how severe a JSON IO problem is
|
|
|
@31045
|
7 years |
davidb |
More careful treatment of what to do when a JSON file isn't there
|
|
|
@31044
|
7 years |
davidb |
Fixed up error when output_dir is empty
|
|
|
@31043
|
7 years |
davidb |
Version for processing full EF set
|
|
|
@31042
|
7 years |
davidb |
Name changes, preparing the way for FULL-RUN versions
|
|
|
@31041
|
7 years |
davidb |
Test needs to be more careful if -read-only specified
|
|
|
@31036
|
7 years |
davidb |
Renaming to prepare way for YARN version of script
|
|
|
@31035
|
7 years |
davidb |
Changes after testing scripts
|
|
|
@31034
|
7 years |
davidb |
Development of scripts for working with Full EF dataset
|
|
|
@31033
|
7 years |
davidb |
Development of scripts for working with Full EF dataset
|
|
|
@31030
|
7 years |
davidb |
Tweak to some verbosity level 2 printing
|
|
|
@31029
|
7 years |
davidb |
Newline at end of file added
|
|
|
@31028
|
7 years |
davidb |
Support for randonly choosing Solr endpoints added in
|
|
|
@31027
|
7 years |
davidb |
Mixed typo in property name used
|
|
|
@31026
|
7 years |
davidb |
Corrected flag setting
|
|
|
@31025
|
7 years |
davidb |
Use property process-json-mode to determine which sort of Spark …
|
|
|
@31024
|
7 years |
davidb |
Support for Java properties file
|
|
|
@31022
|
7 years |
davidb |
No longer used
|
|
|
@31021
|
7 years |
davidb |
Folder restructure to remove 'trunk' part
|
|
|
@31020
|
7 years |
davidb |
No longer used
|
|
|
@31019
|
7 years |
davidb |
Part 2 or two-step folder restructure
|
|
|
@31018
|
7 years |
davidb |
Part 1 or two-step folder restructure
|
|
|
@31017
|
7 years |
davidb |
Moved to correct position
|
|
|
@31015
|
7 years |
davidb |
Restructuring of projects into one
|