source: other-projects

Revision Log Mode:


Legend:

Added
Modified
Copied or renamed
Diff Rev Age Author Log Message
(edit) @31315   4 years davidb Further tweak
(edit) @31314   4 years davidb Another go at avoiding concurrency update exception
(edit) @31313   4 years davidb Alternative to avoid concurrency update exception
(edit) @31312   4 years davidb MongoDB can't have 'period' and 'dollar' in key, as reserved characters
(edit) @31311   4 years davidb Processing print statement added
(edit) @31310   4 years davidb Initial cut at files for working with MongoDB
(edit) @31309   4 years davidb Sparked MongoDB connector added
(edit) @31308   4 years davidb Minor tidy-up
(edit) @31307   4 years davidb convenience scripts
(edit) @31306   4 years davidb Final part of the mongodb shard puzzle -- router servers
(edit) @31305   4 years davidb Next good commit point. Initial testing of shard replset scripts
(edit) @31304   4 years davidb Changes made whe (it turned out) the real source of the error was an …
(edit) @31303   4 years davidb Adding in support to start and stop router server
(edit) @31302   4 years davidb Initial commit of scripts, after some testing, and subsequent changes …
(edit) @31301   4 years davidb Fix for gsliscluster1
(edit) @31300   4 years davidb Need to use NETWORK not PACKAGE
(edit) @31299   4 years davidb Additionally setup MongoDB
(edit) @31298   4 years davidb Initial cut at setup file for MongoDB
(edit) @31297   4 years davidb
(edit) @31296   4 years davidb Make loading in of ID file more portable
(edit) @31295   4 years davidb name change of webapp
(edit) @31294   4 years davidb Version for language counting the catalog assignment language …
(edit) @31283   4 years davidb Fixed typo
(edit) @31282   4 years davidb Jetty jar-runable server
(edit) @31281   4 years davidb
(edit) @31280   4 years davidb
(edit) @31279   4 years davidb First cut at servlet
(edit) @31278   4 years davidb To avoid null pointer on ids.iterator()
(edit) @31277   4 years davidb Tweak to minimum value
(edit) @31276   4 years davidb Min num partition guard put in
(edit) @31275   4 years davidb Changes to allow gc slave nodes to work with local disk versions of …
(edit) @31274   4 years davidb Need to use JSONArray no JSONObject for a multifield item
(edit) @31273   4 years davidb Code moved to store fields for multilingual use using dynamic Solr …
(edit) @31272   4 years davidb Use disk and memory to store main language RDD
(edit) @31271   4 years davidb Updating of POS code to new files-per-partition paramater, plus some …
(edit) @31270   4 years davidb Changed over to repartition approach
(edit) @31269   4 years davidb Some variable name changes, and printing tidy up
(edit) @31268   4 years davidb Adjustments to memory allocation in response to test runs on 10% of dataset
(edit) @31267   4 years davidb Values trialed on gsliscluster1. Rekindling idea of per-vol processing
(edit) @31266   4 years davidb Rekindling of per-volume approach. Also some tweaking to verbosity …
(edit) @31264   4 years davidb Switching to 'long' in counts to allow higher number representation
(edit) @31263   4 years davidb Change to using long for higher word counts
(edit) @31261   4 years davidb Overlooked changes from POS to lang
(edit) @31260   4 years davidb Language counting
(edit) @31259   4 years davidb Lambda sort had wrong boolean arg to sort descending. Now fixed
(edit) @31258   4 years davidb POS Label count, similar to Whitelist word count
(edit) @31257   4 years davidb Fixed typo
(edit) @31256   4 years davidb Earlier check of output directory to prevent large scale processing, …
(edit) @31255   4 years davidb Changed to using lambda functions
(edit) @31254   4 years davidb Experimenting with Lucene lowercase filter
(edit) @31253   4 years davidb Identified a typo, and changed to being true anyway
(edit) @31252   4 years davidb Support for icu-tokenize property added, plus relevant refactoring.
(edit) @31251   4 years davidb Code tidy up. Timed experiment showed sorting by key with …
(edit) @31250   4 years davidb Minor mods
(edit) @31247   4 years davidb Change sort order. Pick better output directory name
(edit) @31246   4 years davidb Experimenting with sorting
(edit) @31245   4 years davidb Refactored so processing of words from TokenPosCount now done by the …
(edit) @31244   4 years davidb Tidy up
(edit) @31243   4 years davidb Experimenting with Lucene/Solr's ICU tokenizer
(edit) @31242   4 years davidb Method name refactor
(edit) @31235   5 years davidb More fine-grained testing to help nema setup
(edit) @31234   5 years davidb More selective control of what to source/setup depending on hostname
(edit) @31233   5 years davidb Changes to operate on nema as well as gsliscluster1 and gc0-9
(edit) @31232   5 years davidb Hand edited version of state.json from gsliscluster1 suitable for …
(edit) @31231   5 years davidb Changes to allow SOLR to run on nodes in /hdfsd05/dbbridge/solr-ef
(edit) @31228   5 years davidb Change to see if code can be made more unified. If so, then …
(edit) @31227   5 years davidb Code tidy up
(edit) @31226   5 years davidb Fixed bloom test for init
(edit) @31225   5 years davidb Relocated bloomfilter creation to within call() method, so done on the …
(edit) @31224   5 years davidb Debug added
(edit) @31223   5 years davidb Exception printStackTrace
(edit) @31222   5 years davidb Changed to using ClusterFileIO supporting methods
(edit) @31221   5 years davidb Missing argument added in
(edit) @31220   5 years davidb Use of whitelist Bloom filter added to words going into Solr index
(edit) @31219   5 years ak19 Forgot to add to model-collect with previous commit.
(edit) @31217   5 years ak19 Adding the new oai-inf.db files, created by rebuilding the model …
(edit) @31215   5 years davidb Changed back to Guava 20 API, now mvn shading allows me to have this …
(edit) @31214   5 years davidb Not needed now using mvn shading
(edit) @31213   5 years davidb Tidy up
(edit) @31212   5 years davidb Changed from mvn assemblhy to shadowing, which has more control
(edit) @31211   5 years davidb Changing back to regular Guava classes. Looking to use maven shading …
(edit) @31209   5 years davidb checkArgument added in
(edit) @31207   5 years davidb And some more tweaking
(edit) @31206   5 years davidb More tweaking of Guava cloned code
(edit) @31205   5 years davidb Next added in part of new Guava code
(edit) @31204   5 years davidb Splicing in Guava verion 20 of BloomFilter into code as own class (now …
(edit) @31203   5 years davidb Use class provided stringFunnel
(edit) @31202   5 years davidb Turns out Spark uses Guava 14.0 not 20.0. Additional code to fill in …
(edit) @31201   5 years davidb Trigger serialization of whitelist in main program
(edit) @31200   5 years davidb Better output statement
(edit) @31199   5 years davidb Renaming of classname to reflect filename rename
(edit) @31198   5 years davidb File renaming to make way for newer version of classes needed in the …
(edit) @31197   5 years davidb File renaming to make way for newer version of classes needed in the …
(edit) @31196   5 years davidb File renaming to make way for newer version of classes needed in the …
(edit) @31195   5 years davidb File renaming to make way for newer version of classes needed in the …
(edit) @31194   5 years davidb Serialize in and out methods added
(edit) @31193   5 years davidb Peter's white-list file
(edit) @31184   5 years davidb New provision to run different main classes in _RUN.sh; New top-level …
(edit) @31183   5 years davidb Bump up to project using Java 1.8
(edit) @31177   5 years davidb Adding in Google jar that supports Bloom filters
Note: See TracRevisionLog for help on using the revision log.