root/other-projects

Revision Log Mode:


Legend:

Added
Modified
Copied or renamed
Rev Chgset Date Author Log Message
(edit) @31363 [31363] 2 years davidb Control num of partitions on sort
(edit) @31362 [31362] 2 years davidb use Spark sample() to make for smaller test with Sequence files
(edit) @31361 [31361] 2 years davidb Change from String to Text
(edit) @31360 [31360] 2 years davidb Seems to be Text class not a String class coming out of the seuquenceFiles
(edit) @31359 [31359] 2 years davidb Changed over to use sequenceFiles as input
(edit) @31358 [31358] 2 years davidb Make workset download save as file
(edit) @31357 [31357] 2 years davidb Ensure all output sent to browser
(edit) @31356 [31356] 2 years davidb Tidy up on appending missing volumes
(edit) @31355 [31355] 2 years davidb Changed to using containsKey rather than get to avoid null pointer cast …
(edit) @31354 [31354] 2 years davidb import tidy-up
(edit) @31353 [31353] 2 years davidb Added debug print statement
(edit) @31352 [31352] 2 years davidb collection-to-workset now with id-check added to filter
(edit) @31351 [31351] 2 years davidb Powerpoint slides showing mahsup features
(edit) @31350 [31350] 2 years davidb Use new 'convert-col' action
(edit) @31349 [31349] 2 years davidb Change over to proxyied main web server
(edit) @31348 [31348] 2 years davidb Restructure of how convert-to works
(edit) @31347 [31347] 2 years davidb First stage of developing HT collection to HTRC workset. Code to allow …
(edit) @31342 [31342] 2 years davidb Some initial progress on collection to workset conversion
(edit) @31341 [31341] 2 years davidb Cody tidy-up
(edit) @31340 [31340] 2 years davidb Test worked OK. Removing debug code
(edit) @31339 [31339] 2 years davidb Debugging statement
(edit) @31338 [31338] 2 years davidb additional close()
(edit) @31337 [31337] 2 years davidb Output the downloaded rsync file
(edit) @31336 [31336] 2 years davidb Changes in response to testing
(edit) @31335 [31335] 2 years davidb Too expensive to hold pairtree filename in hashmap, so change to computing …
(edit) @31334 [31334] 2 years davidb Initial cut at rsync download
(edit) @31333 [31333] 2 years davidb Minor word tweak
(edit) @31332 [31332] 2 years davidb needed in Jetty CORS support
(edit) @31331 [31331] 2 years davidb Reworked to use CORS and $.ajax() so TamperMonkey? doesn't interceed with …
(edit) @31330 [31330] 2 years davidb Initial cut a files that explain how to install the user-script
(edit) @31329 [31329] 2 years davidb Tweaks after testing INSTALL.sh
(edit) @31328 [31328] 2 years davidb Install the necessary files in the jetty webapps dir
(edit) @31327 [31327] 2 years davidb name change to be more consistent
(edit) @31326 [31326] 2 years davidb Further tweaks
(edit) @31325 [31325] 2 years davidb Further tweaks
(edit) @31324 [31324] 2 years davidb More accurate name
(edit) @31323 [31323] 2 years davidb Download script plus setup instructions
(edit) @31322 [31322] 2 years davidb Location for the Java byte compiled code to link in with rest of servlet
(edit) @31321 [31321] 2 years davidb useful scripts
(edit) @31320 [31320] 2 years davidb build Document rather than parse JSON string
(edit) @31319 [31319] 2 years davidb Changed to replace existing MongoDB entry. Fixed up printt statement
(edit) @31318 [31318] 2 years davidb change to using contains()
(edit) @31317 [31317] 2 years davidb added debug statement
(edit) @31316 [31316] 2 years davidb fixed typo
(edit) @31315 [31315] 2 years davidb Further tweak
(edit) @31314 [31314] 2 years davidb Another go at avoiding concurrency update exception
(edit) @31313 [31313] 2 years davidb Alternative to avoid concurrency update exception
(edit) @31312 [31312] 2 years davidb MongoDB can't have 'period' and 'dollar' in key, as reserved characters
(edit) @31311 [31311] 2 years davidb Processing print statement added
(edit) @31310 [31310] 2 years davidb Initial cut at files for working with MongoDB
(edit) @31309 [31309] 2 years davidb Sparked MongoDB connector added
(edit) @31308 [31308] 2 years davidb Minor tidy-up
(edit) @31307 [31307] 2 years davidb convenience scripts
(edit) @31306 [31306] 2 years davidb Final part of the mongodb shard puzzle -- router servers
(edit) @31305 [31305] 2 years davidb Next good commit point. Initial testing of shard replset scripts
(edit) @31304 [31304] 2 years davidb Changes made whe (it turned out) the real source of the error was an error …
(edit) @31303 [31303] 2 years davidb Adding in support to start and stop router server
(edit) @31302 [31302] 2 years davidb Initial commit of scripts, after some testing, and subsequent changes to …
(edit) @31301 [31301] 2 years davidb Fix for gsliscluster1
(edit) @31300 [31300] 2 years davidb Need to use NETWORK not PACKAGE
(edit) @31299 [31299] 2 years davidb Additionally setup MongoDB
(edit) @31298 [31298] 2 years davidb Initial cut at setup file for MongoDB
(edit) @31297 [31297] 2 years davidb
(edit) @31296 [31296] 2 years davidb Make loading in of ID file more portable
(edit) @31295 [31295] 2 years davidb name change of webapp
(edit) @31294 [31294] 2 years davidb Version for language counting the catalog assignment language metadata. …
(edit) @31283 [31283] 2 years davidb Fixed typo
(edit) @31282 [31282] 2 years davidb Jetty jar-runable server
(edit) @31281 [31281] 2 years davidb
(edit) @31280 [31280] 2 years davidb
(edit) @31279 [31279] 2 years davidb First cut at servlet
(edit) @31278 [31278] 2 years davidb To avoid null pointer on ids.iterator()
(edit) @31277 [31277] 2 years davidb Tweak to minimum value
(edit) @31276 [31276] 2 years davidb Min num partition guard put in
(edit) @31275 [31275] 2 years davidb Changes to allow gc slave nodes to work with local disk versions of …
(edit) @31274 [31274] 2 years davidb Need to use JSONArray no JSONObject for a multifield item
(edit) @31273 [31273] 2 years davidb Code moved to store fields for multilingual use using dynamic Solr fields …
(edit) @31272 [31272] 2 years davidb Use disk and memory to store main language RDD
(edit) @31271 [31271] 2 years davidb Updating of POS code to new files-per-partition paramater, plus some other …
(edit) @31270 [31270] 2 years davidb Changed over to repartition approach
(edit) @31269 [31269] 2 years davidb Some variable name changes, and printing tidy up
(edit) @31268 [31268] 2 years davidb Adjustments to memory allocation in response to test runs on 10% of …
(edit) @31267 [31267] 2 years davidb Values trialed on gsliscluster1. Rekindling idea of per-vol processing
(edit) @31266 [31266] 2 years davidb Rekindling of per-volume approach. Also some tweaking to verbosity debug …
(edit) @31264 [31264] 3 years davidb Switching to 'long' in counts to allow higher number representation
(edit) @31263 [31263] 3 years davidb Change to using long for higher word counts
(edit) @31261 [31261] 3 years davidb Overlooked changes from POS to lang
(edit) @31260 [31260] 3 years davidb Language counting
(edit) @31259 [31259] 3 years davidb Lambda sort had wrong boolean arg to sort descending. Now fixed
(edit) @31258 [31258] 3 years davidb POS Label count, similar to Whitelist word count
(edit) @31257 [31257] 3 years davidb Fixed typo
(edit) @31256 [31256] 3 years davidb Earlier check of output directory to prevent large scale processing, when …
(edit) @31255 [31255] 3 years davidb Changed to using lambda functions
(edit) @31254 [31254] 3 years davidb Experimenting with Lucene lowercase filter
(edit) @31253 [31253] 3 years davidb Identified a typo, and changed to being true anyway
(edit) @31252 [31252] 3 years davidb Support for icu-tokenize property added, plus relevant refactoring.
(edit) @31251 [31251] 3 years davidb Code tidy up. Timed experiment showed sorting by key with num_partitions …
(edit) @31250 [31250] 3 years davidb Minor mods
(edit) @31247 [31247] 3 years davidb Change sort order. Pick better output directory name
(edit) @31246 [31246] 3 years davidb Experimenting with sorting
Note: See TracRevisionLog for help on using the revision log.