source: other-projects/hathitrust

Revision Log Mode:


Legend:

Added
Modified
Copied or renamed
Diff Rev Age Author Log Message
(edit) @31524   4 years davidb Main changes: Fix for page/seqnum; group by id; show-hide other …
(edit) @31510   4 years davidb Turns out some languages fields can be empty. Need to test for this
(edit) @31509   4 years davidb LangPos determination changed to lock into first match, rather than …
(edit) @31506   4 years davidb Forgot to add initialization line. Doh!
(edit) @31505   4 years davidb Added in storing of top-level document metadata as separate solr-doc
(edit) @31504   4 years davidb Adjusted call to work with added parameter
(edit) @31503   4 years davidb Monitor for missing POS keys, and print out details first time each …
(edit) @31502   4 years davidb Comment out section, useful for controlling a smaller run
(edit) @31501   4 years davidb No longer used
(edit) @31500   4 years davidb Synchronize on reading in of white-list and universal-lang-pos
(edit) @31499   4 years davidb Better exception handling
(edit) @31498   4 years davidb Tidy up on print statements
(edit) @31466   4 years davidb Fix to work out solr_host rather than assume it is gc0
(edit) @31465   4 years davidb Adjustment to run solr with more memory
(edit) @31464   4 years davidb More general version of script that let's you specify the collection …
(edit) @31455   4 years davidb deprecated
(edit) @31454   4 years davidb Deprecated
(edit) @31453   4 years davidb Added size() method
(edit) @31452   4 years davidb Additional Spark progs to run
(edit) @31451   4 years davidb shift to using solr-base-url and a specified solr-collection
(edit) @31450   4 years davidb Some debugging output to help see what is happening with …
(edit) @31393   4 years davidb Fixed typo
(edit) @31392   4 years davidb Support for Catalog page added
(edit) @31385   4 years davidb Next and previous pages
(edit) @31384   4 years davidb After next phase of development
(edit) @31383   4 years davidb Files for initial functioning search page
(edit) @31378   4 years davidb Fixed loop limit test
(edit) @31377   4 years davidb Switch to using URI not string
(edit) @31376   4 years davidb Universal language mappings for opennlp POS model tags
(edit) @31375   4 years davidb Initial cut at including POS information to solr index
(edit) @31374   4 years davidb simplified command line usage
(edit) @31373   4 years davidb Changes made to operate on solr1 and solr2 boxes
(edit) @31372   4 years davidb Reworked to use sequenceFiles
(edit) @31371   4 years davidb Trying to get saveAsSequenceFile working
(edit) @31370   4 years davidb Fixed incorrect version number. Using htrcstring so field values not …
(edit) @31369   4 years davidb Trial new save
(edit) @31368   4 years davidb downsample-100 added
(edit) @31367   4 years davidb Changes to work with solr1 and solr2
(edit) @31366   4 years davidb Updated to latest released version of Solr
(edit) @31365   4 years davidb Quick code added to downsample
(edit) @31364   4 years davidb removed sample() line
(edit) @31363   4 years davidb Control num of partitions on sort
(edit) @31362   4 years davidb use Spark sample() to make for smaller test with Sequence files
(edit) @31361   4 years davidb Change from String to Text
(edit) @31360   4 years davidb Seems to be Text class not a String class coming out of the seuquenceFiles
(edit) @31359   4 years davidb Changed over to use sequenceFiles as input
(edit) @31358   4 years davidb Make workset download save as file
(edit) @31357   4 years davidb Ensure all output sent to browser
(edit) @31356   4 years davidb Tidy up on appending missing volumes
(edit) @31355   4 years davidb Changed to using containsKey rather than get to avoid null pointer …
(edit) @31354   4 years davidb import tidy-up
(edit) @31353   4 years davidb Added debug print statement
(edit) @31352   4 years davidb collection-to-workset now with id-check added to filter
(edit) @31351   4 years davidb Powerpoint slides showing mahsup features
(edit) @31350   4 years davidb Use new 'convert-col' action
(edit) @31349   4 years davidb Change over to proxyied main web server
(edit) @31348   4 years davidb Restructure of how convert-to works
(edit) @31347   4 years davidb First stage of developing HT collection to HTRC workset. Code to …
(edit) @31342   4 years davidb Some initial progress on collection to workset conversion
(edit) @31341   4 years davidb Cody tidy-up
(edit) @31340   4 years davidb Test worked OK. Removing debug code
(edit) @31339   4 years davidb Debugging statement
(edit) @31338   4 years davidb additional close()
(edit) @31337   4 years davidb Output the downloaded rsync file
(edit) @31336   4 years davidb Changes in response to testing
(edit) @31335   4 years davidb Too expensive to hold pairtree filename in hashmap, so change to …
(edit) @31334   4 years davidb Initial cut at rsync download
(edit) @31333   4 years davidb Minor word tweak
(edit) @31332   4 years davidb needed in Jetty CORS support
(edit) @31331   4 years davidb Reworked to use CORS and $.ajax() so TamperMonkey doesn't interceed …
(edit) @31330   4 years davidb Initial cut a files that explain how to install the user-script
(edit) @31329   4 years davidb Tweaks after testing INSTALL.sh
(edit) @31328   4 years davidb Install the necessary files in the jetty webapps dir
(edit) @31327   4 years davidb name change to be more consistent
(edit) @31326   4 years davidb Further tweaks
(edit) @31325   4 years davidb Further tweaks
(edit) @31324   4 years davidb More accurate name
(edit) @31323   4 years davidb Download script plus setup instructions
(edit) @31322   4 years davidb Location for the Java byte compiled code to link in with rest of servlet
(edit) @31321   4 years davidb useful scripts
(edit) @31320   4 years davidb build Document rather than parse JSON string
(edit) @31319   4 years davidb Changed to replace existing MongoDB entry. Fixed up printt statement
(edit) @31318   4 years davidb change to using contains()
(edit) @31317   4 years davidb added debug statement
(edit) @31316   4 years davidb fixed typo
(edit) @31315   4 years davidb Further tweak
(edit) @31314   4 years davidb Another go at avoiding concurrency update exception
(edit) @31313   4 years davidb Alternative to avoid concurrency update exception
(edit) @31312   4 years davidb MongoDB can't have 'period' and 'dollar' in key, as reserved characters
(edit) @31311   4 years davidb Processing print statement added
(edit) @31310   4 years davidb Initial cut at files for working with MongoDB
(edit) @31309   4 years davidb Sparked MongoDB connector added
(edit) @31308   4 years davidb Minor tidy-up
(edit) @31307   4 years davidb convenience scripts
(edit) @31306   4 years davidb Final part of the mongodb shard puzzle -- router servers
(edit) @31305   4 years davidb Next good commit point. Initial testing of shard replset scripts
(edit) @31304   4 years davidb Changes made whe (it turned out) the real source of the error was an …
(edit) @31303   4 years davidb Adding in support to start and stop router server
(edit) @31302   4 years davidb Initial commit of scripts, after some testing, and subsequent changes …
(edit) @31301   4 years davidb Fix for gsliscluster1
Note: See TracRevisionLog for help on using the revision log.