root/other-projects/hathitrust

Revision Log Mode:


Legend:

Added
Modified
Copied or renamed
Rev Chgset Date Author Log Message
(edit) @31786 [31786] 2 years davidb extra param in call; change to case-folding _htrctokentext
(edit) @31785 [31785] 2 years davidb Change to allow solr command to optioanlly issue 'restart' instead of …
(edit) @31784 [31784] 2 years davidb Output to highlight skipping per-page indexing
(edit) @31783 [31783] 2 years davidb Solr Doc Add changed to include volume-level metadata within every page …
(edit) @31782 [31782] 2 years davidb more careful separation into field types htrcstring and htrcstrings
(edit) @31779 [31779] 2 years davidb Change in how POS words are checked against the Whitelist. Previously …
(edit) @31772 [31772] 2 years davidb Accidentally committed
(edit) @31693 [31693] 2 years davidb Changes to workset information is pulled from sparql-endpoint for each …
(edit) @31677 [31677] 2 years davidb Supress processing governmentDocument for now in JSON metadata record, as …
(edit) @31676 [31676] 2 years davidb To make it easier to remember how to kill off a YARN task at the command …
(edit) @31675 [31675] 2 years davidb More careful set of metadata fields indexed
(edit) @31645 [31645] 2 years davidb Some initial work on drawing in workset info from sparql-endpoint. Sqparl …
(edit) @31626 [31626] 2 years davidb Links to blog entries added
(edit) @31625 [31625] 2 years davidb Tidy up
(edit) @31624 [31624] 2 years davidb Combined volume md and full-text page searching
(edit) @31623 [31623] 2 years davidb Removed commented out static HTML POS section
(edit) @31622 [31622] 2 years davidb Adding in CORS support to Solr
(edit) @31621 [31621] 2 years davidb Step towards making HTML/JS work with on different server, with AJAX …
(edit) @31619 [31619] 2 years davidb Further minor tidy up
(edit) @31618 [31618] 2 years davidb Code tidy up
(edit) @31614 [31614] 2 years davidb Separate off stream query page
(edit) @31613 [31613] 2 years davidb Multiple word support in POS search box. Tidy up of anchor for search …
(edit) @31601 [31601] 2 years davidb To get the look and feel of the HTRC portal web site, supporting files …
(edit) @31598 [31598] 2 years davidb Easier to remember what to do
(edit) @31597 [31597] 2 years davidb Additional _s and _ss fields to help with faceting. Temporarily commented …
(edit) @31571 [31571] 2 years davidb Simple search-all-langs feature added
(edit) @31570 [31570] 2 years davidb Solr-stream based search
(edit) @31524 [31524] 2 years davidb Main changes: Fix for page/seqnum; group by id; show-hide other languages; …
(edit) @31510 [31510] 2 years davidb Turns out some languages fields can be empty. Need to test for this
(edit) @31509 [31509] 2 years davidb LangPos? determination changed to lock into first match, rather than trying …
(edit) @31506 [31506] 2 years davidb Forgot to add initialization line. Doh!
(edit) @31505 [31505] 2 years davidb Added in storing of top-level document metadata as separate solr-doc
(edit) @31504 [31504] 2 years davidb Adjusted call to work with added parameter
(edit) @31503 [31503] 2 years davidb Monitor for missing POS keys, and print out details first time each …
(edit) @31502 [31502] 2 years davidb Comment out section, useful for controlling a smaller run
(edit) @31501 [31501] 2 years davidb No longer used
(edit) @31500 [31500] 2 years davidb Synchronize on reading in of white-list and universal-lang-pos
(edit) @31499 [31499] 2 years davidb Better exception handling
(edit) @31498 [31498] 2 years davidb Tidy up on print statements
(edit) @31466 [31466] 2 years davidb Fix to work out solr_host rather than assume it is gc0
(edit) @31465 [31465] 2 years davidb Adjustment to run solr with more memory
(edit) @31464 [31464] 2 years davidb More general version of script that let's you specify the collection name …
(edit) @31455 [31455] 2 years davidb deprecated
(edit) @31454 [31454] 2 years davidb Deprecated
(edit) @31453 [31453] 2 years davidb Added size() method
(edit) @31452 [31452] 2 years davidb Additional Spark progs to run
(edit) @31451 [31451] 2 years davidb shift to using solr-base-url and a specified solr-collection
(edit) @31450 [31450] 2 years davidb Some debugging output to help see what is happening with langmap_directory …
(edit) @31393 [31393] 3 years davidb Fixed typo
(edit) @31392 [31392] 3 years davidb Support for Catalog page added
(edit) @31385 [31385] 3 years davidb Next and previous pages
(edit) @31384 [31384] 3 years davidb After next phase of development
(edit) @31383 [31383] 3 years davidb Files for initial functioning search page
(edit) @31378 [31378] 3 years davidb Fixed loop limit test
(edit) @31377 [31377] 3 years davidb Switch to using URI not string
(edit) @31376 [31376] 3 years davidb Universal language mappings for opennlp POS model tags
(edit) @31375 [31375] 3 years davidb Initial cut at including POS information to solr index
(edit) @31374 [31374] 3 years davidb simplified command line usage
(edit) @31373 [31373] 3 years davidb Changes made to operate on solr1 and solr2 boxes
(edit) @31372 [31372] 3 years davidb Reworked to use sequenceFiles
(edit) @31371 [31371] 3 years davidb Trying to get saveAsSequenceFile working
(edit) @31370 [31370] 3 years davidb Fixed incorrect version number. Using htrcstring so field values not …
(edit) @31369 [31369] 3 years davidb Trial new save
(edit) @31368 [31368] 3 years davidb downsample-100 added
(edit) @31367 [31367] 3 years davidb Changes to work with solr1 and solr2
(edit) @31366 [31366] 3 years davidb Updated to latest released version of Solr
(edit) @31365 [31365] 3 years davidb Quick code added to downsample
(edit) @31364 [31364] 3 years davidb removed sample() line
(edit) @31363 [31363] 3 years davidb Control num of partitions on sort
(edit) @31362 [31362] 3 years davidb use Spark sample() to make for smaller test with Sequence files
(edit) @31361 [31361] 3 years davidb Change from String to Text
(edit) @31360 [31360] 3 years davidb Seems to be Text class not a String class coming out of the seuquenceFiles
(edit) @31359 [31359] 3 years davidb Changed over to use sequenceFiles as input
(edit) @31358 [31358] 3 years davidb Make workset download save as file
(edit) @31357 [31357] 3 years davidb Ensure all output sent to browser
(edit) @31356 [31356] 3 years davidb Tidy up on appending missing volumes
(edit) @31355 [31355] 3 years davidb Changed to using containsKey rather than get to avoid null pointer cast …
(edit) @31354 [31354] 3 years davidb import tidy-up
(edit) @31353 [31353] 3 years davidb Added debug print statement
(edit) @31352 [31352] 3 years davidb collection-to-workset now with id-check added to filter
(edit) @31351 [31351] 3 years davidb Powerpoint slides showing mahsup features
(edit) @31350 [31350] 3 years davidb Use new 'convert-col' action
(edit) @31349 [31349] 3 years davidb Change over to proxyied main web server
(edit) @31348 [31348] 3 years davidb Restructure of how convert-to works
(edit) @31347 [31347] 3 years davidb First stage of developing HT collection to HTRC workset. Code to allow …
(edit) @31342 [31342] 3 years davidb Some initial progress on collection to workset conversion
(edit) @31341 [31341] 3 years davidb Cody tidy-up
(edit) @31340 [31340] 3 years davidb Test worked OK. Removing debug code
(edit) @31339 [31339] 3 years davidb Debugging statement
(edit) @31338 [31338] 3 years davidb additional close()
(edit) @31337 [31337] 3 years davidb Output the downloaded rsync file
(edit) @31336 [31336] 3 years davidb Changes in response to testing
(edit) @31335 [31335] 3 years davidb Too expensive to hold pairtree filename in hashmap, so change to computing …
(edit) @31334 [31334] 3 years davidb Initial cut at rsync download
(edit) @31333 [31333] 3 years davidb Minor word tweak
(edit) @31332 [31332] 3 years davidb needed in Jetty CORS support
(edit) @31331 [31331] 3 years davidb Reworked to use CORS and $.ajax() so TamperMonkey? doesn't interceed with …
(edit) @31330 [31330] 3 years davidb Initial cut a files that explain how to install the user-script
(edit) @31329 [31329] 3 years davidb Tweaks after testing INSTALL.sh
(edit) @31328 [31328] 3 years davidb Install the necessary files in the jetty webapps dir
Note: See TracRevisionLog for help on using the revision log.