|
|
@31601
|
7 years |
davidb |
To get the look and feel of the HTRC portal web site, supporting files …
|
|
|
@31598
|
7 years |
davidb |
Easier to remember what to do
|
|
|
@31597
|
7 years |
davidb |
Additional _s and _ss fields to help with faceting. Temporarily …
|
|
|
@31571
|
7 years |
davidb |
Simple search-all-langs feature added
|
|
|
@31570
|
7 years |
davidb |
Solr-stream based search
|
|
|
@31524
|
7 years |
davidb |
Main changes: Fix for page/seqnum; group by id; show-hide other …
|
|
|
@31510
|
7 years |
davidb |
Turns out some languages fields can be empty. Need to test for this
|
|
|
@31509
|
7 years |
davidb |
LangPos determination changed to lock into first match, rather than …
|
|
|
@31506
|
7 years |
davidb |
Forgot to add initialization line. Doh!
|
|
|
@31505
|
7 years |
davidb |
Added in storing of top-level document metadata as separate solr-doc
|
|
|
@31504
|
7 years |
davidb |
Adjusted call to work with added parameter
|
|
|
@31503
|
7 years |
davidb |
Monitor for missing POS keys, and print out details first time each …
|
|
|
@31502
|
7 years |
davidb |
Comment out section, useful for controlling a smaller run
|
|
|
@31501
|
7 years |
davidb |
No longer used
|
|
|
@31500
|
7 years |
davidb |
Synchronize on reading in of white-list and universal-lang-pos
|
|
|
@31499
|
7 years |
davidb |
Better exception handling
|
|
|
@31498
|
7 years |
davidb |
Tidy up on print statements
|
|
|
@31466
|
7 years |
davidb |
Fix to work out solr_host rather than assume it is gc0
|
|
|
@31465
|
7 years |
davidb |
Adjustment to run solr with more memory
|
|
|
@31464
|
7 years |
davidb |
More general version of script that let's you specify the collection …
|
|
|
@31455
|
7 years |
davidb |
deprecated
|
|
|
@31454
|
7 years |
davidb |
Deprecated
|
|
|
@31453
|
7 years |
davidb |
Added size() method
|
|
|
@31452
|
7 years |
davidb |
Additional Spark progs to run
|
|
|
@31451
|
7 years |
davidb |
shift to using solr-base-url and a specified solr-collection
|
|
|
@31450
|
7 years |
davidb |
Some debugging output to help see what is happening with …
|
|
|
@31393
|
7 years |
davidb |
Fixed typo
|
|
|
@31392
|
7 years |
davidb |
Support for Catalog page added
|
|
|
@31385
|
7 years |
davidb |
Next and previous pages
|
|
|
@31384
|
7 years |
davidb |
After next phase of development
|
|
|
@31383
|
7 years |
davidb |
Files for initial functioning search page
|
|
|
@31378
|
7 years |
davidb |
Fixed loop limit test
|
|
|
@31377
|
7 years |
davidb |
Switch to using URI not string
|
|
|
@31376
|
7 years |
davidb |
Universal language mappings for opennlp POS model tags
|
|
|
@31375
|
7 years |
davidb |
Initial cut at including POS information to solr index
|
|
|
@31374
|
7 years |
davidb |
simplified command line usage
|
|
|
@31373
|
7 years |
davidb |
Changes made to operate on solr1 and solr2 boxes
|
|
|
@31372
|
7 years |
davidb |
Reworked to use sequenceFiles
|
|
|
@31371
|
7 years |
davidb |
Trying to get saveAsSequenceFile working
|
|
|
@31370
|
7 years |
davidb |
Fixed incorrect version number. Using htrcstring so field values not …
|
|
|
@31369
|
7 years |
davidb |
Trial new save
|
|
|
@31368
|
7 years |
davidb |
downsample-100 added
|
|
|
@31367
|
7 years |
davidb |
Changes to work with solr1 and solr2
|
|
|
@31366
|
7 years |
davidb |
Updated to latest released version of Solr
|
|
|
@31365
|
7 years |
davidb |
Quick code added to downsample
|
|
|
@31364
|
7 years |
davidb |
removed sample() line
|
|
|
@31363
|
7 years |
davidb |
Control num of partitions on sort
|
|
|
@31362
|
7 years |
davidb |
use Spark sample() to make for smaller test with Sequence files
|
|
|
@31361
|
7 years |
davidb |
Change from String to Text
|
|
|
@31360
|
7 years |
davidb |
Seems to be Text class not a String class coming out of the seuquenceFiles
|
|
|
@31359
|
7 years |
davidb |
Changed over to use sequenceFiles as input
|
|
|
@31358
|
7 years |
davidb |
Make workset download save as file
|
|
|
@31357
|
7 years |
davidb |
Ensure all output sent to browser
|
|
|
@31356
|
7 years |
davidb |
Tidy up on appending missing volumes
|
|
|
@31355
|
7 years |
davidb |
Changed to using containsKey rather than get to avoid null pointer …
|
|
|
@31354
|
7 years |
davidb |
import tidy-up
|
|
|
@31353
|
7 years |
davidb |
Added debug print statement
|
|
|
@31352
|
7 years |
davidb |
collection-to-workset now with id-check added to filter
|
|
|
@31351
|
7 years |
davidb |
Powerpoint slides showing mahsup features
|
|
|
@31350
|
7 years |
davidb |
Use new 'convert-col' action
|
|
|
@31349
|
7 years |
davidb |
Change over to proxyied main web server
|
|
|
@31348
|
7 years |
davidb |
Restructure of how convert-to works
|
|
|
@31347
|
7 years |
davidb |
First stage of developing HT collection to HTRC workset. Code to …
|
|
|
@31342
|
7 years |
davidb |
Some initial progress on collection to workset conversion
|
|
|
@31341
|
7 years |
davidb |
Cody tidy-up
|
|
|
@31340
|
7 years |
davidb |
Test worked OK. Removing debug code
|
|
|
@31339
|
7 years |
davidb |
Debugging statement
|
|
|
@31338
|
7 years |
davidb |
additional close()
|
|
|
@31337
|
7 years |
davidb |
Output the downloaded rsync file
|
|
|
@31336
|
7 years |
davidb |
Changes in response to testing
|
|
|
@31335
|
7 years |
davidb |
Too expensive to hold pairtree filename in hashmap, so change to …
|
|
|
@31334
|
7 years |
davidb |
Initial cut at rsync download
|
|
|
@31333
|
7 years |
davidb |
Minor word tweak
|
|
|
@31332
|
7 years |
davidb |
needed in Jetty CORS support
|
|
|
@31331
|
7 years |
davidb |
Reworked to use CORS and $.ajax() so TamperMonkey doesn't interceed …
|
|
|
@31330
|
7 years |
davidb |
Initial cut a files that explain how to install the user-script
|
|
|
@31329
|
7 years |
davidb |
Tweaks after testing INSTALL.sh
|
|
|
@31328
|
7 years |
davidb |
Install the necessary files in the jetty webapps dir
|
|
|
@31327
|
7 years |
davidb |
name change to be more consistent
|
|
|
@31326
|
7 years |
davidb |
Further tweaks
|
|
|
@31325
|
7 years |
davidb |
Further tweaks
|
|
|
@31324
|
7 years |
davidb |
More accurate name
|
|
|
@31323
|
7 years |
davidb |
Download script plus setup instructions
|
|
|
@31322
|
7 years |
davidb |
Location for the Java byte compiled code to link in with rest of servlet
|
|
|
@31321
|
7 years |
davidb |
useful scripts
|
|
|
@31320
|
7 years |
davidb |
build Document rather than parse JSON string
|
|
|
@31319
|
7 years |
davidb |
Changed to replace existing MongoDB entry. Fixed up printt statement
|
|
|
@31318
|
7 years |
davidb |
change to using contains()
|
|
|
@31317
|
7 years |
davidb |
added debug statement
|
|
|
@31316
|
7 years |
davidb |
fixed typo
|
|
|
@31315
|
7 years |
davidb |
Further tweak
|
|
|
@31314
|
7 years |
davidb |
Another go at avoiding concurrency update exception
|
|
|
@31313
|
7 years |
davidb |
Alternative to avoid concurrency update exception
|
|
|
@31312
|
7 years |
davidb |
MongoDB can't have 'period' and 'dollar' in key, as reserved characters
|
|
|
@31311
|
7 years |
davidb |
Processing print statement added
|
|
|
@31310
|
7 years |
davidb |
Initial cut at files for working with MongoDB
|
|
|
@31309
|
7 years |
davidb |
Sparked MongoDB connector added
|
|
|
@31308
|
7 years |
davidb |
Minor tidy-up
|
|
|
@31307
|
7 years |
davidb |
convenience scripts
|
|
|
@31306
|
7 years |
davidb |
Final part of the mongodb shard puzzle -- router servers
|
|
|