source: other-projects/hathitrust/wcsa/extracted-features-solr/trunk/solr-ingest@ 31300

Name                    Size       Rev    Age      Author  Last Change
.settings               -          31183  5 years  davidb  Bump up to project using Java 1.8
packages                -          31092  5 years  davidb  Minor tweak to spark/hadoop combination downloaded
scripts                 -          31268  4 years  davidb  Adjustments to memory allocation in response to test runs on 10% of dataset
src                     -          31294  4 years  davidb  Version for language counting the catalog assignment language …
.classpath              1.0 KB     31183  5 years  davidb  Bump up to project using Java 1.8
.project                566 bytes  30899  5 years  davidb  Files for compilation using Eclipse
COMPILE.bash            165 bytes  31212  5 years  davidb  Changed from mvn assembly to shadowing, which has more control
COMPILE.bat             75 bytes   30898  5 years  davidb  Scripts for downloading sample JSON data from public domain extracted …
ef-solr.properties      1.3 KB     31267  4 years  davidb  Values trialed on gsliscluster1. Rekindling idea of per-vol processing
pom.xml                 3.1 KB     31243  4 years  davidb  Experimenting with Lucene/Solr's ICU tokenizer
README.txt              1.7 KB     30972  5 years  davidb  addition of useful command needed before re-running
wcsa-whitelist1.csv.gz  61.6 MB    31193  5 years  davidb  Peter's white-list file