source: other-projects/hathitrust/wcsa/extracted-features-solr/trunk/solr-ingest

Name                       Size       Rev    Age      Author  Last Change
.settings                             31183  7 years  davidb  Bump up to project using Java 1.8
opennlp-lang-pos-mappings             31376  7 years  davidb  Universal language mappings for opennlp POS model tags
packages                              31092  7 years  davidb  Minor tweak to spark/hadoop combination downloaded
scripts                               32109  6 years  davidb  Changes made after testing through YARN
src                                   32109  6 years  davidb  Changes made after testing through YARN
.classpath                 1.0 KB     31183  7 years  davidb  Bump up to project using Java 1.8
.project                   566 bytes  30899  7 years  davidb  Files for compilation using Eclipse
COMPILE.bash               165 bytes  31212  7 years  davidb  Changed from mvn assembly to shadowing, which has more control
COMPILE.bat                75 bytes   30898  7 years  davidb  Scripts for downloading sample JSON data from public domain extracted …
COMPILE.sh                 17 bytes   32108  6 years  davidb  Useful breadcrumb for compiling
ef-solr.properties         1.3 KB     31267  7 years  davidb  Values trialed on gsliscluster1. Rekindling idea of per-vol processing
FULL-YARN-INGEST.sh        70 bytes   31598  7 years  davidb  Easier to remember what to do
FULL-YARN-KILL-EXAMPLE.sh  56 bytes   31676  7 years  davidb  To make it easier to remember how to kill off a YARN task at the …
JSONLIST-YARN-INGEST.sh    139 bytes  32109  6 years  davidb  Changes made after testing through YARN
pom.xml                    3.5 KB     31377  7 years  davidb  Switch to using URI not string
README.txt                 1.7 KB     30972  7 years  davidb  addition of useful command needed before re-running
serial-ef-solr.properties  517 bytes  32104  6 years  davidb  Serial version
wcsa-whitelist1.csv.gz     61.6 MB    31193  7 years  davidb  Peter's white-list file