source:
other-projects/hathitrust/wcsa/extracted-features-solr/trunk/solr-ingest@
32107
Name | Size | Rev | Age | Author | Last Change |
---|---|---|---|---|---|
../ | |||||
.settings | 31183 | 7 years | Bump up to project using Java 1.8 | ||
opennlp-lang-pos-mappings | 31376 | 7 years | Universal language mappings for opennlp POS model tags | ||
packages | 31092 | 7 years | Minor tweak to spark/hadoop combination downloaded | ||
scripts | 32107 | 6 years | Rekindling the ability to run a JSON-filelist Spark run via YARN | ||
src | 32106 | 6 years | Rekindle ability to process a json-filelist.txt using Spark | ||
.classpath | 1.0 KB | 31183 | 7 years | Bump up to project using Java 1.8 | |
.project | 566 bytes | 30899 | 7 years | Files for compilation using Eclipse | |
COMPILE.bash | 165 bytes | 31212 | 7 years | Changed from mvn assemblhy to shadowing, which has more control | |
COMPILE.bat | 75 bytes | 30898 | 7 years | Scripts for downloading sample JSON data from public domain extracted … | |
ef-solr.properties | 1.3 KB | 31267 | 7 years | Values trialed on gsliscluster1. Rekindling idea of per-vol processing | |
FULL-YARN-INGEST.sh | 70 bytes | 31598 | 6 years | Easier to remember what to do | |
FULL-YARN-KILL-EXAMPLE.sh | 56 bytes | 31676 | 6 years | To make it easier to remember how to kill off a YARN task at the … | |
JSONLIST-YARN-INGEST.sh | 91 bytes | 32107 | 6 years | Rekindling the ability to run a JSON-filelist Spark run via YARN | |
pom.xml | 3.5 KB | 31377 | 7 years | Switch to using URI not string | |
README.txt | 1.7 KB | 30972 | 7 years | addition of useful command needed before re-running | |
serial-ef-solr.properties | 517 bytes | 32104 | 6 years | Serial version | |
wcsa-whitelist1.csv.gz | 61.6 MB | 31193 | 7 years | Peter's white-list file |
Note:
See TracBrowser
for help on using the repository browser.