source: other-projects/hathitrust/wcsa/extracted-features-solr/trunk/solr-ingest/scripts/FULL-RUN-MASTER-SPARK.sh@ 31268

Last change on this file since 31268 was 31268, checked in by davidb, 7 years ago

Adjustments to memory allocation in response to test runs on 10% of dataset

  • Property svn:executable set to *
File size: 750 bytes
Line 
1#!/bin/bash
2
3json_filelist=${1:-full-listing-step100000.txt}
4shift
5
6input_dir="hdfs://$SPARK_MASTER_HOST:9000/user/dbbridge/full-ef-json-files"
7#input_dir="hdfs://$SPARK_MASTER_HOST:9000/tmp/dbbridge/full-ef-json-files"
8#input_dir="hdfs://master:9000/user/htrc/full-ef-json-files"
9#input_dir="hdfs://10.10.0.52:9000/user/htrc/full-ef-json-files"
10
11#output_dir=hdfs://master:9000/user/htrc/full-solr-json-files
12solr_url="http://gc0:8983/solr/htrc-full-ef/update"
13
14#
15
16#master_opt="--driver-memory 50g --executor-memory 12g --conf spark.executor.extraJavaOptions=-XX:+HeapDumpOnOutOfMemoryError --master spark://$SPARK_MASTER_HOST:7077"
17master_opt="--driver-memory 50g --executor-memory 70g --master spark://$SPARK_MASTER_HOST:7077"
18
19. scripts/_RUN.sh
20
Note: See TracBrowser for help on using the repository browser.