---- Introduction ---- Vargrant provisioning files to spin up a modest Solr cluster/cloud (zookeeper + 3 solr nodes) for experiments in ingesting HTRC Extracted Feature JSON files, which in turn has been processed by a Hadoop cluster. *Assumptions* * You have VirtualBox and Vagrant installed (at time of writing VirtualBox v5.0.28, Vagrant 1.8.6) *Useful* * Installing the Vagrant VirutalBox Guest Additions plugin to stop warnings about potentially incompatible versions: vagrant plugin install vagrant-vbguest ---- Setup Procedure ---- 1. Spin up cloud: vagrant up If this is the first time run, then this can take a while to run. 2. Log into 'zookeeper' and change to the 'htrc' user: vagrant ssh zookeeper sudo su - htrc 3. Start the zookeeper daemon: zkServer.sh start 4. Use the 'start-solr-all.sh' script on 'zookeeper' to launch Solr daemons processes on each solr node in the cluster/cloud. ./solr-start-all.sh This runs a (password authenticated) ssh command on each node to start its solr instance. 5. If the first time the solr cloud has been spun up then create a solr collection for the htrc extracted features to be ingested into (htrc-pd-ef) ssh solr1 init-solr-collection.sh It doesn't matter which solr node is used to create the collection 'solr1' chosen for convenience. Now view the information in the Solr cloud through: http://10.11.0.53:8983/solr