Changeset 30905 for other-projects

Show
Ignore:
Timestamp:
23.10.2016 18:32:23 (3 years ago)
Author:
davidb
Message:

Additional resources

Files:
1 modified

Legend:

Unmodified
Added
Removed
  • other-projects/hathitrust/vagrant-hadoop-cluster/trunk/README.txt

    r30904 r30905  
    33cluster for experiments processing HTRC Extracted Feature JSON files 
    44suitable for ingesting into Solr. 
     5 
    56 
    67Top-level code Apache Spark, processing HDFS stored JSON files, hence 
     
    1415  https://github.com/calo81/vagrant-hadoop-cluster 
    1516 
     17 
     18Supporting Resources 
     19==================== 
     20 
     21---- 
     22Basic Hadoop Cluster 
     23---- 
    1624 
    1725Useful documentation about setting up a Hadoop cluster, read: 
     
    2735  https://xuri.me/2016/03/22/setup-hadoop-on-ubuntu-multi-node-cluster.html 
    2836 
     37For working with newer Linux OS and version of software: 
    2938 
     39  http://www.bogotobogo.com/Hadoop/BigData_hadoop_Install_on_ubuntu_single_node_cluster.php 
    3040 
     41---- 
     42Hadoop + Apache Ambari in 3 lines: 
     43---- 
     44 
     45  https://blog.codecentric.de/en/2014/04/hadoop-cluster-automation/ 
     46 
     47but looks like a fairly old version of software (currently unused). 
     48 
     49---- 
     50Vagrant 
     51---- 
     52 
     53To get rid of 'Guest Additions' warnins (about potentially 
     54incompatible version numbers) use 'vbguest' plugin: 
     55 
     56  vagrant plugin install vagrant-vbguest 
     57 
     58For more details see: 
     59 
     60http://kvz.io/blog/2013/01/16/vagrant-tip-keep-virtualbox-guest-additions-in-sync/ 
     61