Ignore:
Timestamp:
2016-10-23T18:32:23+13:00 (8 years ago)
Author:
davidb
Message:

Additional resources

File:
1 edited

Legend:

Unmodified
Added
Removed
  • other-projects/hathitrust/vagrant-hadoop-cluster/trunk/README.txt

    r30904 r30905  
    33cluster for experiments processing HTRC Extracted Feature JSON files
    44suitable for ingesting into Solr.
     5
    56
    67Top-level code Apache Spark, processing HDFS stored JSON files, hence
     
    1415  https://github.com/calo81/vagrant-hadoop-cluster
    1516
     17
     18Supporting Resources
     19====================
     20
     21----
     22Basic Hadoop Cluster
     23----
    1624
    1725Useful documentation about setting up a Hadoop cluster, read:
     
    2735  https://xuri.me/2016/03/22/setup-hadoop-on-ubuntu-multi-node-cluster.html
    2836
     37For working with newer Linux OS and version of software:
    2938
     39  http://www.bogotobogo.com/Hadoop/BigData_hadoop_Install_on_ubuntu_single_node_cluster.php
    3040
     41----
     42Hadoop + Apache Ambari in 3 lines:
     43----
     44
     45  https://blog.codecentric.de/en/2014/04/hadoop-cluster-automation/
     46
     47but looks like a fairly old version of software (currently unused).
     48
     49----
     50Vagrant
     51----
     52
     53To get rid of 'Guest Additions' warnins (about potentially
     54incompatible version numbers) use 'vbguest' plugin:
     55
     56  vagrant plugin install vagrant-vbguest
     57
     58For more details see:
     59
     60http://kvz.io/blog/2013/01/16/vagrant-tip-keep-virtualbox-guest-additions-in-sync/
     61
Note: See TracChangeset for help on using the changeset viewer.