# # ChangeLog for gs3-extensions/maori-lang-detection/hdfs-cc-work/GS_README.TXT # # Generated by Trac 1.4.2 # 2024-06-20T19:41:52+12:00 Thu, 03 Oct 2019 09:38:00 GMT ak19 [33545] * gs3-extensions/maori-lang-detection/MoreReading/Vagrant-Spark-Hadoop.txt (modified) * gs3-extensions/maori-lang-detection/MoreReading/crawling-Nutch.txt (modified) * gs3-extensions/maori-lang-detection/hdfs-cc-work/GS_README.TXT (modified) Mainly changes to crawling-Nutch.txt and some minor changes to other ... Wed, 02 Oct 2019 04:01:47 GMT ak19 [33543] * gs3-extensions/maori-lang-detection/hdfs-cc-work/GS_README.TXT (modified) * gs3-extensions/maori-lang-detection/hdfs-cc-work/vagrant-for-nutch2.tar.gz (modified) Filled in some missing instructions Tue, 01 Oct 2019 09:27:03 GMT ak19 [33541] * gs3-extensions/maori-lang-detection/MoreReading/crawling-Nutch.txt (modified) * gs3-extensions/maori-lang-detection/hdfs-cc-work/GS_README.TXT (modified) * gs3-extensions/maori-lang-detection/hdfs-cc-work/patches/GZRangeClient.java (added) * gs3-extensions/maori-lang-detection/hdfs-cc-work/patches/WATExtractorOutput.java (added) 1. hdfs-cc-work/GS_README.txt now contains the complete instructions ... Tue, 01 Oct 2019 08:36:38 GMT ak19 [33539] * gs3-extensions/maori-lang-detection/hdfs-cc-work/GS_README.TXT (moved) File rename Tue, 01 Oct 2019 08:36:06 GMT ak19 [33538] * gs3-extensions/maori-lang-detection/hdfs-cc-work/Readme.txt (modified) * gs3-extensions/maori-lang-detection/hdfs-cc-work/scripts/setup.sh (modified) Some additions to the setup.sh script to query commoncrawl for MRI ...