#
# ChangeLog for gs3-extensions/maori-lang-detection/hdfs-cc-work/scripts
#
# Generated by Trac 1.4.2
# 2024-06-21T10:30:09+12:00

Mon, 14 Oct 2019 09:07:45 GMT ak19 [33566]
	* gs3-extensions/maori-lang-detection/hdfs-cc-work/scripts/batchcrawl.sh (modified)

	batchcrawl.sh script now supports taking a comma or space separated ...

Mon, 14 Oct 2019 08:01:17 GMT ak19 [33564]
	* gs3-extensions/maori-lang-detection/hdfs-cc-work/scripts/batchcrawl.sh (modified)

	batchcrawl.sh now does the crawl and logs output of the crawl, dumps ...

Fri, 11 Oct 2019 10:29:40 GMT ak19 [33563]
	* gs3-extensions/maori-lang-detection/hdfs-cc-work/scripts/batchcrawl.sh (added)

	Committing inactive testing batch scripts (only creates the regex- ...

Tue, 01 Oct 2019 08:36:06 GMT ak19 [33538]
	* gs3-extensions/maori-lang-detection/hdfs-cc-work/Readme.txt (modified)
	* gs3-extensions/maori-lang-detection/hdfs-cc-work/scripts/setup.sh (modified)

	Some additions to the setup.sh script to query commoncrawl for MRI ...

Mon, 30 Sep 2019 03:49:19 GMT ak19 [33535]
	* gs3-extensions/maori-lang-detection/hdfs-cc-work/Readme.txt (modified)
	* gs3-extensions/maori-lang-detection/hdfs-cc-work/scripts/setup.sh (added)

	1. New setup.sh script for on a hadoop system to setup the git ...

Fri, 27 Sep 2019 05:05:40 GMT ak19 [33534]
	* gs3-extensions/maori-lang-detection/hdfs-cc-work/scripts/get_Maori_WET_records_from_CCSep2018_on.sh (modified)

	Correction: toplevel script has to be placed inside cc-index-table ...

Thu, 26 Sep 2019 08:39:38 GMT ak19 [33527]
	* gs3-extensions/maori-lang-detection/hdfs-cc-work (moved)

	Name change for folder

Thu, 26 Sep 2019 08:38:14 GMT ak19 [33526]
	* gs3-extensions/maori-lang-detection/bin/script/get_Maori_WET_records_from_CCSep2018_on.sh (deleted)
	* gs3-extensions/maori-lang-detection/bin/script/get_maori_WET_records_for_crawl.sh (deleted)
	* gs3-extensions/maori-lang-detection/hdfs-instructions/scripts/get_Maori_WET_records_from_CCSep2018_on.sh (modified)

	Moved hadoop related scripts from bin/script into hdfs-instructions