#
# ChangeLog for gs3-extensions/maori-lang-detection/conf
#
# Generated by Trac 1.4.2
# 2024-05-21T06:16:40+12:00

Fri, 04 Oct 2019 06:35:06 GMT ak19 [33551]
	* gs3-extensions/maori-lang-detection/conf/sites-too-big-to-exhaustively-crawl.txt (modified)

	Added in top 500 urls from moz.com/top500 and removed duplicates, and ...

Fri, 04 Oct 2019 06:06:51 GMT ak19 [33550]
	* gs3-extensions/maori-lang-detection/conf/sites-too-big-to-exhaustively-crawl.txt (added)
	* gs3-extensions/maori-lang-detection/conf/url-greylist-filter.txt (modified)

	First stage of introducing sites-too-big-to-exhaustively-crawl.tx: ...

Thu, 26 Sep 2019 11:06:11 GMT ak19 [33532]
	* gs3-extensions/maori-lang-detection/conf/url-greylist-filter.txt (modified)

	Found the other top 500 sites link again at last which Dr Bainbridge ...

Thu, 26 Sep 2019 11:03:01 GMT ak19 [33531]
	* gs3-extensions/maori-lang-detection/conf/url-blacklist-filter.txt (modified)
	* gs3-extensions/maori-lang-detection/conf/url-greylist-filter.txt (modified)
	* gs3-extensions/maori-lang-detection/conf/url-whitelist-filter.txt (added)

	Added whitelist for mi.wikipedia.org, and updates to blacklist and ...

Mon, 23 Sep 2019 11:11:29 GMT ak19 [33502]
	* gs3-extensions/maori-lang-detection/conf/url-blacklist-filter.txt (added)
	* gs3-extensions/maori-lang-detection/conf/url-greylist-filter.txt (added)

	Current url pattern blacklist and greylist filter files. Used by ...

Mon, 16 Sep 2019 07:45:01 GMT ak19 [33480]
	* gs3-extensions/maori-lang-detection/conf/config.properties (modified)
	* gs3-extensions/maori-lang-detection/src/org/greenstone/atea/WETProcessor.java (modified)

	Much harder to remove pages where words are fused together as some ...

Fri, 13 Sep 2019 05:44:41 GMT ak19 [33467]
	* gs3-extensions/maori-lang-detection/MoreReading/CommonCrawl.txt (modified)
	* gs3-extensions/maori-lang-detection/MoreReading/Vagrant-Spark-Hadoop.txt (modified)
	* gs3-extensions/maori-lang-detection/conf/config.properties (modified)
	* gs3-extensions/maori-lang-detection/src/org/greenstone/atea/Utility.java (modified)
	* gs3-extensions/maori-lang-detection/src/org/greenstone/atea/WETProcessor.java (modified)

	Improved the code to use a static block to load the needed properties ...

Tue, 13 Aug 2019 09:54:31 GMT ak19 [33412]
	* gs3-extensions/maori-lang-detection/conf/config.properties (modified)

	config command for wgetting a single file

Sun, 11 Aug 2019 09:15:26 GMT ak19 [33400]
	* gs3-extensions/maori-lang-detection/conf/log4j.properties (added)
	* gs3-extensions/maori-lang-detection/conf/log4j.properties.in (added)
	* gs3-extensions/maori-lang-detection/lib/log4j-1.2.8.jar (added)

	1. Setting up log4j.properties based on the macronizer's basic one ...

Sun, 11 Aug 2019 08:48:54 GMT ak19 [33399]
	* gs3-extensions/maori-lang-detection/conf (added)
	* gs3-extensions/maori-lang-detection/conf/config.properties (moved)
	* gs3-extensions/maori-lang-detection/lib/gutil.jar (added)

	Putting properties files into the conf folder and keeping the lib ...