Changeset 33556
- Timestamp:
- 2019-10-09T18:58:30+13:00 (5 years ago)
- File:
-
- 1 edited
Legend:
- Unmodified
- Added
- Removed
-
gs3-extensions/maori-lang-detection/conf/url-blacklist-filter.txt
r33554 r33556 6 6 # Without either ^ or $ symbol, urls containing the given url will get blacklisted 7 7 8 9 # wikipedia pages in 10 # ksh (a German dialect), ilo (Filippino), ty Tahitian, wa for Walons/Walloon, 11 # io (Ido version of Esperanto) and zh-min-nan (Min-Nan-Chinese) are not in the Maori language 12 # Not sure why Commoncrawl had found them for language code MRI 13 ksh.wikipedia.org 14 ilo.wikipedia.org 15 wa.wikipedia.org 16 ty.m.wikipedia.org 17 io.m.wikipedia.org 18 zh-min-nan.wikipedia.org 19 zh-min-nan.wiktionary.org 8 20 9 21 # unwanted domains
Note:
See TracChangeset
for help on using the changeset viewer.