Changeset 33554 for gs3-extensions/maori-lang-detection/conf/sites-too-big-to-exhaustively-crawl.txt
Timestamp: 2019-10-09T18:11:19+13:00 (5 years ago)
File: 1 edited
gs3-extensions/maori-lang-detection/conf/sites-too-big-to-exhaustively-crawl.txt
r33553  r33554
     1          - # URL blacklist
     2          - # FORMAT:
     3          - # precede URL by ^ to blacklist urls that match the given prefix
     4          - # succeed URL by $ to blacklist urls that match the given suffix
     5          - # ^url$ will blacklist urls that match the given url completely
     6          - # Without either ^ or $ symbol, urls containing the given url will get blacklisted
             1  + # top sites - base url forms
     7       2
     8       3    # Contains alexa top sites (where only the first 50 were visible)
     …       …
   441     436    wikimedia.org
   442     437    wikipedia.org
   443          - wikipedia.org
   444          - wikipedia.org
   445     438    wiktionary.org
   446     439    wiley.com
     …       …
   457     450    yahoo.co.
   458     451    yahoo.com
   459          - yahoo.com
   460     452    yale.edu
   461     453    yandex.ru
     …       …
   469     461    zendesk.com
   470     462
   471          -
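The header comments removed in r33554 describe four matching rules (`^` for prefix, `$` for suffix, `^url$` for an exact match, and plain substring otherwise), and the same changeset drops repeated `wikipedia.org` and `yahoo.com` entries. The following Python sketch illustrates both ideas. It is not the project's own code: the function names `matches_rule` and `load_rules` are hypothetical, and treating lines that start with `#` as comments is an assumption based on how the file is written.

```python
def matches_rule(url: str, rule: str) -> bool:
    """Apply one rule, as described in the removed header comments, to a URL."""
    anchored_start = rule.startswith("^")
    anchored_end = rule.endswith("$")
    # Strip the anchor symbols to get the literal text to compare against.
    core = rule[1:] if anchored_start else rule
    core = core[:-1] if anchored_end else core
    if anchored_start and anchored_end:
        return url == core            # ^url$ : complete match
    if anchored_start:
        return url.startswith(core)   # ^url  : prefix match
    if anchored_end:
        return url.endswith(core)     # url$  : suffix match
    return core in url                # url   : substring match


def load_rules(text: str) -> list[str]:
    """Return non-blank, non-comment lines, dropping repeats but keeping order."""
    rules: list[str] = []
    seen: set[str] = set()
    for line in text.splitlines():
        entry = line.strip()
        # Assumption: lines beginning with '#' are comments.
        if not entry or entry.startswith("#"):
            continue
        if entry not in seen:
            seen.add(entry)
            rules.append(entry)
    return rules
```

With this sketch, `load_rules` applied to a list containing `wikipedia.org` three times keeps a single entry, which is the same effect the changeset achieves by hand.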