Changeset 33565 for gs3-extensions/maori-lang-detection/conf/sites-too-big-to-exhaustively-crawl.txt
- Timestamp:
- 2019-10-14T21:04:58+13:00 (5 years ago)
- File:
-
- 1 edited
Legend:
- Unmodified
- Added
- Removed
-
gs3-extensions/maori-lang-detection/conf/sites-too-big-to-exhaustively-crawl.txt
r33562 r33565 50 50 # column 3: whether nutch should do fetch all or not 51 51 # column 4: number of crawl iterations 52 53 54 # NOT TOP SITES, BUT SITES WE INSPECTED AND WANT TO CONTROL SIMILARLY TO TOP SITES 55 00.gs,SINGLEPAGE 56 57 58 # TOP SITES 52 59 53 60 # docs.google.com is a special case: not all pages are public and any interlinking is likely to
Note:
See TracChangeset
for help on using the changeset viewer.