- Timestamp:
- 2019-09-13T17:44:41+12:00 (5 years ago)
- File:
-
- 1 edited
Legend:
- Unmodified
- Added
- Removed
-
gs3-extensions/maori-lang-detection/MoreReading/CommonCrawl.txt
r33457 r33467 47 47 Sebastian 48 48 49 ==================== 50 wharariki:[239]/Scratch/ak19/gs3-extensions/maori-lang-detection/src>java -cp ".:../conf:../lib/*" org.greenstone.atea.WETProcessor ../tmp/processWET /Scratch/ak19/gs3-extensions/maori-lang-detection/tmp/processedWET 51 52 wharariki:[188]/Scratch/ak19/gs3-extensions/maori-lang-detection/tmp/processedWET>ls keep | wc 53 4090 4090 65440 54 wharariki:[189]/Scratch/ak19/gs3-extensions/maori-lang-detection/tmp/processedWET>ls discard | wc 55 1515 1515 24240 56 57 We keep 4090 WET records and are discarding 1515. 58 49 59 ======================= 50 60 Latest version of the index's schema:
Note:
See TracChangeset
for help on using the changeset viewer.