source: gs3-extensions/maori-lang-detection/src/org/greenstone/atea@ 33587

Name Size Rev Age Author Last Change
../
CCWETProcessor.java 36.9 KB 33582   5 years ak19 NutchTextDumpProcessor prints each crawled site's stats: number of …
MaoriTextDetector.java 12.7 KB 33587   4 years ak19 1. Better stats reporting on crawled sites: not just if a page was in …
MRIWebPageStats.java 1.4 KB 33587   4 years ak19 1. Better stats reporting on crawled sites: not just if a page was in …
NutchTextDumpProcessor.java 11.8 KB 33587   4 years ak19 1. Better stats reporting on crawled sites: not just if a page was in …
NZTLDProcessor.java 15.3 KB 33466   5 years ak19 1. WETProcessor.main() now processes a folder of *.warc.wet(.gz) …
TextDumpPage.java 4.9 KB 33582   5 years ak19 NutchTextDumpProcessor prints each crawled site's stats: number of …
TextLanguageDetector.java 14.4 KB 33587   4 years ak19 1. Better stats reporting on crawled sites: not just if a page was in …
Utility.java 1.2 KB 33467   5 years ak19 Improved the code to use a static block to load the needed properties …
WETProcessor.java 13.5 KB 33573   5 years ak19 Forgot to document that spaces were also allowed as separator in the …
Note: See TracBrowser for help on using the repository browser.