source: other-projects/maori-lang-detection/src/org/greenstone/atea@ 33652

Name Size Rev Age Author Last Change
../
morphia 33652   4 years ak19 Introducing morphia subpackage
CCWETProcessor.java 39.2 KB 33624   4 years ak19 Some cleanup surrounding the now renamed function createSeedURLsFile, …
MaoriTextDetector.java 12.9 KB 33615   4 years ak19 1. Worked out how to configure log4j to log both to console and …
MongoDBAccess.java 9.7 KB 33652   4 years ak19 Introducing morphia subpackage
MRIWebPageStats.java 1.7 KB 33602   4 years ak19 1. The final csv file, mri-sentences.csv, is now written out. 2. Only …
NutchTextDumpToCSV.java 16.5 KB 33634   4 years ak19 Rewrote NutchTextDumpProcessor as NutchTextDumpToMongoDB.java, which …
NutchTextDumpToMongoDB.java 13.7 KB 33652   4 years ak19 Introducing morphia subpackage
NZTLDProcessor.java 15.3 KB 33466   5 years ak19 1. WETProcessor.main() now processes a folder of *.warc.wet(.gz) …
SentenceInfo.java 379 bytes 33634   4 years ak19 Rewrote NutchTextDumpProcessor as NutchTextDumpToMongoDB.java, which …
TextDumpPage.java 6.4 KB 33652   4 years ak19 Introducing morphia subpackage
TextLanguageDetector.java 16.5 KB 33652   4 years ak19 Introducing morphia subpackage
Utility.java 5.0 KB 33623   4 years ak19 1. Incorporated Dr Nichols earlier suggestion of storing page modified …
WebpageInfo.java 1.4 KB 33651   4 years ak19 1. Bugfix: overlappingSentences works. 2. storing numSentencesInMaor
WebsiteInfo.java 1.3 KB 33634   4 years ak19 Rewrote NutchTextDumpProcessor as NutchTextDumpToMongoDB.java, which …
WETProcessor.java 13.1 KB 33615   4 years ak19 1. Worked out how to configure log4j to log both to console and …
Note: See TracBrowser for help on using the repository browser.