source: other-projects/maori-lang-detection/src/org/greenstone/atea @ 33978

| Name | Size | Rev | Age | Author | Last Change |
|---|---|---|---|---|---|
| ../ | | | | | |
| morphia | | 33909 | 4 years | ak19 | 1. Implementing tables 3 to 5. 2. Rolled back the introduction of the … |
| CCWETProcessor.java | 40.3 KB | 33666 | 4 years | ak19 | Having finished sending all the crawl data to mongodb 1. Recrawled the … |
| CountryCodeCountsMapData.java | 31.7 KB | 33978 | 4 years | ak19 | Opens all geoJSON maps in new tabs instead of waiting for user to have … |
| ManualURLInspection.java | 30.1 KB | 33965 | 4 years | ak19 | 1. Adding a basicDomain column (stripped of http/https and www prefix) … |
| MaoriTextDetector.java | 12.9 KB | 33615 | 4 years | ak19 | 1. Worked out how to configure log4j to log both to console and … |
| MongoDBAccess.java | 9.8 KB | 33911 | 4 years | ak19 | Correct commit message for previous and current commit: 1. After … |
| MongoDBQueryer.java | 35.1 KB | 33963 | 4 years | ak19 | Added a new helper method to MongoDBQueryer.java to add numPagesInMRI … |
| MRIWebPageStats.java | 1.7 KB | 33602 | 4 years | ak19 | 1. The final csv file, mri-sentences.csv, is now written out. 2. Only … |
| NutchTextDumpToCSV.java | 16.5 KB | 33634 | 4 years | ak19 | Rewrote NutchTextDumpProcessor as NutchTextDumpToMongoDB.java, which … |
| NutchTextDumpToMongoDB.java | 16.0 KB | 33909 | 4 years | ak19 | 1. Implementing tables 3 to 5. 2. Rolled back the introduction of the … |
| NZTLDProcessor.java | 15.3 KB | 33466 | 5 years | ak19 | 1. WETProcessor.main() now processes a folder of *.warc.wet(.gz) … |
| RandomURLsForDomainGenerator.java | 3.3 KB | 33883 | 4 years | ak19 | Clarifications |
| SummaryTool.java | 19.0 KB | 33978 | 4 years | ak19 | Opens all geoJSON maps in new tabs instead of waiting for user to have … |
| TextDumpPage.java | 6.4 KB | 33652 | 4 years | ak19 | Introducing morphia subpackage |
| TextLanguageDetector.java | 17.7 KB | 33698 | 4 years | ak19 | Links to more reading |
| Utility.java | 5.7 KB | 33887 | 4 years | ak19 | 1. Added support for writing out tables in csv format too. 2. Second … |
| WETProcessor.java | 13.1 KB | 33615 | 4 years | ak19 | 1. Worked out how to configure log4j to log both to console and … |