source: other-projects/maori-lang-detection/src/org/greenstone/atea @ 33881
Name | Size | Rev | Age | Author | Last Change
---|---|---|---|---|---
morphia | | 33811 | 5 years | | Returning to using a single variable, urlContainsLangCodeInPath, to …
WETProcessor.java | 13.1 KB | 33615 | 5 years | | 1. Worked out how to configure log4j to log both to console and …
WebPageURLsListing.java | 5.4 KB | 33880 | 4 years | | Write out the 5counts_tentativeNonAutotranslatedSites.json file with …
Utility.java | 5.5 KB | 33666 | 5 years | | Having finished sending all the crawl data to mongodb 1. Recrawled the …
TextLanguageDetector.java | 17.7 KB | 33698 | 5 years | | Links to more reading
TextDumpPage.java | 6.4 KB | 33652 | 5 years | | Introducing morphia subpackage
RandomURLsForDomainGenerator.java | 3.3 KB | 33871 | 4 years | | Removed mostly duplicated older version of method but left the …
NZTLDProcessor.java | 15.3 KB | 33466 | 5 years | | 1. WETProcessor.main() now processes a folder of *.warc.wet(.gz) …
NutchTextDumpToMongoDB.java | 15.8 KB | 33811 | 5 years | | Returning to using a single variable, urlContainsLangCodeInPath, to …
NutchTextDumpToCSV.java | 16.5 KB | 33634 | 5 years | | Rewrote NutchTextDumpProcessor as NutchTextDumpToMongoDB.java, which …
MRIWebPageStats.java | 1.7 KB | 33602 | 5 years | | 1. The final csv file, mri-sentences.csv, is now written out. 2. Only …
MongoDBAccess.java | 20.6 KB | 33881 | 4 years | | Uses lambda expression to process each doc in a mongodb aggregate …
MaoriTextDetector.java | 12.9 KB | 33615 | 5 years | | 1. Worked out how to configure log4j to log both to console and …
CountryCodeCountsMapData.java | 22.7 KB | 33869 | 4 years | | First cut at the RandomURLsForDomainGenerator.java class and the …
CCWETProcessor.java | 40.3 KB | 33666 | 5 years | | Having finished sending all the crawl data to mongodb 1. Recrawled the …