source:
gs3-extensions/maori-lang-detection@
33573
Name | Size | Rev | Age | Author | Last Change |
---|---|---|---|---|---|
../ | |||||
bin | 33526 | 5 years | Moved hadoop related scripts from bin/script into hdfs-instructions | ||
ccrawl-data | 33572 | 4 years | Only meant to store the wet.gz versions of these files, not also the … | ||
conf | 33569 | 4 years | 1. batchcrawl.sh now does what it should have from the start, which is … | ||
hdfs-cc-work | 33573 | 4 years | Forgot to document that spaces were also allowed as separator in the … | ||
lib | 33562 | 4 years | 1. The sites-too-big-to-exhaustively-crawl.txt is now a csv file of a … | ||
logs | 33401 | 5 years | MaoriTextDetector.class file now generated inside its package folder … | ||
models-trainingdata-and-sampletxts | 33355 | 5 years | Changes for adding in the new gen_SentenceDetection_model.sh script, … | ||
MoreReading | 33565 | 4 years | CCWETProcessor: domain url now goes in as a seedURL after the … | ||
src | 33573 | 4 years | Forgot to document that spaces were also allowed as separator in the … | ||
apache-opennlp-1.9.1-bin.tar.gz | 10.6 MB | 33335 | 5 years | First java file for Māori language detection using openNLP with the … | |
feasibility.txt | 761 bytes | 33394 | 5 years | 1. Started a file on feasibility with the data now available and some … | |
mri-opennlp-corpus.tar.gz | 8.3 MB | 33355 | 5 years | Changes for adding in the new gen_SentenceDetection_model.sh script, … | |
README.txt | 14.0 KB | 33398 | 5 years | Committing the actual package structure and the updated README after … |
Note:
See TracBrowser
for help on using the repository browser.