source:
gs3-extensions/maori-lang-detection@
33606
Name | Size | Rev | Age | Author | Last Change |
---|---|---|---|---|---|
../ | |||||
bin | 33581 | 5 years | Minor fix. Noticed when looking for work I did on MRI sentence detection | ||
ccrawl-data | 33572 | 5 years | Only meant to store the wet.gz versions of these files, not also the … | ||
conf | 33604 | 5 years | 1. Better output into possible-product-sites.txt including the … | ||
hdfs-cc-work | 33598 | 5 years | More instructions on setting up Nutch now that I've remembered to … | ||
lib | 33603 | 5 years | Incorporating Dr Nichols suggestion to help weed out product sites: if … | ||
logs | 33401 | 5 years | MaoriTextDetector.class file now generated inside its package folder … | ||
models-trainingdata-and-sampletxts | 33588 | 5 years | Committing the MRI sentence model that I'm actually using, the one in … | ||
MoreReading | 33603 | 5 years | Incorporating Dr Nichols suggestion to help weed out product sites: if … | ||
src | 33604 | 5 years | 1. Better output into possible-product-sites.txt including the … | ||
apache-opennlp-1.9.1-bin.tar.gz | 10.6 MB | 33335 | 5 years | First java file for Māori language detection using openNLP with the … | |
crawledNode2.tar.gz | 606.8 MB | 33606 | 4 years | 1. Committing crawl data from node3 (2nd VM for nutch crawling). 2. … | |
crawledNode3.tar.gz | 370.6 MB | 33606 | 4 years | 1. Committing crawl data from node3 (2nd VM for nutch crawling). 2. … | |
crawledNode4.tar.gz | 357.3 MB | 33605 | 4 years | Node 4 VM still works, but committing first set of crawled sites on there | |
feasibility.txt | 761 bytes | 33394 | 5 years | 1. Started a file on feasibility with the data now available and some … | |
mri-opennlp-corpus.tar.gz | 8.3 MB | 33355 | 5 years | Changes for adding in the new gen_SentenceDetection_model.sh script, … | |
README.txt | 14.0 KB | 33398 | 5 years | Committing the actual package structure and the updated README after … |
Note:
See TracBrowser
for help on using the repository browser.