source:
other-projects/maori-lang-detection@
33869
Name | Size | Rev | Age | Author | Last Change |
---|---|---|---|---|---|
../ | |||||
bin | 33581 | 5 years | Minor fix. Noticed when looking for work I did on MRI sentence detection | ||
ccrawl-data | 33572 | 5 years | Only meant to store the wet.gz versions of these files, not also the … | ||
conf | 33823 | 4 years | Recommitting mongo-data folder with renamed files with numbering. | ||
hdfs-cc-work | 33825 | 4 years | Beginnings of first draft of write up. | ||
journal-paper | 33856 | 4 years | Forgot to commit. Last week, Dr Bainbridge had properly cropped the … | ||
lib | 33788 | 4 years | Adding all the jar files needed to work in Java with geojson Simple … | ||
logs | 33401 | 5 years | MaoriTextDetector.class file now generated inside its package folder … | ||
models-trainingdata-and-sampletxts | 33588 | 5 years | Committing the MRI sentence model that I'm actually using, the one in … | ||
mongodb-data | 33868 | 4 years | With the updated code for generating the maps from 6a and 6b manual … | ||
MoreReading | 33849 | 4 years | One less Australian site as it was an infographic containing Maori … | ||
src | 33869 | 4 years | First cut at the RandomURLsForDomainGenerator.java class and the … | ||
apache-opennlp-1.9.1-bin.tar.gz | 10.6 MB | 33335 | 5 years | First java file for Māori language detection using openNLP with the … | |
crawledNode2.tar | 606.8 MB | 33800 | 4 years | Removed an adult site from crawled contents and added its url to … | |
crawledNode3.tar | 370.6 MB | 33609 | 5 years | The tar files containing the crawled sites data shouldn't be called … | |
crawledNode4.tar | 374.6 MB | 33609 | 5 years | The tar files containing the crawled sites data shouldn't be called … | |
crawledNode5.tar | 544.3 MB | 33617 | 5 years | Node5 is now full and here is the finished crawl (up to and including … | |
crawledNode6.tar | 84.6 MB | 33666 | 4 years | Having finished sending all the crawl data to mongodb 1. Recrawled the … | |
feasibility.txt | 761 bytes | 33394 | 5 years | 1. Started a file on feasibility with the data now available and some … | |
mri-opennlp-corpus.tar.gz | 8.3 MB | 33355 | 5 years | Changes for adding in the new gen_SentenceDetection_model.sh script, … | |
README.txt | 14.0 KB | 33398 | 5 years | Committing the actual package structure and the updated README after … | |
to_crawl.tar.gz | 1.4 MB | 33666 | 4 years | Having finished sending all the crawl data to mongodb 1. Recrawled the … |
Note:
See TracBrowser
for help on using the repository browser.