Changeset 35529
- Timestamp:
- 2021-09-29T15:58:16+13:00 (3 years ago)
- File:
-
- 1 edited
Legend:
- Unmodified
- Added
- Removed
-
other-projects/maori-lang-detection/README.txt
r33398 r35529 20 20 - the script works on opennlp's leipzig corpus of 100k Maori sentences from 2011 to get its sample sentences into the correct format in the mri-sent.train file 21 21 - from this file containing training sentences, it generates the Sentence Detector Model, mri-sent_trained.bin 22 - mri-opennlp-corpus.tar.gz: a tarball containing the 100k Maori sentences opennlp corpus checked out with svn in its original directory structure from https://svn.apache.org/repos/bigdata/opennlp/trunk/ mri_web_2011_100K-sentences.txt22 - mri-opennlp-corpus.tar.gz: a tarball containing the 100k Maori sentences opennlp corpus checked out with svn in its original directory structure from https://svn.apache.org/repos/bigdata/opennlp/trunk/leipzig/data/mri_web_2011_100K-sentences.txt (previously obtained from https://svn.apache.org/repos/bigdata/opennlp/trunk/mri_web_2011_100K-sentences.txt) 23 23 24 24
Note:
See TracChangeset
for help on using the changeset viewer.