source: gs3-extensions/maori-lang-detection/models-trainingdata-and-sampletxts/sample_mri_paragraphs.txt@ 33355

Last change on this file since 33355 was 33355, checked in by ak19, 5 years ago

Changes for adding in the new gen_SentenceDetection_model.sh script, which automates generating a Sentence Detector model for the Maori language, mri-sent_trained.bin, trained on the mri-sent.train file generated by appropritely formatting the 100k Maori sentences file from the opennlp corpus 2011

File size: 672 bytes
Line 
1Kua whakaemitia mai ki konei he kohikohinga niupepa i tāngia mō ngā kaipānui Māori o ngā tau 1842-1932. E taea te pānui niupepa mā te rapu kupu, i te rārangi taitara, me te rārangi wātaka hoki. He mea i whakatūria e te kaupapa New Zealand Digital Library, i te Tari Rorohiko, o te Whare Wānanga o Waikato. Kei raro nei he kupu whakamārama e pā ana ki te kohikohinga nei. Ka nui te mihi ki ngā rōpū tautoko i te kaupapa nei. Nā tā rātou tautoko kua kore he utu, mā koutou, mō te titiro ki ngā niupepa nei. Ko te Tāhuhu o te Mātauranga tērā, me ētehi whare pukapuka e noho mai nei ki ngā whare wānanga o te motu. Tirohia ngā rōpū tautoko.
Note: See TracBrowser for help on using the repository browser.