source: other-projects/maori-lang-detection/mongodb-data/6counts_sitesWithPagesContainingMRI_manualShortlist.json@ 33979

Last change on this file since 33979 was 33979, checked in by ak19, 4 years ago

Clearly stating that counts are of unique domains

File size: 861 bytes
Line 
1/*
2Uses ManualShortlisting2 .txt file. (ManualShortlisting2_afterMongoDBReingest.txt) Counts are of UNIQUE domain names, after protocol and www are stripped.
3
4Manually inspected UNIQUE non-NZ websites in tentativeNonProductSites1.json
5and made a list of sites with genuine Maori language content for each country.
6
7Includes 4 more sites from US with mi in URL path that do not appear to be autotranslated.
8See file 7miInURLPath_exclNZ_byCountryCode.json
9*/
10{
11 "_id" : "nz",
12 "count" : 113.0
13}
14{
15 "_id" : "us",
16 "count" : 33.0
17}
18{
19 "_id" : "de",
20 "count" : 2.0
21}
22{
23 "_id" : "dk",
24 "count" : 2.0
25}
26{
27 "_id" : "au",
28 "count" : 1.0
29}
30{
31 "_id" : "bg",
32 "count" : 1.0
33}
34{
35 "_id" : "cz",
36 "count" : 1.0
37}
38{
39 "_id" : "es",
40 "count" : 1.0
41}
42{
43 "_id" : "fr",
44 "count" : 1.0
45}
46{
47 "_id" : "ie",
48 "count" : 1.0
49}
50
Note: See TracBrowser for help on using the repository browser.