Changeset 33980

Timestamp:
26.02.2020 21:11:58
Author:
ak19
Message:

Additional comments

Files:
1 modified

  • other-projects/maori-lang-detection/mongodb-data/6counts_sitesWithPagesContainingMRI_manualShortlist.json

    --- r33979
    +++ r33980
     /*
    -Uses ManualShortlisting2 .txt file. (ManualShortlisting2_afterMongoDBReingest.txt) Counts are of UNIQUE domain names, after protocol and www are stripped.
    +This file was manually created and uses the ManualShortlisting2 .txt file (ManualShortlisting2_afterMongoDBReingest.txt). Counts are of UNIQUE domain names, after protocol and www are stripped.
     
    -Manually inspected UNIQUE non-NZ websites in tentativeNonProductSites1.json
    +The counts in ManualShortlisting2 .txt are from:
    +- Manually inspected UNIQUE non-NZ websites in tentativeNonProductSites1.json
     and made a list of sites with genuine Maori language content for each country.
    -
    -Includes 4 more sites from US with mi in URL path that do not appear to be autotranslated.
    +- Includes 4 more sites from US with mi in URL path that do not appear to be autotranslated.
     See file 7miInURLPath_exclNZ_byCountryCode.json
     */
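
The comment in this changeset says the counts are of unique domain names after the protocol and www are stripped. The file itself was created manually, so the following is only a minimal Python sketch of how such a count could be reproduced, not the project's own code. The function names normalize_domain and count_unique_domains are hypothetical, and the URLs in the usage note are placeholders.

```python
import re
from urllib.parse import urlsplit

# Matches a leading scheme such as "http://" or "https://".
_SCHEME = re.compile(r"^[a-zA-Z][a-zA-Z0-9+.-]*://")


def normalize_domain(url: str) -> str:
    """Return the host of `url`, lowercased, without protocol or a leading 'www.'.

    Hypothetical helper: an assumption about what "protocol and www
    are stripped" means, not taken from the project.
    """
    url = url.strip()
    # urlsplit only recognises the host when a scheme or "//" precedes it.
    if not _SCHEME.match(url) and not url.startswith("//"):
        url = "//" + url
    host = urlsplit(url).hostname or ""  # hostname is already lowercased
    if host.startswith("www."):
        host = host[len("www."):]
    return host


def count_unique_domains(urls) -> int:
    """Count distinct normalized domains, ignoring entries with no host."""
    domains = {normalize_domain(u) for u in urls}
    domains.discard("")
    return len(domains)
```

For example, count_unique_domains(["http://www.example.org/a", "https://example.org/b"]) treats both URLs as the single domain example.org.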