Changeset 33980 for other-projects


Timestamp:
2020-02-26T21:11:58+13:00
Author:
ak19
Message:

Additional comments

File:
1 edited

  • other-projects/maori-lang-detection/mongodb-data/6counts_sitesWithPagesContainingMRI_manualShortlist.json

    r33979 r33980
      /*
    - Uses ManualShortlisting2 .txt file. (ManualShortlisting2_afterMongoDBReingest.txt) Counts are of UNIQUE domain names, after protocol and www are stripped.
    + This file was manually created and uses the ManualShortlisting2 .txt file (ManualShortlisting2_afterMongoDBReingest.txt). Counts are of UNIQUE domain names, after protocol and www are stripped.

    - Manually inspected UNIQUE non-NZ websites in tentativeNonProductSites1.json
    + The counts in ManualShortlisting2 .txt are from:
    + - Manually inspected UNIQUE non-NZ websites in tentativeNonProductSites1.json
      and made a list of sites with genuine Maori language content for each country.
    -
    - Includes 4 more sites from US with mi in URL path that do not appear to be autotranslated.
    + - Includes 4 more sites from US with mi in URL path that do not appear to be autotranslated.
      See file 7miInURLPath_exclNZ_byCountryCode.json
      */
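
The comment in this changeset says the counts are of unique domain names after the protocol and www are stripped. A minimal sketch of how such a count might be computed follows. It is not taken from the maori-lang-detection code: the function names and sample URLs are hypothetical, and lowercasing the host is an added assumption.

```python
from urllib.parse import urlsplit


def normalize_domain(url: str) -> str:
    """Return the host of `url` without protocol or a leading 'www.' (hypothetical helper)."""
    # urlsplit only recognises the host when a scheme or '//' is present,
    # so prefix bare entries such as 'www.example.org/path' with '//'.
    if "://" not in url and not url.startswith("//"):
        url = "//" + url
    # Assumption: hosts are compared case-insensitively.
    host = (urlsplit(url).hostname or "").lower()
    if host.startswith("www."):
        host = host[len("www."):]
    return host


def count_unique_domains(urls) -> int:
    """Count distinct normalized domains, skipping entries with no host."""
    domains = {normalize_domain(u) for u in urls}
    domains.discard("")
    return len(domains)


if __name__ == "__main__":
    # Hypothetical sample data, for illustration only.
    sample = [
        "http://www.example.org/mi/",
        "https://example.org/about",
        "www.example.org",
        "https://maori.example.nz/",
    ]
    print(count_unique_domains(sample))
```

With this normalization, the first three sample URLs collapse to a single domain, so only distinct sites contribute to the count.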