Changeset 33815
- Timestamp:
- 2019-12-19T17:17:16+13:00
- File:
- 1 edited
--- other-projects/maori-lang-detection/hdfs-cc-work/GS_README.TXT (r33814)
+++ other-projects/maori-lang-detection/hdfs-cc-work/GS_README.TXT (r33815)
@@ -724,22 +724,13 @@
 20371
 
-
-# Number of sites with URLs containing /mi(/)
-db.getCollection('Websites').find({urlContainsLangCodeInPath:true}).count()
-X 153
-# Number of sites with URLs containing /mi(/) OR http(s)://mi.*
+# Number of sites with crawled web pages that have URLs containing /mi(/) OR http(s)://mi.*
 db.getCollection('Websites').find({urlContainsLangCodeInPath:true}).count()
 670
 
-# Number of websites that are outside NZ that contain /mi(/) in any of its sub-urls
-db.getCollection('Websites').find({urlContainsLangCodeInPath:true, geoLocationCountryCode: {$ne : "NZ"} }).count()
-X 147
-# Number of websites that are outside NZ that contain /mi(/) OR http(s)://mi.* in any of its sub-urls
+# Number of websites that are outside NZ that contain /mi(/) OR http(s)://mi.*
+# in any of its crawled webpage urls
 db.getCollection('Websites').find({urlContainsLangCodeInPath:true, geoLocationCountryCode: {$ne : "NZ"} }).count()
 656
 
-# 6 sites with URLs containing /mi(/) that are in NZ
-db.getCollection('Websites').find({urlContainsLangCodeInPath:true, geoLocationCountryCode: "NZ"}).count()
-X 6
 # 14 sites with URLs containing /mi(/) OR http(s)://mi.* that are in NZ
 14