#
# ChangeLog for other-projects/maori-lang-detection/mongodb-data
#
# Generated by Trac 1.4.2
# 2024-06-02T19:05:54+12:00

Wed, 27 May 2020 07:43:03 GMT ak19 [34127]
    * other-projects/maori-lang-detection/mongodb-data/pieChart3c_screenshot_SimplerCrawledWebPages_EmptyVsInMongoDB.png (moved)

    Spelling correction in filename: screeMshot to screeNshot

Thu, 21 May 2020 05:47:46 GMT ak19 [34120]
    * other-projects/maori-lang-detection/mongodb-data/random260.csv (added)

    CSV version of .ods file, so openoffice isn't required

Mon, 23 Mar 2020 04:04:55 GMT ak19 [34097]
    * other-projects/maori-lang-detection/mongodb-data/InfoOnEmptyPagesNotInMongoDB.ods (added)

    Open office version of similarly named spreadsheet, just with columns ...

Thu, 19 Mar 2020 03:40:20 GMT ak19 [34089]
    * other-projects/maori-lang-detection/mongodb-data/googlescholar.txt (added)
    * other-projects/maori-lang-detection/mongodb-data/piechart_data.txt (modified)

    So far accumulated URLs to docs on Google scholar about or somewhat ...

Tue, 10 Mar 2020 07:45:18 GMT ak19 [34011]
    * other-projects/maori-lang-detection/mongodb-data/pieChart4a_sitesPreparedForCrawling.png (added)
    * other-projects/maori-lang-detection/mongodb-data/pieChart4b_sitesPreparedForCrawling.svg (added)
    * other-projects/maori-lang-detection/mongodb-data/pieChart4c_screenshotSitesPreparedForCrawling.png (added)
    * other-projects/maori-lang-detection/mongodb-data/pieChart5a_sitesPreparedForCrawling.png (added)
    * other-projects/maori-lang-detection/mongodb-data/pieChart5b_sitesPreparedForCrawling.svg (added)
    * other-projects/maori-lang-detection/mongodb-data/pieChart5c_screenshotSitesPreparedForCrawling.png (added)
    * other-projects/maori-lang-detection/mongodb-data/piechart_data.txt (modified)
    * other-projects/maori-lang-detection/mongodb-data/piechart_data2.txt (modified)

    Piechart data for sites prepared for crawling and the piecharts for these

Tue, 10 Mar 2020 06:56:01 GMT ak19 [34007]
    * other-projects/maori-lang-detection/mongodb-data/pieChart2a_CrawledWebPages_EmptyVsInMongoDB.png (added)
    * other-projects/maori-lang-detection/mongodb-data/pieChart2b_CrawledWebPages_EmptyVsInMongoDB.svg (added)
    * other-projects/maori-lang-detection/mongodb-data/pieChart3a_SimplerCrawledWebPages_EmptyVsInMongoDB.png (added)
    * other-projects/maori-lang-detection/mongodb-data/pieChart3b_SimplerCrawledWebPages_EmptyVsInMongoDB.svg (added)
    * other-projects/maori-lang-detection/mongodb-data/pieChart3c_screemshot_SimplerCrawledWebPages_EmptyVsInMongoDB.png (added)
    * other-projects/maori-lang-detection/mongodb-data/piechart_data.txt (modified)
    * other-projects/maori-lang-detection/mongodb-data/piechart_data2.txt (modified)

    Prepared more data for the piecharts. This time for empty web pages ...

Tue, 10 Mar 2020 05:51:05 GMT ak19 [34006]
    * other-projects/maori-lang-detection/mongodb-data/pieChart01a_seedURLsForCrawling.png (added)
    * other-projects/maori-lang-detection/mongodb-data/pieChart01b_obtainingSeedURLs.png (added)
    * other-projects/maori-lang-detection/mongodb-data/pieChart01c_obtainingSeedURLs.svg (added)
    * other-projects/maori-lang-detection/mongodb-data/piechart_data.txt (modified)
    * other-projects/maori-lang-detection/mongodb-data/piechart_data2.txt (added)

    Committing more data I've collected for generating pie charts and the ...

Tue, 10 Mar 2020 04:27:07 GMT ak19 [34004]
    * other-projects/maori-lang-detection/mongodb-data/InfoOnEmptyPagesNotInMongoDB.csv (moved)
    * other-projects/maori-lang-detection/mongodb-data/piechart_data.txt (modified)

    Renaming csv file to have csv extension

Tue, 10 Mar 2020 04:26:45 GMT ak19 [34003]
    * other-projects/maori-lang-detection/mongodb-data/InfoOnEmptyPagesNotInMongoDB.txt (modified)

    Redid the file with info on empty URL web pages as a csv file with ...

Mon, 09 Mar 2020 05:56:00 GMT ak19 [34001]
    * other-projects/maori-lang-detection/mongodb-data/piechart_data.txt (modified)

    Tentative total urls from common crawl 12 month crawl data.

Mon, 09 Mar 2020 04:34:10 GMT ak19 [33999]
    * other-projects/maori-lang-detection/mongodb-data/piechart_data.txt (modified)

    Common crawl 12 month urls and CC provided stats

Fri, 28 Feb 2020 09:08:08 GMT ak19 [33987]
    * other-projects/maori-lang-detection/mongodb-data/InfoOnEmptyPagesNotInMongoDB.txt (added)

    Output of re-running NutchTextDumpToMongoDB to print out which web ...

Fri, 28 Feb 2020 09:07:29 GMT ak19 [33986]
    * other-projects/maori-lang-detection/mongodb-data/piechart_data.txt (modified)

    Dr Bainbridge investigated the original data set more

Thu, 27 Feb 2020 08:49:00 GMT ak19 [33985]
    * other-projects/maori-lang-detection/mongodb-data/piechart_data.txt (added)

    Data to back the piechart I need to make that will illustrate how we ...

Wed, 26 Feb 2020 08:11:58 GMT ak19 [33980]
    * other-projects/maori-lang-detection/mongodb-data/6counts_sitesWithPagesContainingMRI_manualShortlist.json (modified)

    Additional comments

Wed, 26 Feb 2020 08:00:38 GMT ak19 [33979]
    * other-projects/maori-lang-detection/mongodb-data/6counts_sitesWithPagesContainingMRI_manualShortlist.json (modified)

    Clearly stating that counts are of unique domains

Wed, 26 Feb 2020 05:37:08 GMT ak19 [33977]
    * other-projects/maori-lang-detection/mongodb-data/random260_results.txt (modified)

    Added something on precision vs recall being applicable to our ...

Wed, 26 Feb 2020 05:28:09 GMT ak19 [33976]
    * other-projects/maori-lang-detection/mongodb-data/random260_results.txt (modified)

    Adding in what I could remember of Dr Bainbridge's statement about ...

Fri, 21 Feb 2020 08:00:55 GMT ak19 [33966]
    * other-projects/maori-lang-detection/mongodb-data/random260.ods (added)
    * other-projects/maori-lang-detection/mongodb-data/random260_manualList_globalDomains_whereAPageContainsMRI.txt (modified)
    * other-projects/maori-lang-detection/mongodb-data/random260_results.txt (added)

    Added the origSequence and basicDomain columns to the random 260 web ...

Fri, 21 Feb 2020 06:57:38 GMT ak19 [33964]
    * other-projects/maori-lang-detection/mongodb-data/random260_manualList_globalDomains_whereAPageContainsMRI.txt (modified)

    2 records were missing a value for the qualityLevel column.

Thu, 20 Feb 2020 09:07:20 GMT ak19 [33962]
    * other-projects/maori-lang-detection/mongodb-data/random260_manualList_globalDomains_whereAPageContainsMRI.txt (modified)

    2 fields changed, as one was missed out and the other incorrectly ...

Thu, 20 Feb 2020 07:22:38 GMT ak19 [33960]
    * other-projects/maori-lang-detection/mongodb-data/random260_manualList_globalDomains_whereAPageContainsMRI.txt (modified)

    Reviewed all the random sample web page URLs marked ...

Tue, 18 Feb 2020 10:33:29 GMT ak19 [33951]
    * other-projects/maori-lang-detection/mongodb-data/random260_manualList_globalDomains_whereAPageContainsMRI.txt (modified)

    Reviewed the qualityLevel column where LITTLE_TEXT was assigned.

Tue, 18 Feb 2020 10:28:55 GMT ak19 [33950]
    * other-projects/maori-lang-detection/mongodb-data/random260_manualList_globalDomains_whereAPageContainsMRI.txt (modified)

    Reviewed the qualityLevel column where MIXED_TEXT was assigned.

Tue, 18 Feb 2020 10:22:53 GMT ak19 [33949]
    * other-projects/maori-lang-detection/mongodb-data/random260_manualList_globalDomains_whereAPageContainsMRI.txt (modified)

    Reviewed the qualityLevel column where NAV was assigned.

Tue, 18 Feb 2020 09:56:44 GMT ak19 [33948]
    * other-projects/maori-lang-detection/mongodb-data/random260_manualList_globalDomains_whereAPageContainsMRI.txt (modified)
    * other-projects/maori-lang-detection/src/org/greenstone/atea/ManualURLInspection.java (modified)

    Reviewed the random sampled web page URLs marked as ...

Tue, 18 Feb 2020 09:07:33 GMT ak19 [33947]
    * other-projects/maori-lang-detection/mongodb-data/random260_manualList_globalDomains_whereAPageContainsMRI.txt (modified)

    Some more questionmarked field values assigned.

Tue, 18 Feb 2020 08:48:14 GMT ak19 [33945]
    * other-projects/maori-lang-detection/mongodb-data/random260_manualList_globalDomains_whereAPageContainsMRI.txt (modified)

    Added a 4th column for all 260 sample web page URLs and have used the ...

Tue, 18 Feb 2020 03:44:21 GMT ak19 [33944]
    * other-projects/maori-lang-detection/mongodb-data/random260_manualList_globalDomains_whereAPageContainsMRI.txt (modified)

    Added the isReallyInMRI column after manually inspecting the ...

Mon, 17 Feb 2020 09:16:40 GMT ak19 [33940]
    * other-projects/maori-lang-detection/lib/commons-csv-1.7.jar (deleted)
    * other-projects/maori-lang-detection/lib/commons-csv-1.8.jar (added)
    * other-projects/maori-lang-detection/mongodb-data/random260_manualList_globalDomains_whereAPageContainsMRI.txt (modified)
    * other-projects/maori-lang-detection/src/org/greenstone/atea/ManualURLInspection.java (added)
    * other-projects/maori-lang-detection/src/org/greenstone/atea/MongoDBQueryer.java (modified)
    * other-projects/maori-lang-detection/src/org/greenstone/atea/SummaryTool.java (modified)

    1. In order to make it easier to do the manual work of inspecting 260 ...

Mon, 17 Feb 2020 03:22:08 GMT ak19 [33939]
    * other-projects/maori-lang-detection/mongodb-data/isMRI_full_manualList_globalDomains_whereAPageContainsMRI.txt (added)
    * other-projects/maori-lang-detection/mongodb-data/random255_domainsNZ_IsMRI.txt (deleted)
    * other-projects/maori-lang-detection/mongodb-data/random260_manualList_globalDomains_whereAPageContainsMRI.txt (added)

    1. Old random samples file doesn't apply as we're not sampling by ...

Mon, 17 Feb 2020 03:06:40 GMT ak19 [33937]
    * other-projects/maori-lang-detection/mongodb-data/6counts_sitesWithPagesContainingMRI_manualShortlist.json (added)

    New counts of manual sites after reingesting into MongoDB. Forgot to ...

Mon, 17 Feb 2020 03:05:55 GMT ak19 [33936]
    * other-projects/maori-lang-detection/mongodb-data/6counts_sitesWithPagesContainingMRI_manualShortlist.jsonOLD (moved)
    * other-projects/maori-lang-detection/mongodb-data/ManualShortlisting2_afterMongoDBReingest.txt (modified)

    Renaming old file to place with new counts after reingesting into ...

Thu, 13 Feb 2020 06:34:14 GMT ak19 [33918]
    * other-projects/maori-lang-detection/mongodb-data/ManualShortlisting2_afterMongoDBReingest.txt (modified)
    * other-projects/maori-lang-detection/mongodb-data/manualList_globalDomains_whereAPageContainsMRI.txt (modified)

    Country codes added to each domain's URL of the manual site/domain ...

Thu, 13 Feb 2020 04:42:11 GMT ak19 [33916]
    * other-projects/maori-lang-detection/mongodb-data/ManualShortlisting2_afterMongoDBReingest.txt (modified)

    Updated the rest of the file after reingest

Thu, 13 Feb 2020 04:12:06 GMT ak19 [33915]
    * other-projects/maori-lang-detection/mongodb-data/6counts_nonProductSites1_manualShortlist.json (added)
    * other-projects/maori-lang-detection/mongodb-data/ManualShortlisting2_afterMongoDBReingest.txt (moved)

    Forgot to add a (manual) counts file created last week, and am now ...

Thu, 13 Feb 2020 04:09:07 GMT ak19 [33914]
    * other-projects/maori-lang-detection/MoreReading/mongodb.txt (modified)
    * other-projects/maori-lang-detection/mongodb-data/ManualShortlisting.txt (modified)
    * other-projects/maori-lang-detection/mongodb-data/ManualShortlisting2.txt (modified)
    * other-projects/maori-lang-detection/mongodb-data/manualList_globalDomains_whereAPageContainsMRI.txt (added)

    Shortlisted just the domain sites by country into ...

Wed, 12 Feb 2020 08:27:02 GMT ak19 [33913]
    * other-projects/maori-lang-detection/MoreReading/mongodb.txt (modified)
    * other-projects/maori-lang-detection/hdfs-cc-work/GS_README.TXT (modified)
    * other-projects/maori-lang-detection/mongodb-data/tables.txt (modified)
    * other-projects/maori-lang-detection/src/org/greenstone/atea/MongoDBQueryer.java (modified)

    1. Adjusted table mongodb query statements to be more exact, but same ...

Wed, 05 Feb 2020 10:38:57 GMT ak19 [33907]
    * other-projects/maori-lang-detection/mongodb-data/ManualShortlisting2.txt (added)

    See previous commit message. This will be the file with the results ...

Mon, 03 Feb 2020 10:20:53 GMT ak19 [33895]
    * other-projects/maori-lang-detection/mongodb-data/5b_counts_containsMRI_groupedByNZorOverseasNoFilter.json (moved)

    Minor rename

Mon, 03 Feb 2020 10:20:33 GMT ak19 [33894]
    * other-projects/maori-lang-detection/mongodb-data/5b_count_containsMRI_groupedByNZorOverseasNoFilter.json (added)
    * other-projects/maori-lang-detection/mongodb-data/5b_geojson-features_containsMRI_groupedByNZorOverseasNoFilter.json (added)
    * other-projects/maori-lang-detection/mongodb-data/5b_map_containsMRI_groupedByNZorOverseasNoFilter.png (added)
    * other-projects/maori-lang-detection/mongodb-data/5b_multipoint_containsMRI_groupedByNZorOverseasNoFilter.json (added)
    * other-projects/maori-lang-detection/mongodb-data/6counts_sitesWithPagesContainingMRI_manualShortlist.json (moved)
    * other-projects/maori-lang-detection/mongodb-data/6geojson-features_sitesWithPagesContainingMRI_manualShortlist.json (moved)
    * other-projects/maori-lang-detection/mongodb-data/6map_sitesWithPagesContainingMRI_manualShortlist.png (moved)
    * other-projects/maori-lang-detection/mongodb-data/6multipoint_sitesWithPagesContainingMRI_manualShortlist.json (moved)
    * other-projects/maori-lang-detection/mongodb-data/tables.txt (modified)

    1. Adding map, counts.json and geo-json files for 5b count of sites ...

Mon, 03 Feb 2020 09:41:47 GMT ak19 [33893]
    * other-projects/maori-lang-detection/mongodb-data/8TableOfNumDetectedVsManualSITESWithMRI.ods (modified)
    * other-projects/maori-lang-detection/mongodb-data/8table_siteCountSummary.png (modified)

    1. Left out region code column. 2. Two more sheets of work in ...

Mon, 03 Feb 2020 09:28:44 GMT ak19 [33892]
    * other-projects/maori-lang-detection/mongodb-data/8TableOfNumDetectedVsManualSITESWithMRI.ods (moved)

    Sheets renamed and spreadsheet renamed

Mon, 03 Feb 2020 09:27:37 GMT ak19 [33891]
    * other-projects/maori-lang-detection/mongodb-data/6table_nonProductSites1_manualShortlist.json (modified)
    * other-projects/maori-lang-detection/mongodb-data/8table_siteCountSummary.png (added)
    * other-projects/maori-lang-detection/mongodb-data/ManualShortlisting.txt (added)
    * other-projects/maori-lang-detection/mongodb-data/TableOfNumDetectedVsManualSITESWithMRI.ods (added)

    Site level detected vs manual inspected data: working shown in file ...

Mon, 03 Feb 2020 07:31:33 GMT ak19 [33890]
    * other-projects/maori-lang-detection/mongodb-data/6table_nonProductSites1_manualShortlist.json (modified)

    Finished going through NZ sites listing of numPagesContainingMRI > 0 ...

Mon, 03 Feb 2020 02:48:40 GMT ak19 [33889]
    * other-projects/maori-lang-detection/mongodb-data/1a_table_miInUrlPath.csv (modified)
    * other-projects/maori-lang-detection/mongodb-data/1a_table_miInUrlPath.png (added)
    * other-projects/maori-lang-detection/mongodb-data/1b_table_noMiInUrlPath.csv (modified)
    * other-projects/maori-lang-detection/mongodb-data/1b_table_noMiInUrlPath.png (added)
    * other-projects/maori-lang-detection/mongodb-data/1table_allCrawledSites.csv (modified)
    * other-projects/maori-lang-detection/mongodb-data/1table_allCrawledSites.png (added)
    * other-projects/maori-lang-detection/mongodb-data/2table_sitesWithPagesInMRI.csv (modified)
    * other-projects/maori-lang-detection/mongodb-data/2table_sitesWithPagesInMRI.png (added)
    * other-projects/maori-lang-detection/mongodb-data/3table_sitesWithPagesContainingMRI.csv (modified)
    * other-projects/maori-lang-detection/mongodb-data/3table_sitesWithPagesContainingMRI.png (added)
    * other-projects/maori-lang-detection/mongodb-data/4table_tentativeNonProductSites.csv (modified)
    * other-projects/maori-lang-detection/mongodb-data/4table_tentativeNonProductSites.png (added)
    * other-projects/maori-lang-detection/mongodb-data/5b_table_containsMRI_groupedByNZorOverseasNoFilter.csv (added)
    * other-projects/maori-lang-detection/mongodb-data/5b_table_containsMRI_groupedByNZorOverseasNoFilter.png (added)
    * other-projects/maori-lang-detection/mongodb-data/5table_tentativeNonProductSites1.csv (modified)
    * other-projects/maori-lang-detection/mongodb-data/5table_tentativeNonProductSites1.png (added)
    * other-projects/maori-lang-detection/mongodb-data/tables.txt (modified)

    1. Additional column: totalPagesAcrossMatchingSites. 2. Screengrab of ...

Fri, 31 Jan 2020 10:17:47 GMT ak19 [33886]
    * other-projects/maori-lang-detection/mongodb-data/2table_sitesWithPagesInMRI.csv (moved)

    Minor. File rename

Fri, 31 Jan 2020 09:21:40 GMT ak19 [33884]
    * other-projects/maori-lang-detection/mongodb-data/6table_nonProductSites1_manualShortlist.json (modified)
    * other-projects/maori-lang-detection/src/org/greenstone/atea/WebPageURLsListing.java (modified)

    0. Previous commit had lots of modifications, and only 2 files ...

Fri, 31 Jan 2020 08:50:34 GMT ak19 [33883]
    * other-projects/maori-lang-detection/mongodb-data/5table_tentativeNonProductSites1.csv (modified)
    * other-projects/maori-lang-detection/mongodb-data/6table_nonProductSites1_manualShortlist.json (modified)
    * other-projects/maori-lang-detection/src/org/greenstone/atea/MongoDBAccess.java (modified)
    * other-projects/maori-lang-detection/src/org/greenstone/atea/RandomURLsForDomainGenerator.java (modified)
    * other-projects/maori-lang-detection/src/org/greenstone/atea/WebPageURLsListing.java (modified)

    Clarifications

Thu, 30 Jan 2020 07:18:09 GMT ak19 [33878]
    * other-projects/maori-lang-detection/mongodb-data/tables.txt (modified)

    Better comment

Thu, 30 Jan 2020 07:07:59 GMT ak19 [33877]
    * other-projects/maori-lang-detection/mongodb-data/5counts_tentativeNonProductSites1.json (modified)

    Reordering to have proper descending order of counts

Wed, 29 Jan 2020 06:18:29 GMT ak19 [33875]
    * other-projects/maori-lang-detection/mongodb-data/6b_geojson-features_manualShortlist_numPagesContainingMRI.json (moved)
    * other-projects/maori-lang-detection/mongodb-data/6b_multipoint_manualShortlist_numPagesContainingMRI.json (moved)

    Renaming 2 more files correctly

Wed, 29 Jan 2020 06:15:29 GMT ak19 [33874]
    * other-projects/maori-lang-detection/mongodb-data/6a_geojson-features_manualShortlist_numPagesInMRI.json (moved)
    * other-projects/maori-lang-detection/mongodb-data/6a_multipoint_manualShortlist_numPagesInMRI.json (moved)

    Renaming 2 files correctly

Fri, 24 Jan 2020 08:44:04 GMT ak19 [33872]
    * other-projects/maori-lang-detection/mongodb-data/4counts_tentativeNonProductSites.json (modified)
    * other-projects/maori-lang-detection/mongodb-data/5counts_tentativeNonProductSites1.json (modified)
    * other-projects/maori-lang-detection/mongodb-data/6table_nonProductSites1_manualShortlist.json (modified)
    * other-projects/maori-lang-detection/mongodb-data/random255_domainsNZ_IsMRI.txt (added)
    * other-projects/maori-lang-detection/mongodb-data/tables.txt (modified)

    1. Added the file containing the 255 random NZ page URLs to sample. ...

Thu, 23 Jan 2020 08:16:44 GMT ak19 [33868]
    * other-projects/maori-lang-detection/mongodb-data/6a_counts_geojson-features_manualShortlist_numPagesInMRI.json (added)
    * other-projects/maori-lang-detection/mongodb-data/6a_counts_multipoint_manualShortlist_numPagesInMRI.json (added)
    * other-projects/maori-lang-detection/mongodb-data/6a_map_numPagesInMRI_fromManualInspectedSites.png (added)
    * other-projects/maori-lang-detection/mongodb-data/6b_counts_geojson-features_manualShortlist_numPagesContainingMRI.json (added)
    * other-projects/maori-lang-detection/mongodb-data/6b_counts_multipoint_manualShortlist_numPagesContainingMRI.json (added)
    * other-projects/maori-lang-detection/mongodb-data/6b_map_numPagesContainingMRI_fromManualInspectedSites.png (added)
    * other-projects/maori-lang-detection/mongodb-data/6table_nonProductSites1_manualShortlist.json (modified)

    With the updated code for generating the maps from 6a and 6b manual ...

Tue, 21 Jan 2020 09:01:07 GMT ak19 [33854]
    * other-projects/maori-lang-detection/mongodb-data/6table_nonProductSites1_manualShortlist.json (modified)

    Manually gone over around 150 webpages of sample size of 255 webpages ...

Fri, 17 Jan 2020 09:38:24 GMT ak19 [33851]
    * other-projects/maori-lang-detection/mongodb-data/6a_map_manuallyInspected_numPagesInMRI.png (deleted)
    * other-projects/maori-lang-detection/mongodb-data/6b_map_manuallyInspected_numPagesContainingMRI.png (deleted)

    Deleting faulty maps. NZ numPages inMRI and containingMRI count is ...

Fri, 17 Jan 2020 09:38:00 GMT ak19 [33850]
    * other-projects/maori-lang-detection/mongodb-data/6a_map_manuallyInspected_numPagesInMRI.png (moved)
    * other-projects/maori-lang-detection/mongodb-data/6b_map_manuallyInspected_numPagesContainingMRI.png (moved)

    Renames before deleting faulty maps. NZ numPages inMRI and ...

Fri, 17 Jan 2020 09:21:14 GMT ak19 [33848]
    * other-projects/maori-lang-detection/mongodb-data/1a_counts_miInUrlPath.json (modified)
    * other-projects/maori-lang-detection/mongodb-data/1a_table_miInUrlPath.csv (added)
    * other-projects/maori-lang-detection/mongodb-data/1b_counts_noMiInUrlPath.json (modified)
    * other-projects/maori-lang-detection/mongodb-data/1b_table_noMiInUrlPath.csv (added)
    * other-projects/maori-lang-detection/mongodb-data/1table_allCrawledSites.csv (added)
    * other-projects/maori-lang-detection/mongodb-data/2table__sitesWithPagesInMRI.csv (added)
    * other-projects/maori-lang-detection/mongodb-data/3table_sitesWithPagesContainingMRI.csv (added)
    * other-projects/maori-lang-detection/mongodb-data/4table_tentativeNonProductSites.csv (added)
    * other-projects/maori-lang-detection/mongodb-data/5table_tentativeNonProductSites1.csv (added)
    * other-projects/maori-lang-detection/mongodb-data/6a_counts_manualShortlist_numPagesInMRI.json (added)
    * other-projects/maori-lang-detection/mongodb-data/6a_geojson-features_manualShortlist_numPagesInMRI.json (added)
    * other-projects/maori-lang-detection/mongodb-data/6a_manuallyInspected_numPagesInMRI.png (added)
    * other-projects/maori-lang-detection/mongodb-data/6a_multipoint_manualShortlist_numPagesInMRI.json (added)
    * other-projects/maori-lang-detection/mongodb-data/6b_counts_manualShortlist_numPagesContainingMRI.json (added)
    * other-projects/maori-lang-detection/mongodb-data/6b_geojson-features_manualShortlist_numPagesContainingMRI.json (added)
    * other-projects/maori-lang-detection/mongodb-data/6b_manuallyInspected_numPagesContainingMRI.png (added)
    * other-projects/maori-lang-detection/mongodb-data/6b_multipoint_manualShortlist_numPagesContainingMRI.json (added)
    * other-projects/maori-lang-detection/mongodb-data/6counts_nonProductSites1_manualShortlist.json (modified)
    * other-projects/maori-lang-detection/mongodb-data/6geojson-features_nonProductSites1_manualShortlist.json (modified)
    * other-projects/maori-lang-detection/mongodb-data/6map_exclAutotranslatedSites1_manualShortlist.png (modified)
    * other-projects/maori-lang-detection/mongodb-data/6table_nonProductSites1_manualShortlist.json (added)
    * other-projects/maori-lang-detection/mongodb-data/tables.txt (added)

    Tables of mongodb counts (1-5 table) and manual counts (6table). ...

Fri, 17 Jan 2020 06:32:16 GMT ak19 [33847]
    * other-projects/maori-lang-detection/MoreReading/mongodb.txt (modified)
    * other-projects/maori-lang-detection/mongodb-data/6counts_nonProductSites1_manualShortlist.json (modified)
    * other-projects/maori-lang-detection/mongodb-data/6geojson-features_nonProductSites1_manualShortlist.json (modified)
    * other-projects/maori-lang-detection/mongodb-data/6map_exclAutotranslatedSites1_manualShortlist.png (modified)

    indigenousblogs.com did have one page actually in Maori (an XML ...

Fri, 17 Jan 2020 03:49:05 GMT ak19 [33846]
    * other-projects/maori-lang-detection/mongodb-data/1map_allCrawledSites.png (modified)
    * other-projects/maori-lang-detection/mongodb-data/2map_sitesWithPagesInMRI.png (modified)
    * other-projects/maori-lang-detection/mongodb-data/3map_sitesWithPagesContainingMRI.png (modified)
    * other-projects/maori-lang-detection/mongodb-data/4map_exclTentativeAutotranslatedSites.png (modified)
    * other-projects/maori-lang-detection/mongodb-data/5map_exclTentativeAutotranslatedSites1.png (modified)
    * other-projects/maori-lang-detection/mongodb-data/6map_exclAutotranslatedSites1_manualShortlist.png (modified)

    Cropped out the json portion

Fri, 17 Jan 2020 03:34:11 GMT ak19 [33845]
    * other-projects/maori-lang-detection/mongodb-data/6map_exclAutotranslatedSites1_manualShortlist.png (modified)

    Cropped out the json portion

Fri, 17 Jan 2020 03:33:24 GMT ak19 [33844]
    * other-projects/maori-lang-detection/mongodb-data/6counts_nonProductSites1_manualShortlist.json (modified)
    * other-projects/maori-lang-detection/mongodb-data/6geojson-features_nonProductSites1_manualShortlist.json (modified)
    * other-projects/maori-lang-detection/mongodb-data/6map_exclAutotranslatedSites1_manualShortlist.png (modified)
    * other-projects/maori-lang-detection/mongodb-data/7miInURLPath_exclNZ_byCountryCode.json (added)

    Regenerated

Mon, 13 Jan 2020 06:45:21 GMT ak19 [33823]
    * other-projects/maori-lang-detection/MoreReading/mongodb.txt (modified)
    * other-projects/maori-lang-detection/conf/url-blacklist-filter.txt (modified)
    * other-projects/maori-lang-detection/mongodb-data (added)
    * other-projects/maori-lang-detection/mongodb-data/1a_counts_miInUrlPath.json (added)
    * other-projects/maori-lang-detection/mongodb-data/1a_geojson-features_miInUrlPath.json (added)
    * other-projects/maori-lang-detection/mongodb-data/1a_multipoint_miInUrlPath.json (added)
    * other-projects/maori-lang-detection/mongodb-data/1b_counts_noMiInUrlPath.json (added)
    * other-projects/maori-lang-detection/mongodb-data/1b_geojson-features_noMiInUrlPath.json (added)
    * other-projects/maori-lang-detection/mongodb-data/1b_multipoint_noMiInUrlPath.json (added)
    * other-projects/maori-lang-detection/mongodb-data/1counts_allCrawledSites.json (added)
    * other-projects/maori-lang-detection/mongodb-data/1geojson-features_allCrawledSites.json (added)
    * other-projects/maori-lang-detection/mongodb-data/1map_allCrawledSites.png (added)
    * other-projects/maori-lang-detection/mongodb-data/1multipoint_allCrawledSites.json (added)
    * other-projects/maori-lang-detection/mongodb-data/2counts_sitesWithPagesInMRI.json (added)
    * other-projects/maori-lang-detection/mongodb-data/2geojson-features_sitesWithPagesInMRI.json (added)
    * other-projects/maori-lang-detection/mongodb-data/2map_sitesWithPagesInMRI.png (added)
    * other-projects/maori-lang-detection/mongodb-data/2multipoint_sitesWithPagesInMRI.json (added)
    * other-projects/maori-lang-detection/mongodb-data/3counts_sitesWithPagesContainingMRI.json (added)
    * other-projects/maori-lang-detection/mongodb-data/3geojson-features_sitesWithPagesContainingMRI.json (added)
    * other-projects/maori-lang-detection/mongodb-data/3map_sitesWithPagesContainingMRI.png (added)
    * other-projects/maori-lang-detection/mongodb-data/3multipoint_sitesWithPagesContainingMRI.json (added)
    * other-projects/maori-lang-detection/mongodb-data/4counts_tentativeNonProductSites.json (added)
    * other-projects/maori-lang-detection/mongodb-data/4geojson-features_tentativeNonProductSites.json (added)
    * other-projects/maori-lang-detection/mongodb-data/4map_exclTentativeAutotranslatedSites.png (added)
    * other-projects/maori-lang-detection/mongodb-data/4multipoint_tentativeNonProductSites.json (added)
    * other-projects/maori-lang-detection/mongodb-data/5counts_tentativeNonProductSites1.json (added)
    * other-projects/maori-lang-detection/mongodb-data/5geojson-features_tentativeNonProductSites1.json (added)
    * other-projects/maori-lang-detection/mongodb-data/5map_exclTentativeAutotranslatedSites1.png (added)
    * other-projects/maori-lang-detection/mongodb-data/5multipoint_tentativeNonProductSites1.json (added)
    * other-projects/maori-lang-detection/mongodb-data/6counts_nonProductSites1_manualShortlist.json (added)
    * other-projects/maori-lang-detection/mongodb-data/6geojson-features_nonProductSites1_manualShortlist.json (added)
    * other-projects/maori-lang-detection/mongodb-data/6map_exclAutotranslatedSites1_manualShortlist.png (added)
    * other-projects/maori-lang-detection/mongodb-data/6multipoint_nonProductSites1_manualShortlist.json (added)

    Recommitting mongo-data folder with renamed files with numbering.