source: other-projects/maori-lang-detection/mongodb-data

Revision Log Mode:


Legend:

Added
Modified
Copied or renamed
Diff Rev Age Author Log Message
(edit) @33918   4 years ak19 Country codes added to each domain's URL of the manual site/domain …
(edit) @33916   4 years ak19 Updated the rest of the file after reingest
(edit) @33915   4 years ak19 Forgot to add a (manual) counts file created last week, and am now …
(edit) @33914   4 years ak19 Shortlisted just the domain sites by country into ManualShortlist2.txt …
(edit) @33913   4 years ak19 1. Adjusted table mongodb query statements to be more exact, but same …
(edit) @33907   4 years ak19 See previous commit message. This will be the file with the results …
(edit) @33895   4 years ak19 Minor rename
(edit) @33894   4 years ak19 1. Adding map, counts.json and geo-json files for 5b count of sites by …
(edit) @33893   4 years ak19 1. Left out region code column. 2. Two more sheets of work in progress …
(edit) @33892   4 years ak19 Sheets renamed and spreadsheet renamed
(edit) @33891   4 years ak19 Site level detected vs manual inspected data: working shown in file …
(edit) @33890   4 years ak19 Finished going through NZ sites listing of numPagesContainingMRI > 0 …
(edit) @33889   4 years ak19 1. Additional column: totalPagesAcrossMatchingSites. 2. Screengrab of …
(edit) @33886   4 years ak19 Minor. File rename
(edit) @33884   4 years ak19 0. Previous commit had lots of modifications, and only 2 files matched …
(edit) @33883   4 years ak19 Clarifications
(edit) @33878   4 years ak19 Better comment
(edit) @33877   4 years ak19 Reordering to have proper descending order of counts
(edit) @33875   4 years ak19 Renaming 2 more files correctly
(edit) @33874   4 years ak19 Renaming 2 files correctly
(edit) @33872   4 years ak19 1. Added the file containing the 255 random NZ page URLs to sample. 2. …
(edit) @33868   4 years ak19 With the updated code for generating the maps from 6a and 6b manual …
(edit) @33854   4 years ak19 Manually gone over around 150 webpages of sample size of 255 webpages …
(edit) @33851   4 years ak19 Deleting faulty maps. NZ numPages inMRI and containingMRI count is …
(edit) @33850   4 years ak19 Renames before deleting faulty maps. NZ numPages inMRI and …
(edit) @33848   4 years ak19 Tables of mongodb counts (1-5 table) and manual counts (6table). …
(edit) @33847   4 years ak19 indigenousblogs.com did have one page actually in Maori (an XML feed). …
(edit) @33846   4 years ak19 Cropped out the json portion
(edit) @33845   4 years ak19 Cropped out the json portion
(edit) @33844   4 years ak19 Regenerated
(add) @33823   4 years ak19 Recommitting mongo-data folder with renamed files with numbering.
Note: See TracRevisionLog for help on using the revision log.