source: other-projects/maori-lang-detection

Diff Rev Age Author Log Message
(edit) @33802   4 years ak19 With an extra adult site removed and with setting countrycodes that …
(edit) @33801   4 years ak19 1. NutchTextDumpToMongoDB Added an extra field to each document in …
(edit) @33800   4 years ak19 Removed an adult site from crawled contents and added its url to …
(edit) @33799   4 years ak19 1. Adding breadcrumb for next step at end of running …
(edit) @33798   4 years ak19 Adding the geojson related files related to querying mongodb for sites …
(edit) @33797   4 years ak19 Updated json and images files, and new files for when /mi(/) is in the …
(edit) @33796   4 years ak19 Instead of a hack for US' count being too great that its histogram …
(edit) @33794   4 years ak19 Wrote the geojson map data created from the site counts per …
(edit) @33790   4 years ak19 Got the MultiPoint geojson mapdata of the country code counts working: …
(edit) @33789   4 years ak19 Redid the mongodb query to get the countrycode counts for all the …
(edit) @33788   4 years ak19 Adding all the jar files needed to work in Java with geojson Simple …
(edit) @33787   4 years ak19 Documented another mongodb query that I'm using, the one to produce …
(edit) @33778   4 years ak19 Made a beginning on getting the geojson map data automated. Couldn't …
(edit) @33722   4 years ak19 Adding in additional instructions in mongodb.txt, before I forgot how …
(edit) @33710   5 years ak19 Working queries and map coords for geojson.tools (ironically, Lat and …
(edit) @33698   5 years ak19 Links to more reading
(edit) @33675   5 years ak19 Committing the newer query results (but from before today's …
(edit) @33674   5 years ak19 Changes to support the top 5 predicted langcodes and their confidence …
(edit) @33666   5 years ak19 Having finished sending all the crawl data to mongodb 1. Recrawled the …
(edit) @33657   5 years ak19 Some fixes after brief testing against 1/3 of the crawl. Restarted …
(edit) @33656   5 years ak19 Final minor changes before I start processing the crawls of node2.
(edit) @33655   5 years ak19 Minor change to print statement
(edit) @33654   5 years ak19 Removing jar file that wasn't used after all.
(edit) @33653   5 years ak19 1. As suggested by Dr Bainbridge, made the code changes to use Morphia …
(edit) @33652   5 years ak19 Introducing morphia subpackage
(edit) @33651   5 years ak19 1. Bugfix: overlappingSentences works. 2. storing numSentencesInMaor
(edit) @33646   5 years ak19 Saving the mongodb queries and learning links that Dr Bainbridge found …
(edit) @33645   5 years ak19 Fix to 2 bugs when sending data to MongoDB: 1. overlappingSentences …
(edit) @33644   5 years ak19 Just committing the growing mongodb.txt file with links and …
(edit) @33643   5 years ak19 Brought the template log4j.properties.in back up to speed. I forgot it …
(edit) @33642   5 years ak19 Forgot to commit the java driver for mongodb when I committed the Java …
(copy) @33635   5 years ak19 Maori-language-detection doesn't use Greenstone 3 at present, it's not …
copied from gs3-extensions/maori-lang-detection
(edit) @33634   5 years ak19 Rewrote NutchTextDumpProcessor as NutchTextDumpToMongoDB.java, which …