|
|
@33812
|
4 years |
ak19 |
Better handling of multi-line comment symbols, so I can now include …
|
|
|
@33811
|
4 years |
ak19 |
Returning to using a single variable, urlContainsLangCodeInPath, to …
|
|
|
@33810
|
4 years |
ak19 |
Bugfix: mi in url path should be checked for for each page of site, …
|
|
|
@33808
|
4 years |
ak19 |
Storing not just whether /mi(/) suffix is in path, but also whether …
|
|
|
@33805
|
4 years |
ak19 |
1. Moving the static countrycodes.json file to conf folder and updated …
|
|
|
@33801
|
4 years |
ak19 |
1. NutchTextDumpToMongoDB Added an extra field to each document in …
|
|
|
@33800
|
4 years |
ak19 |
Removed an adult site from crawled contents and added its url to …
|
|
|
@33799
|
4 years |
ak19 |
1. Adding breadcrumb for next step at end of running …
|
|
|
@33796
|
4 years |
ak19 |
Instead of a hack for US' count being too great that its histogram …
|
|
|
@33794
|
4 years |
ak19 |
Wrote the geojson map data created from the site counts per …
|
|
|
@33790
|
4 years |
ak19 |
Got the MultiPoint geojson mapdata of the country code counts working: …
|
|
|
@33778
|
4 years |
ak19 |
Made a beginning on getting the geojson map data automated. Couldn't …
|
|
|
@33698
|
4 years |
ak19 |
Links to more reading
|
|
|
@33674
|
4 years |
ak19 |
Changes to support the top 5 predicted langcodes and their confidence …
|
|
|
@33666
|
4 years |
ak19 |
Having finished sending all the crawl data to mongodb 1. Recrawled the …
|
|
|
@33657
|
4 years |
ak19 |
Some fixes after brief testing against 1/3 of the crawl. Restarted …
|
|
|
@33656
|
4 years |
ak19 |
Final minor changes before I start processing the crawls of node2.
|
|
|
@33655
|
4 years |
ak19 |
Minor change to print statement
|
|
|
@33653
|
4 years |
ak19 |
1. As suggested by Dr Bainbridge, made the code changes to use Morphia …
|
|
|
@33652
|
4 years |
ak19 |
Introducing morphia subpackage
|
|
|
@33651
|
4 years |
ak19 |
1. Bugfix: overlappingSentences works. 2. storing numSentencesInMaor
|
|
|
@33645
|
4 years |
ak19 |
Fix to 2 bugs when sending data to MongoDB: 1. overlappingSentences …
|
|
|
@33635
|
4 years |
ak19 |
Maori-language-detection doesn't use Greenstone 3 at present, it's not …
|
|
copied from gs3-extensions/maori-lang-detection/src
|
|
|
@33634
|
4 years |
ak19 |
Rewrote NutchTextDumpProcessor as NutchTextDumpToMongoDB.java, which …
|