|
|
@33880
|
4 years |
ak19 |
Write out the 5counts_tentativeNonAutotranslatedSites.json file with …
|
|
|
@33879
|
4 years |
ak19 |
Have the 2 mongodb aggregate() calls working that
|
|
|
@33878
|
4 years |
ak19 |
Better comment
|
|
|
@33877
|
4 years |
ak19 |
Reordering to have proper descending order of counts
|
|
|
@33876
|
4 years |
ak19 |
Some missteps, but have got complex collection.aggregate() working at last.
|
|
|
@33875
|
4 years |
ak19 |
Renaming 2 more files correctly
|
|
|
@33874
|
4 years |
ak19 |
Renaming 2 files correctly
|
|
|
@33873
|
4 years |
ak19 |
Beginnings of WebPageURLsListing program whose purpose Dr Bainbridge …
|
|
|
@33872
|
4 years |
ak19 |
1. Added the file containing the 255 random NZ page URLs to sample. 2. …
|
|
|
@33871
|
4 years |
ak19 |
Removed mostly duplicated older version of method but left the …
|
|
|
@33870
|
4 years |
ak19 |
Got the mongodb query working in Java in 2 different ways: the fully …
|
|
|
@33869
|
4 years |
ak19 |
First cut at the RandomURLsForDomainGenerator.java class and the …
|
|
|
@33868
|
4 years |
ak19 |
With the updated code for generating the maps from 6a and 6b manual …
|
|
|
@33867
|
4 years |
ak19 |
Moved the code handling of special case large rectangles and those …
|
|
|
@33866
|
4 years |
ak19 |
Dr Bainbridge's fix to Android mobile macronizer user (on Chrome …
|
|
|
@33865
|
4 years |
ak19 |
1. The gs3 context name changed from macronizer to macron-restoration. …
|
|
|
@33864
|
4 years |
davidb |
Changes to make the Whakatohea banner narrower
|
|
|
@33863
|
4 years |
davidb |
Script to get sample content for the DL collection
|
|
|
@33862
|
4 years |
davidb |
Change to specifying the About page text done through about.xml so it …
|
|
|
@33861
|
4 years |
davidb |
About page text done through about.xml so it can include xslt tags
|
|
|
@33860
|
4 years |
davidb |
Addition of 3 further CPAN packages, found to be needed on CentOS build
|
|
|
@33859
|
4 years |
davidb |
Additional CPAN Perl packages found to be needed when compiling up …
|
|
|
@33858
|
4 years |
ak19 |
Fixes to the code committed yesterday: correct calculation of the …
|
|
|
@33857
|
4 years |
davidb |
Next iteration of the about text
|
|
|
@33856
|
4 years |
ak19 |
Forgot to commit. Last week, Dr Bainbridge had properly cropped the …
|
|
|
@33855
|
4 years |
davidb |
Code added to detect if the CGI parameter already specifies a …
|
|
|
@33854
|
4 years |
ak19 |
Manually gone over around 150 webpages of sample size of 255 webpages …
|
|
|
@33853
|
4 years |
ak19 |
Handling map coordinates that are horizontally excessive (beyond …
|
|
|
@33852
|
4 years |
davidb |
Unused. XSL filename extension potentially causing a problem with how …
|
|
|
@33851
|
4 years |
ak19 |
Deleting faulty maps. NZ numPages inMRI and containingMRI count is …
|
|
|
@33850
|
4 years |
ak19 |
Renames before deleting faulty maps. NZ numPages inMRI and …
|
|
|
@33849
|
4 years |
ak19 |
One less Australian site as it was an infographic containing Maori …
|
|
|
@33848
|
4 years |
ak19 |
Tables of mongodb counts (1-5 table) and manual counts (6table). …
|
|
|
@33847
|
4 years |
ak19 |
indigenousblogs.com did have one page actually in Maori (an XML feed). …
|
|
|
@33846
|
4 years |
ak19 |
Cropped out the json portion
|
|
|
@33845
|
4 years |
ak19 |
Cropped out the json portion
|
|
|
@33844
|
4 years |
ak19 |
Regenerated
|
|
|
@33843
|
4 years |
ak19 |
Counting the 3 non-NZ sites that had mi in the URL path that manual …
|
|
|
@33842
|
4 years |
ak19 |
Jotted down some further paragraphs and notes of interest. Tentatively …
|
|
|
@33841
|
4 years |
ak19 |
Latest version of the flowchart of the process of getting Common Crawl …
|
|
|
@33840
|
4 years |
ak19 |
Older flowchart of the process of getting Common Crawl data into …
|
|
|
@33839
|
4 years |
ak19 |
Moving writeup text file into new folder so I can add the SVG …
|
|
|
@33838
|
4 years |
ak19 |
Updated after checking non-NZ and non-nz TLD sites with mi in URL path
|
|
|
@33837
|
4 years |
davidb |
Local notes for the site
|
|
|
@33836
|
4 years |
davidb |
Macron added
|
|
|
@33835
|
4 years |
davidb |
Supporting iframe files now located within interface area
|
|
|
@33834
|
4 years |
davidb |
Metadata shell ready for download of demonstration source content files
|
|
|
@33833
|
4 years |
davidb |
Initial collection design
|
|
|
@33832
|
4 years |
davidb |
Initial set of files for Whakatohea collections
|
|
|
@33831
|
4 years |
davidb |
Top-level folder for Whakatohea Maori Trust Board collections
|
|
|
@33830
|
4 years |
davidb |
Initial set of files for WMTB themed DL
|
|
|
@33829
|
4 years |
davidb |
Top-level folder for Whakatohea Maori Trust Board themes DL
|
|
|
@33828
|
4 years |
ak19 |
Additions and modifications to the write-up.
|
|
|
@33827
|
4 years |
davidb |
Updated text about groupConfig.xml file
|
|
|
@33826
|
4 years |
davidb |
Fix to help compiling on CentOS
|
|
|
@33825
|
4 years |
ak19 |
Beginnings of first draft of write up.
|
|
|
@33824
|
4 years |
ak19 |
More instructions and explaining the contents of the mongodb-data folder.
|
|
|
@33823
|
4 years |
ak19 |
Recommitting mongo-data folder with renamed files with numbering.
|
|
|
@33822
|
4 years |
ak19 |
Removing as I'm renaming all the files with prefixes. There are too …
|
|
|
@33821
|
4 years |
ak19 |
Manually created a shortlist of MRI sites from longer …
|
|
|
@33820
|
4 years |
ak19 |
Forgot to commit before holidays.
|
|
|
@33819
|
4 years |
davidb |
Moved to newer version of intltool. This still had a problem with …
|
|
|
@33818
|
4 years |
davidb |
Changed #include statement in gio/gdbusmessage.c to work with Ubuntu 18
|
|
|
@33817
|
4 years |
davidb |
Removal of api-key file, as no longer needed (and wasn't a good idea …
|
|
|
@33816
|
4 years |
ak19 |
Finished manually going through the sites that I couldn't easily …
|
|
|
@33815
|
4 years |
ak19 |
Removed old results from before bugfix and improvement to …
|
|
|
@33814
|
4 years |
ak19 |
Put the important mongodb queries and results into …
|
|
|
@33813
|
5 years |
ak19 |
With the bugfix from yesterday and the inclusion of http(s):mi.* …
|
|
|
@33812
|
5 years |
ak19 |
Better handling of multi-line comment symbols, so I can now include …
|
|
|
@33811
|
5 years |
ak19 |
Returning to using a single variable, urlContainsLangCodeInPath, to …
|
|
|
@33810
|
5 years |
ak19 |
Bugfix: mi in url path should be checked for on each page of site, …
|
|
|
@33809
|
5 years |
ak19 |
Some more GS_README.txt instructions. Not put the mongodb queries in …
|
|
|
@33808
|
5 years |
ak19 |
Storing not just whether /mi(/) suffix is in path, but also whether …
|
|
|
@33807
|
5 years |
ak19 |
Trying to manually go through a shortlisted set of domains to see if …
|
|
|
@33806
|
5 years |
ak19 |
More mongodb querying revealed that excluding tentative product sites …
|
|
|
@33805
|
5 years |
ak19 |
1. Moving the static countrycodes.json file to conf folder and updated …
|
|
|
@33804
|
5 years |
ak19 |
1. Updated results from mongodb querying after yesterday's …
|
|
|
@33803
|
5 years |
ak19 |
geojson mapdata and map for mongodb results on …
|
|
|
@33802
|
5 years |
ak19 |
With an extra adult site removed and with setting countrycodes that …
|
|
|
@33801
|
5 years |
ak19 |
1. NutchTextDumpToMongoDB Added an extra field to each document in …
|
|
|
@33800
|
5 years |
ak19 |
Removed an adult site from crawled contents and added its url to …
|
|
|
@33799
|
5 years |
ak19 |
1. Adding breadcrumb for next step at end of running …
|
|
|
@33798
|
5 years |
ak19 |
Adding the geojson related files related to querying mongodb for sites …
|
|
|
@33797
|
5 years |
ak19 |
Updated json and image files, and new files for when /mi(/) is in the …
|
|
|
@33796
|
5 years |
ak19 |
Instead of a hack for US' count being too great that its histogram …
|
|
|
@33795
|
5 years |
kjdon |
remove edit bar and right side bar from print view of document
|
|
|
@33794
|
5 years |
ak19 |
Wrote the geojson map data created from the site counts per …
|
|
|
@33793
|
5 years |
ak19 |
Changes for getting a running GS3 server to display collections on …
|
|
|
@33792
|
5 years |
ak19 |
Correcting spellings
|
|
|
@33791
|
5 years |
ak19 |
1. Kathy renamed the gs3interface properties filename from …
|
|
|
@33790
|
5 years |
ak19 |
Got the MultiPoint geojson mapdata of the country code counts working: …
|
|
|
@33789
|
5 years |
ak19 |
Redid the mongodb query to get the countrycode counts for all the …
|
|
|
@33788
|
5 years |
ak19 |
Adding all the jar files needed to work in Java with geojson Simple …
|
|
|
@33787
|
5 years |
ak19 |
Documented another mongodb query that I'm using, the one to produce …
|
|
|
@33786
|
5 years |
kjdon |
add google analytics, stop the outputting of fr, es, ru links as those …
|
|
|
@33785
|
5 years |
kjdon |
added google analytics
|
|
|
@33784
|
5 years |
kjdon |
only generate english versions for now as ru, fr, es haven't been …
|
|
|
@33783
|
5 years |
kjdon |
only generate english versions for now as ru, fr, es haven't been …
|
|
|
@33782
|
5 years |
kjdon |
updated README
|
|
|
@33781
|
5 years |
kjdon |
tidied up the intro
|
|
|