|
|
@33935
|
4 years |
davidb |
Additional check added into get-isis target
|
|
|
@33934
|
4 years |
davidb |
Removal of static code block calling ancient/deprecated static …
|
|
|
@33933
|
4 years |
davidb |
Changed 8-spaces to tag chars in Makefile.in. Original problem caused …
|
|
|
@33932
|
4 years |
davidb |
Commented out Java version warning message, as it presents as …
|
|
|
@33931
|
4 years |
davidb |
Two changes to setup file. The first was to move the test for ant to …
|
|
|
@33930
|
4 years |
davidb |
Code used to assume that major number was a single digit, as in 1.6 or …
|
|
|
@33929
|
4 years |
davidb |
Newer JDKs don't have javah => make file change that takes account of this
|
|
|
@33928
|
4 years |
davidb |
Streamlining of how test for JDK/javac is done
|
|
|
@33927
|
4 years |
davidb |
Reworking of javah test
|
|
|
@33926
|
4 years |
ak19 |
Investigated some other options for screen capturing and Google chrome …
|
|
|
@33925
|
4 years |
ak19 |
1. Bugfix: oversight, should return uri encoded URL for mapData, …
|
|
|
@33924
|
4 years |
ak19 |
Adding in Dr Bainbridge's command to check the JSON generated is …
|
|
|
@33923
|
4 years |
davidb |
Removed non-UTF8 valid char from comment; regenerated tar file
|
|
|
@33922
|
4 years |
davidb |
Notes about using this site
|
|
|
@33921
|
4 years |
davidb |
Newer Java's don't have 'javah' any more. The functionality has been …
|
|
|
@33920
|
4 years |
davidb |
Found to be needed when compiling up on a Google Compute Engine (GCE) …
|
|
|
@33919
|
4 years |
ak19 |
SummaryTool now uses the CountryCodeCountsMapData.java class to …
|
|
|
@33918
|
4 years |
ak19 |
Country codes added to each domain's URL of the manual site/domain …
|
|
|
@33917
|
4 years |
ak19 |
Added some better reporting when confirming sample size was correct
|
|
|
@33916
|
4 years |
ak19 |
Updated the rest of the file after reingest
|
|
|
@33915
|
4 years |
ak19 |
Forgot to add a (manual) counts file created last week, and am now …
|
|
|
@33914
|
4 years |
ak19 |
Shortlisted just the domain sites by country into ManualShortlist2.txt …
|
|
|
@33913
|
4 years |
ak19 |
1. Adjusted table mongodb query statements to be more exact, but same …
|
|
|
@33912
|
4 years |
ak19 |
Forgot to svn add the new MongoDBQueryer.java class with commit 33909. …
|
|
|
@33911
|
4 years |
ak19 |
Correct commit message for previous and current commit: 1. After …
|
|
|
@33910
|
4 years |
ak19 |
1. Implementing tables 3 to 5. 2. Rolled back the introduction of the …
|
|
|
@33909
|
4 years |
ak19 |
1. Implementing tables 3 to 5. 2. Rolled back the introduction of the …
|
|
|
@33908
|
4 years |
kjdon |
meta values are already escaped. Don't want to escape them again …
|
|
|
@33907
|
4 years |
ak19 |
See previous commit message. This will be the file with the results …
|
|
|
@33906
|
4 years |
ak19 |
Code is intermediate state. 1. Introduced basicDomain field to MongoDB …
|
|
|
@33905
|
4 years |
ak19 |
More notes
|
|
|
@33904
|
4 years |
ak19 |
Shouldn't greylist anglican.org, as this prevented crawling of …
|
|
|
@33903
|
4 years |
ak19 |
My notes when preparing for today's meetings. Some of this may be …
|
|
|
@33902
|
4 years |
kjdon |
pass in new casefold and accentfold options to format_metadata_for_sorting
|
|
|
@33901
|
4 years |
kjdon |
new casefold_metadata_for_formatting and …
|
|
|
@33900
|
4 years |
kjdon |
BaseClassifier casefold/accentfold options
|
|
|
@33899
|
4 years |
kjdon |
pass in new casefold and accentfold options (BaseClassifier) to …
|
|
|
@33898
|
4 years |
kjdon |
format_metadata_for_sorting now takes two additional args - casefold …
|
|
|
@33897
|
4 years |
kjdon |
elsewhere in the code - GSXML.xmlSafe, we are escaping ' => ' we …
|
|
|
@33896
|
4 years |
ak19 |
Clarification in comments
|
|
|
@33895
|
4 years |
ak19 |
Minor rename
|
|
|
@33894
|
4 years |
ak19 |
1. Adding map, counts.json and geo-json files for 5b count of sites by …
|
|
|
@33893
|
4 years |
ak19 |
1. Left out region code column. 2. Two more sheets of work in progress …
|
|
|
@33892
|
4 years |
ak19 |
Sheets renamed and spreadsheet renamed
|
|
|
@33891
|
4 years |
ak19 |
Site level detected vs manual inspected data: working shown in file …
|
|
|
@33890
|
4 years |
ak19 |
Finished going through NZ sites listing of numPagesContainingMRI > 0 …
|
|
|
@33889
|
4 years |
ak19 |
1. Additional column: totalPagesAcrossMatchingSites. 2. Screengrab of …
|
|
|
@33888
|
4 years |
kjdon |
added propertyFile attribute to gsf:interfaceText so that you can …
|
|
|
@33887
|
4 years |
ak19 |
1. Added support for writing out tables in csv format too. 2. Second …
|
|
|
@33886
|
4 years |
ak19 |
Minor. File rename
|
|
|
@33885
|
4 years |
ak19 |
Attempting to write the tables. csv not yet supported. Table 1 done.
|
|
|
@33884
|
4 years |
ak19 |
0. Previous commit had lots of modifications, and only 2 files matched …
|
|
|
@33883
|
4 years |
ak19 |
Clarifications
|
|
|
@33882
|
4 years |
ak19 |
Code now writes both a listing of all non-autotranslated websites and …
|
|
|
@33881
|
4 years |
ak19 |
Uses lambda expression to process each doc in a mongodb aggregate …
|
|
|
@33880
|
4 years |
ak19 |
Write out the 5counts_tentativeNonAutotranslatedSites.json file with …
|
|
|
@33879
|
4 years |
ak19 |
Have the 2 mongodb aggregate() calls working that
|
|
|
@33878
|
4 years |
ak19 |
Better comment
|
|
|
@33877
|
4 years |
ak19 |
Reordering to have proper descending order of counts
|
|
|
@33876
|
4 years |
ak19 |
Some missteps, but have got complex collection.aggregate() working at last.
|
|
|
@33875
|
4 years |
ak19 |
Renaming 2 more files correctly
|
|
|
@33874
|
4 years |
ak19 |
Renaming 2 files correctly
|
|
|
@33873
|
4 years |
ak19 |
Beginnings of WebPageURLsListing program whose purpose Dr Bainbridge …
|
|
|
@33872
|
4 years |
ak19 |
1. Added the file containing the 255 random NZ page URLs to sample. 2. …
|
|
|
@33871
|
4 years |
ak19 |
Removed mostly duplicated older version of method but left the …
|
|
|
@33870
|
4 years |
ak19 |
Got the mongodb query working in Java in 2 different ways: the fully …
|
|
|
@33869
|
4 years |
ak19 |
First cut at the RandomURLsForDomainGenerator.java class and the …
|
|
|
@33868
|
4 years |
ak19 |
With the updated code for generating the maps from 6a and 6b manual …
|
|
|
@33867
|
4 years |
ak19 |
Moved the code handling of special case large rectangles and those …
|
|
|
@33866
|
4 years |
ak19 |
Dr Bainbridge's fix to Android mobile macronizer user (on Chrome …
|
|
|
@33865
|
4 years |
ak19 |
1. The gs3 context name changed from macronizer to macron-restoration. …
|
|
|
@33864
|
4 years |
davidb |
Changes to make the Whakatohea banner narrower
|
|
|
@33863
|
4 years |
davidb |
Script to get sample content for the DL collection
|
|
|
@33862
|
4 years |
davidb |
Change to specifying the About page text done through about.xml so it …
|
|
|
@33861
|
4 years |
davidb |
About page text done through about.xml so it can include xslt tags
|
|
|
@33860
|
4 years |
davidb |
Addition of 3 further CPAN packages, found to be needed on CentOS build
|
|
|
@33859
|
4 years |
davidb |
Additional CPAN Perl packages found to be needed when compiling up …
|
|
|
@33858
|
4 years |
ak19 |
Fixes to the code committed yesterday: correct calculation of the …
|
|
|
@33857
|
4 years |
davidb |
Next iteration of the about text
|
|
|
@33856
|
4 years |
ak19 |
Forgot to commit. Last week, Dr Bainbridge had properly cropped the …
|
|
|
@33855
|
4 years |
davidb |
Code added to detect if the CGI parameter already specifies a …
|
|
|
@33854
|
4 years |
ak19 |
Manually gone over around 150 webpages of sample size of 255 webpages …
|
|
|
@33853
|
4 years |
ak19 |
Handling map coordinates that are horizontally excessive (beyond …
|
|
|
@33852
|
4 years |
davidb |
Unused. XSL filename extension potentially causing a problem with how …
|
|
|
@33851
|
4 years |
ak19 |
Deleting faulty maps. NZ numPages inMRI and containingMRI count is …
|
|
|
@33850
|
4 years |
ak19 |
Renames before deleting faulty maps. NZ numPages inMRI and …
|
|
|
@33849
|
4 years |
ak19 |
One less Australian site as it was an infographic containing Maori …
|
|
|
@33848
|
4 years |
ak19 |
Tables of mongodb counts (1-5 table) and manual counts (6table). …
|
|
|
@33847
|
4 years |
ak19 |
indigenousblogs.com did have one page actually in Maori (an XML feed). …
|
|
|
@33846
|
4 years |
ak19 |
Cropped out the json portion
|
|
|
@33845
|
4 years |
ak19 |
Cropped out the json portion
|
|
|
@33844
|
4 years |
ak19 |
Regenerated
|
|
|
@33843
|
4 years |
ak19 |
Counting the 3 non-NZ sites that had mi in the URl path that manual …
|
|
|
@33842
|
4 years |
ak19 |
Jotted down some further paragraphs and notes of interest. Tentatively …
|
|
|
@33841
|
4 years |
ak19 |
Latest version of the flowchart of the process of getting Common Crawl …
|
|
|
@33840
|
4 years |
ak19 |
Older flowchart of the process of getting Common Crawl data into …
|
|
|
@33839
|
4 years |
ak19 |
Moving writeup text file into new folder so I can add the SVG …
|
|
|
@33838
|
4 years |
ak19 |
Updated after checking non-NZ and non-nz TLD sites with mi in URL path
|
|
|
@33837
|
4 years |
davidb |
Local notes for the site
|
|
|
@33836
|
4 years |
davidb |
Macron added
|
|
|