source:

Revision Log Mode:


Legend:

Added
Modified
Copied or renamed
Diff Rev Age Author Log Message
(edit) @33862   4 years davidb Change to specifying the About page text done through about.xml so it …
(edit) @33861   4 years davidb About page text done through about.xml so it can include xslt tags
(edit) @33860   4 years davidb Addition of 3 further CPAN packages, found to be needed on CentOS build
(edit) @33859   4 years davidb Additional CPAN Perl packages found to be needed when compiling up …
(edit) @33858   4 years ak19 Fixes to the code committed yesterday: correct calculation of the …
(edit) @33857   4 years davidb Next iteration of the about text
(edit) @33856   4 years ak19 Forgot to commit. Last week, Dr Bainbridge had properly cropped the …
(edit) @33855   4 years davidb Code added to detect if the CGI parameter already specifies a …
(edit) @33854   4 years ak19 Manually gone over around 150 webpages of sample size of 255 webpages …
(edit) @33853   4 years ak19 Handling map coordinates that are horizontally excessive (beyond …
(edit) @33852   4 years davidb Unused. XSL filename extension potentially causing a problem with how …
(edit) @33851   4 years ak19 Deleting faulty maps. NZ numPages inMRI and containingMRI count is …
(edit) @33850   4 years ak19 Renames before deleting faulty maps. NZ numPages inMRI and …
(edit) @33849   4 years ak19 One less Australian site as it was an infographic containing Maori …
(edit) @33848   4 years ak19 Tables of mongodb counts (1-5 table) and manual counts (6table). …
(edit) @33847   4 years ak19 indigenousblogs.com did have one page actually in Maori (an XML feed). …
(edit) @33846   4 years ak19 Cropped out the json portion
(edit) @33845   4 years ak19 Cropped out the json portion
(edit) @33844   4 years ak19 Regenerated
(edit) @33843   4 years ak19 Counting the 3 non-NZ sites that had mi in the URl path that manual …
(edit) @33842   4 years ak19 Jotted down some further paragraphs and notes of interest. Tentatively …
(edit) @33841   4 years ak19 Latest version of the flowchart of the process of getting Common Crawl …
(edit) @33840   4 years ak19 Older flowchart of the process of getting Common Crawl data into …
(edit) @33839   4 years ak19 Moving writeup text file into new folder so I can add the SVG …
(edit) @33838   4 years ak19 Updated after checking non-NZ and non-nz TLD sites with mi in URL path
(edit) @33837   4 years davidb Local notes for the site
(edit) @33836   4 years davidb Macron added
(edit) @33835   4 years davidb Supporting iframe files now located within interface area
(edit) @33834   4 years davidb Metadata shell ready for download of demonstration source content files
(edit) @33833   4 years davidb Initial collection design
(edit) @33832   4 years davidb Initial set of files for Whakatohea collections
(edit) @33831   4 years davidb Top-level folder for Whakatohea Maori Trust Board collections
(edit) @33830   4 years davidb Initial set of files for WMTB themed DL
(edit) @33829   4 years davidb Top-level folder for Whakatohea Maori Trust Board themes DL
(edit) @33828   4 years ak19 Additions and modifications to the write-up.
(edit) @33827   4 years davidb Updated text about groupConfig.xml file
(edit) @33826   4 years davidb Fix to help compiling on CentOS
(edit) @33825   4 years ak19 Beginnings of first draft of write up.
(edit) @33824   4 years ak19 More instructions and explaining the contents of the mongodb-data folder.
(edit) @33823   4 years ak19 Recommitting mongo-data folder with renamed files with numbering.
(edit) @33822   4 years ak19 Removing as I'm renaming all the files with prefixes. There are too …
(edit) @33821   4 years ak19 Manually created a shortlist of MRI sites from longer …
(edit) @33820   4 years ak19 Forgot to commit before holidays.
(edit) @33819   4 years davidb Moved to newer version of intltool. This still had a problem with …
(edit) @33818   4 years davidb Changed #include statement in gio/gdbusmessage.c to work with Ubuntu 18
(edit) @33817   4 years davidb Removal of api-key file, as no longer needed (and wasn't a good idea …
(edit) @33816   4 years ak19 Finished manually going through the sites that I couldn't easily …
(edit) @33815   4 years ak19 Removed old results from before bugfix and improvement to …
(edit) @33814   4 years ak19 Put the important mongodb queries and results into …
(edit) @33813   4 years ak19 With the bugfix from yesterday and the inclusion of http(s):mi.* …
(edit) @33812   4 years ak19 Better handling of multi-line comment symbols, so I can now include …
(edit) @33811   4 years ak19 Returning to using a single variable, urlContainsLangCodeInPath, to …
(edit) @33810   4 years ak19 Bugfix: mi in url path should be checked for for each page of site, …
(edit) @33809   4 years ak19 Some more GS_README.txt instructions. Not put the mongodb queries in …
(edit) @33808   4 years ak19 Storing not just whether /mi(/) suffix is in path, but also whether …
(edit) @33807   4 years ak19 Trying to manually go through a shortlisted set of domains to see if …
(edit) @33806   4 years ak19 More mongodb querying revealed that excluding tentative product sites …
(edit) @33805   4 years ak19 1. Moving the static countrycodes.json file to conf folder and updated …
(edit) @33804   4 years ak19 1. Updated results from mongodb querying after yesterday's …
(edit) @33803   4 years ak19 geojson mapdata and map for mongodb results on …
(edit) @33802   4 years ak19 With an extra adult site removed and with setting countrycodes that …
(edit) @33801   4 years ak19 1. NutchTextDumpToMongoDB Added an extra field to each document in …
(edit) @33800   4 years ak19 Removed an adult site from crawled contents and added its url to …
(edit) @33799   4 years ak19 1. Adding breadcrumb for next step at end of running …
(edit) @33798   4 years ak19 Adding the geojson related files related to querying mongodb for sites …
(edit) @33797   4 years ak19 Updated json and imaegs files, and new files for when /mi(/) is in the …
(edit) @33796   4 years ak19 Instead of a hack for US' count being too great that its histogram …
(edit) @33795   4 years kjdon remove edit bar and right side bar from print view of document
(edit) @33794   4 years ak19 Wrote the geojson map data created from the site counts per …
(edit) @33793   4 years ak19 Changes for getting a running GS3 server to display collections on …
(edit) @33792   4 years ak19 Correcting spellings
(edit) @33791   4 years ak19 1. Kathy renamed the gs3interface properties filename from …
(edit) @33790   4 years ak19 Got the MultiPoint geojson mapdata of the country code counts working: …
(edit) @33789   4 years ak19 Redid the mongodb query to get the countrycode counts for all the …
(edit) @33788   4 years ak19 Adding all the jar files needed to work in Java with geojson Simple …
(edit) @33787   4 years ak19 Documented another mongodb query that I'm using, the one to produce …
(edit) @33786   4 years kjdon add google analytics, stop the outputting of fr, es, ru links as those …
(edit) @33785   4 years kjdon added google analytics
(edit) @33784   4 years kjdon only generate english versions for now as ru, fr, es haven't been …
(edit) @33783   4 years kjdon only generate english versions for now as ru, fr, es haven't been …
(edit) @33782   4 years kjdon updated README
(edit) @33781   4 years kjdon tidied up the intro
(edit) @33780   4 years kjdon added code to add google-analytics to each page
(edit) @33779   4 years kjdon ServiceRack.properties has been renamed to …
(edit) @33778   4 years ak19 Made a beginning on getting the geojson map data automated. Couldn't …
(edit) @33777   4 years ak19 Forgot to document a link, with sample code to use nativecall jar file …
(edit) @33776   4 years ak19 Field Separator (IFS) conflicting with backticks and other ways of …
(edit) @33775   4 years kjdon fixed a typo in a comment
(edit) @33774   4 years kjdon getTextString code moved to Dictionary.getTextSTring, as its no longer …
(edit) @33773   4 years kjdon the default dictionary is not ServiceRack.properties any more. Instead …
(edit) @33772   4 years kjdon don't need to pass in ServiceRack to getTextSTring anymore. …
(edit) @33771   4 years kjdon use the new Dictionary.getTExtSTring instead of repeating all the code here
(edit) @33770   4 years kjdon updated to match new args for Dictionary.getTextString
(edit) @33769   4 years kjdon updated getTextSTring to contain all the functionality from …
(edit) @33768   4 years kjdon removed some code that was commented out, and some methods that were …
(edit) @33767   4 years kjdon renaming ServiceRack.properties to core_servlet_dictionary.properties
(edit) @33766   4 years kjdon renaming ServiceRack.properties to core_servlet_dictionary.properties
(edit) @33765   4 years kjdon renaming ServiceRack.properties to core_servlet_dictionary.properties
(edit) @33764   4 years kjdon renaming ServiceRack.properties to core_servlet_dictionary.properties
(edit) @33763   4 years kjdon renaming ServiceRack.properties to core_servlet_dictionary.properties
Note: See TracRevisionLog for help on using the revision log.