#
# ChangeLog for /
#
# Generated by Trac 1.4.2
# 2024-06-17T17:00:22+12:00

Sun, 16 Feb 2020 01:19:46 GMT davidb [33933]
	* main/trunk/greenstone2/common-src/indexers/mg/java/org/greenstone/mg/Makefile.in (modified)
	* main/trunk/greenstone2/common-src/indexers/mgpp/java/org/greenstone/mgpp/Makefile.in (modified)

	Changed 8-spaces to tab chars in Makefile.in. Original problem ...

Sat, 15 Feb 2020 06:14:24 GMT davidb [33932]
	* main/trunk/greenstone3/gs3-setup.bat (modified)

	Commented out Java version warning message, as it presents as ...

Sat, 15 Feb 2020 06:10:54 GMT davidb [33931]
	* main/trunk/greenstone3/gs3-setup.sh (modified)

	Two changes to setup file. The first was to move the test for ant to ...

Sat, 15 Feb 2020 06:00:05 GMT davidb [33930]
	* main/trunk/search4j/libsearch4j.cpp (modified)

	Code used to assume that major number was a single digit, as in 1.6 ...

Sat, 15 Feb 2020 05:57:27 GMT davidb [33929]
	* main/trunk/greenstone3/src/packages/javagdbm/java/Makefile.in (modified)

	Newer JDKs don't have javah => make file change that takes account of ...

Sat, 15 Feb 2020 05:55:27 GMT davidb [33928]
	* main/trunk/greenstone3/src/packages/javagdbm/aclocal.m4 (added)
	* main/trunk/greenstone3/src/packages/javagdbm/configure (modified)
	* main/trunk/greenstone3/src/packages/javagdbm/configure.in (modified)

	Streamlining of how test for JDK/javac is done

Sat, 15 Feb 2020 01:57:35 GMT davidb [33927]
	* main/trunk/greenstone2/common-src/indexers/mg/java/org/greenstone/mg/Makefile.in (modified)
	* main/trunk/greenstone2/common-src/indexers/mgpp/java/org/greenstone/mgpp/Makefile.in (modified)

	Reworking of javah test

Fri, 14 Feb 2020 10:03:21 GMT ak19 [33926]
	* other-projects/maori-lang-detection/src/org/greenstone/atea/CountryCodeCountsMapData.java (modified)
	* other-projects/maori-lang-detection/src/org/greenstone/atea/SummaryTool.java (modified)

	Investigated some other options for screen capturing and Google ...

Fri, 14 Feb 2020 07:41:20 GMT ak19 [33925]
	* other-projects/maori-lang-detection/src/org/greenstone/atea/CountryCodeCountsMapData.java (modified)
	* other-projects/maori-lang-detection/src/org/greenstone/atea/SummaryTool.java (modified)

	1. Bugfix: oversight, should return uri encoded URL for mapData, ...

Fri, 14 Feb 2020 06:22:40 GMT ak19 [33924]
	* other-projects/maori-lang-detection/src/org/greenstone/atea/SummaryTool.java (modified)

	Adding in Dr Bainbridge's command to check the JSON generated is ...

Fri, 14 Feb 2020 05:45:24 GMT davidb [33923]
	* main/trunk/greenstone2/common-src/packages/jdbm/README.txt (added)
	* main/trunk/greenstone2/common-src/packages/jdbm/gs-jdbm-1.0.tar.gz (modified)

	Removed non-UTF8 valid char from comment; regenerated tar file

Fri, 14 Feb 2020 05:13:49 GMT davidb [33922]
	* main/trunk/model-sites-dev/multimodal-mdl/README.txt (added)

	Notes about using this site

Fri, 14 Feb 2020 05:11:22 GMT davidb [33921]
	* main/trunk/greenstone2/common-src/indexers/mg/java/org/greenstone/mg/Makefile.in (modified)
	* main/trunk/greenstone2/common-src/indexers/mgpp/java/org/greenstone/mgpp/Makefile.in (modified)

	Newer Java's don't have 'javah' any more. The functionality has been ...

Fri, 14 Feb 2020 03:55:49 GMT davidb [33920]
	* gs2-extensions/imagemagick/trunk/src/packages/CASCADE-MAKE/GS.sh (modified)

	Found to be needed when compiling up on a Google Compute Engine (GCE) ...

Thu, 13 Feb 2020 09:40:41 GMT ak19 [33919]
	* other-projects/maori-lang-detection/MoreReading/mongodb.txt (modified)
	* other-projects/maori-lang-detection/lib/jna-platform.jar (added)
	* other-projects/maori-lang-detection/lib/jna.jar (added)
	* other-projects/maori-lang-detection/src/org/greenstone/atea/CountryCodeCountsMapData.java (modified)
	* other-projects/maori-lang-detection/src/org/greenstone/atea/MongoDBQueryer.java (modified)
	* other-projects/maori-lang-detection/src/org/greenstone/atea/SummaryTool.java (modified)

	SummaryTool now uses the CountryCodeCountsMapData.java class to ...

Thu, 13 Feb 2020 06:34:14 GMT ak19 [33918]
	* other-projects/maori-lang-detection/mongodb-data/ManualShortlisting2_afterMongoDBReingest.txt (modified)
	* other-projects/maori-lang-detection/mongodb-data/manualList_globalDomains_whereAPageContainsMRI.txt (modified)

	Country codes added to each domain's URL of the manual site/domain ...

Thu, 13 Feb 2020 05:18:13 GMT ak19 [33917]
	* other-projects/maori-lang-detection/src/org/greenstone/atea/MongoDBQueryer.java (modified)
	* other-projects/maori-lang-detection/src/org/greenstone/atea/SummaryTool.java (modified)

	Added some better reporting when confirming sample size was correct

Thu, 13 Feb 2020 04:42:11 GMT ak19 [33916]
	* other-projects/maori-lang-detection/mongodb-data/ManualShortlisting2_afterMongoDBReingest.txt (modified)

	Updated the rest of the file after reingest

Thu, 13 Feb 2020 04:12:06 GMT ak19 [33915]
	* other-projects/maori-lang-detection/mongodb-data/6counts_nonProductSites1_manualShortlist.json (added)
	* other-projects/maori-lang-detection/mongodb-data/ManualShortlisting2_afterMongoDBReingest.txt (moved)

	Forgot to add a (manual) counts file created last week, and am now ...

Thu, 13 Feb 2020 04:09:07 GMT ak19 [33914]
	* other-projects/maori-lang-detection/MoreReading/mongodb.txt (modified)
	* other-projects/maori-lang-detection/mongodb-data/ManualShortlisting.txt (modified)
	* other-projects/maori-lang-detection/mongodb-data/ManualShortlisting2.txt (modified)
	* other-projects/maori-lang-detection/mongodb-data/manualList_globalDomains_whereAPageContainsMRI.txt (added)

	Shortlisted just the domain sites by country into ...

Wed, 12 Feb 2020 08:27:02 GMT ak19 [33913]
	* other-projects/maori-lang-detection/MoreReading/mongodb.txt (modified)
	* other-projects/maori-lang-detection/hdfs-cc-work/GS_README.TXT (modified)
	* other-projects/maori-lang-detection/mongodb-data/tables.txt (modified)
	* other-projects/maori-lang-detection/src/org/greenstone/atea/MongoDBQueryer.java (modified)

	1. Adjusted table mongodb query statements to be more exact, but same ...

Wed, 12 Feb 2020 06:53:48 GMT ak19 [33912]
	* other-projects/maori-lang-detection/src/org/greenstone/atea/MongoDBQueryer.java (added)

	Forgot to svn add the new MongoDBQueryer.java class with commit ...

Wed, 12 Feb 2020 06:12:42 GMT ak19 [33911]
	* other-projects/maori-lang-detection/src/org/greenstone/atea/MongoDBAccess.java (modified)
	* other-projects/maori-lang-detection/src/org/greenstone/atea/SummaryTool.java (moved)

	Correct commit message for previous and current commit: 1. After ...

Wed, 12 Feb 2020 06:05:50 GMT ak19 [33910]
	* other-projects/maori-lang-detection/src/org/greenstone/atea/MongoDBAccess.java (modified)

	1. Implementing tables 3 to 5. 2. Rolled back the introduction of the ...

Wed, 12 Feb 2020 06:02:44 GMT ak19 [33909]
	* other-projects/maori-lang-detection/src/org/greenstone/atea/MongoDBAccess.java (modified)
	* other-projects/maori-lang-detection/src/org/greenstone/atea/NutchTextDumpToMongoDB.java (modified)
	* other-projects/maori-lang-detection/src/org/greenstone/atea/WebPageURLsListing.java (modified)
	* other-projects/maori-lang-detection/src/org/greenstone/atea/morphia/WebsiteInfo.java (modified)

	1. Implementing tables 3 to 5. 2. Rolled back the introduction of the ...

Sun, 09 Feb 2020 20:41:10 GMT kjdon [33908]
	* main/trunk/greenstone3/web/interfaces/default/transform/expand-gsf.xsl (modified)

	meta values are already escaped. Don't want to escape them again ...

Wed, 05 Feb 2020 10:38:57 GMT ak19 [33907]
	* other-projects/maori-lang-detection/mongodb-data/ManualShortlisting2.txt (added)

	See previous commit message. This will be the file with the results ...

Wed, 05 Feb 2020 10:36:37 GMT ak19 [33906]
	* other-projects/maori-lang-detection/src/org/greenstone/atea/MongoDBAccess.java (modified)
	* other-projects/maori-lang-detection/src/org/greenstone/atea/NutchTextDumpToMongoDB.java (modified)
	* other-projects/maori-lang-detection/src/org/greenstone/atea/WebPageURLsListing.java (modified)
	* other-projects/maori-lang-detection/src/org/greenstone/atea/morphia/WebsiteInfo.java (modified)

	Code is intermediate state. 1. Introduced basicDomain field to ...

Wed, 05 Feb 2020 05:49:16 GMT ak19 [33905]
	* other-projects/maori-lang-detection/MoreReading/mongodb.txt (modified)
	* other-projects/maori-lang-detection/hdfs-cc-work/GS_README.TXT (modified)

	More notes

Wed, 05 Feb 2020 05:48:33 GMT ak19 [33904]
	* other-projects/maori-lang-detection/conf/sites-too-big-to-exhaustively-crawl.txt (modified)
	* other-projects/maori-lang-detection/conf/url-greylist-filter.txt (modified)
	* other-projects/maori-lang-detection/crawledNode6.tar (modified)
	* other-projects/maori-lang-detection/to_crawl.tar.gz (modified)

	Shouldn't greylist anglican.org, as this prevented crawling of ...

Tue, 04 Feb 2020 02:50:43 GMT ak19 [33903]
	* other-projects/maori-lang-detection/journal-paper/MRI_slideNotes.txt (added)

	My notes when preparing for today's meetings. Some of this may be ...

Tue, 04 Feb 2020 00:05:30 GMT kjdon [33902]
	* main/trunk/greenstone2/perllib/classify/AZCompactList.pm (modified)
	* main/trunk/greenstone2/perllib/classify/AZList.pm (modified)
	* main/trunk/greenstone2/perllib/classify/AZSectionList.pm (modified)
	* main/trunk/greenstone2/perllib/classify/DateList.pm (modified)
	* main/trunk/greenstone2/perllib/classify/Hierarchy.pm (modified)
	* main/trunk/greenstone2/perllib/classify/SectionList.pm (modified)
	* main/trunk/greenstone2/perllib/classify/SimpleList.pm (modified)

	pass in new casefold and accentfold options to ...

Tue, 04 Feb 2020 00:04:35 GMT kjdon [33901]
	* main/trunk/greenstone2/perllib/classify/BaseClassifier.pm (modified)

	new casefold_metadata_for_formatting and ...

Tue, 04 Feb 2020 00:03:37 GMT kjdon [33900]
	* main/trunk/greenstone2/perllib/strings.properties (modified)

	BaseClassifier casefold/accentfold options

Tue, 04 Feb 2020 00:03:05 GMT kjdon [33899]
	* main/trunk/greenstone2/perllib/classify/List.pm (modified)

	pass in new casefold and accentfold options (BaseClassifier) to ...

Mon, 03 Feb 2020 23:59:00 GMT kjdon [33898]
	* main/trunk/greenstone2/perllib/sorttools.pm (modified)

	format_metadata_for_sorting now takes two additional args - casefold ...

Mon, 03 Feb 2020 21:06:11 GMT kjdon [33897]
	* main/trunk/greenstone3/src/java/org/greenstone/gsdl3/util/XMLConverter.java (modified)

	elsewhere in the code - GSXML.xmlSafe, we are escaping ' => ' we ...

Mon, 03 Feb 2020 10:29:59 GMT ak19 [33896]
	* other-projects/maori-lang-detection/MoreReading/mongodb.txt (modified)

	Clarification in comments

Mon, 03 Feb 2020 10:20:53 GMT ak19 [33895]
	* other-projects/maori-lang-detection/mongodb-data/5b_counts_containsMRI_groupedByNZorOverseasNoFilter.json (moved)

	Minor rename

Mon, 03 Feb 2020 10:20:33 GMT ak19 [33894]
	* other-projects/maori-lang-detection/mongodb-data/5b_count_containsMRI_groupedByNZorOverseasNoFilter.json (added)
	* other-projects/maori-lang-detection/mongodb-data/5b_geojson-features_containsMRI_groupedByNZorOverseasNoFilter.json (added)
	* other-projects/maori-lang-detection/mongodb-data/5b_map_containsMRI_groupedByNZorOverseasNoFilter.png (added)
	* other-projects/maori-lang-detection/mongodb-data/5b_multipoint_containsMRI_groupedByNZorOverseasNoFilter.json (added)
	* other-projects/maori-lang-detection/mongodb-data/6counts_sitesWithPagesContainingMRI_manualShortlist.json (moved)
	* other-projects/maori-lang-detection/mongodb-data/6geojson-features_sitesWithPagesContainingMRI_manualShortlist.json (moved)
	* other-projects/maori-lang-detection/mongodb-data/6map_sitesWithPagesContainingMRI_manualShortlist.png (moved)
	* other-projects/maori-lang-detection/mongodb-data/6multipoint_sitesWithPagesContainingMRI_manualShortlist.json (moved)
	* other-projects/maori-lang-detection/mongodb-data/tables.txt (modified)

	1. Adding map, counts.json and geo-json files for 5b count of sites ...

Mon, 03 Feb 2020 09:41:47 GMT ak19 [33893]
	* other-projects/maori-lang-detection/mongodb-data/8TableOfNumDetectedVsManualSITESWithMRI.ods (modified)
	* other-projects/maori-lang-detection/mongodb-data/8table_siteCountSummary.png (modified)

	1. Left out region code column. 2. Two more sheets of work in ...

Mon, 03 Feb 2020 09:28:44 GMT ak19 [33892]
	* other-projects/maori-lang-detection/mongodb-data/8TableOfNumDetectedVsManualSITESWithMRI.ods (moved)

	Sheets renamed and spreadsheet renamed

Mon, 03 Feb 2020 09:27:37 GMT ak19 [33891]
	* other-projects/maori-lang-detection/mongodb-data/6table_nonProductSites1_manualShortlist.json (modified)
	* other-projects/maori-lang-detection/mongodb-data/8table_siteCountSummary.png (added)
	* other-projects/maori-lang-detection/mongodb-data/ManualShortlisting.txt (added)
	* other-projects/maori-lang-detection/mongodb-data/TableOfNumDetectedVsManualSITESWithMRI.ods (added)

	Site level detected vs manual inspected data: working shown in file ...

Mon, 03 Feb 2020 07:31:33 GMT ak19 [33890]
	* other-projects/maori-lang-detection/mongodb-data/6table_nonProductSites1_manualShortlist.json (modified)

	Finished going through NZ sites listing of numPagesContainingMRI > 0 ...

Mon, 03 Feb 2020 02:48:40 GMT ak19 [33889]
	* other-projects/maori-lang-detection/mongodb-data/1a_table_miInUrlPath.csv (modified)
	* other-projects/maori-lang-detection/mongodb-data/1a_table_miInUrlPath.png (added)
	* other-projects/maori-lang-detection/mongodb-data/1b_table_noMiInUrlPath.csv (modified)
	* other-projects/maori-lang-detection/mongodb-data/1b_table_noMiInUrlPath.png (added)
	* other-projects/maori-lang-detection/mongodb-data/1table_allCrawledSites.csv (modified)
	* other-projects/maori-lang-detection/mongodb-data/1table_allCrawledSites.png (added)
	* other-projects/maori-lang-detection/mongodb-data/2table_sitesWithPagesInMRI.csv (modified)
	* other-projects/maori-lang-detection/mongodb-data/2table_sitesWithPagesInMRI.png (added)
	* other-projects/maori-lang-detection/mongodb-data/3table_sitesWithPagesContainingMRI.csv (modified)
	* other-projects/maori-lang-detection/mongodb-data/3table_sitesWithPagesContainingMRI.png (added)
	* other-projects/maori-lang-detection/mongodb-data/4table_tentativeNonProductSites.csv (modified)
	* other-projects/maori-lang-detection/mongodb-data/4table_tentativeNonProductSites.png (added)
	* other-projects/maori-lang-detection/mongodb-data/5b_table_containsMRI_groupedByNZorOverseasNoFilter.csv (added)
	* other-projects/maori-lang-detection/mongodb-data/5b_table_containsMRI_groupedByNZorOverseasNoFilter.png (added)
	* other-projects/maori-lang-detection/mongodb-data/5table_tentativeNonProductSites1.csv (modified)
	* other-projects/maori-lang-detection/mongodb-data/5table_tentativeNonProductSites1.png (added)
	* other-projects/maori-lang-detection/mongodb-data/tables.txt (modified)

	1. Additional column: totalPagesAcrossMatchingSites. 2. Screengrab of ...

Mon, 03 Feb 2020 00:08:44 GMT kjdon [33888]
	* main/trunk/greenstone3/web/interfaces/default/transform/expand-gsf.xsl (modified)

	added propertyFile attribute to gsf:interfaceText so that you can ...

Fri, 31 Jan 2020 10:49:11 GMT ak19 [33887]
	* other-projects/maori-lang-detection/src/org/greenstone/atea/MongoDBAccess.java (modified)
	* other-projects/maori-lang-detection/src/org/greenstone/atea/Utility.java (modified)
	* other-projects/maori-lang-detection/src/org/greenstone/atea/WebPageURLsListing.java (modified)

	1. Added support for writing out tables in csv format too. 2. Second ...

Fri, 31 Jan 2020 10:17:47 GMT ak19 [33886]
	* other-projects/maori-lang-detection/mongodb-data/2table_sitesWithPagesInMRI.csv (moved)

	Minor. File rename

Fri, 31 Jan 2020 09:54:15 GMT ak19 [33885]
	* other-projects/maori-lang-detection/src/org/greenstone/atea/MongoDBAccess.java (modified)
	* other-projects/maori-lang-detection/src/org/greenstone/atea/WebPageURLsListing.java (modified)

	Attempting to write the tables. csv not yet supported. Table 1 done.

Fri, 31 Jan 2020 09:21:40 GMT ak19 [33884]
	* other-projects/maori-lang-detection/mongodb-data/6table_nonProductSites1_manualShortlist.json (modified)
	* other-projects/maori-lang-detection/src/org/greenstone/atea/WebPageURLsListing.java (modified)

	0. Previous commit had lots of modifications, and only 2 files ...

Fri, 31 Jan 2020 08:50:34 GMT ak19 [33883]
	* other-projects/maori-lang-detection/mongodb-data/5table_tentativeNonProductSites1.csv (modified)
	* other-projects/maori-lang-detection/mongodb-data/6table_nonProductSites1_manualShortlist.json (modified)
	* other-projects/maori-lang-detection/src/org/greenstone/atea/MongoDBAccess.java (modified)
	* other-projects/maori-lang-detection/src/org/greenstone/atea/RandomURLsForDomainGenerator.java (modified)
	* other-projects/maori-lang-detection/src/org/greenstone/atea/WebPageURLsListing.java (modified)

	Clarifications

Thu, 30 Jan 2020 09:54:39 GMT ak19 [33882]
	* other-projects/maori-lang-detection/src/org/greenstone/atea/MongoDBAccess.java (modified)
	* other-projects/maori-lang-detection/src/org/greenstone/atea/WebPageURLsListing.java (modified)

	Code now writes both a listing of all non-autotranslated websites and ...

Thu, 30 Jan 2020 09:08:00 GMT ak19 [33881]
	* other-projects/maori-lang-detection/src/org/greenstone/atea/MongoDBAccess.java (modified)

	Uses lambda expression to process each doc in a mongodb aggregate ...

Thu, 30 Jan 2020 08:17:40 GMT ak19 [33880]
	* other-projects/maori-lang-detection/src/org/greenstone/atea/MongoDBAccess.java (modified)
	* other-projects/maori-lang-detection/src/org/greenstone/atea/WebPageURLsListing.java (modified)

	Write out the 5counts_tentativeNonAutotranslatedSites.json file with ...

Thu, 30 Jan 2020 07:21:31 GMT ak19 [33879]
	* other-projects/maori-lang-detection/src/org/greenstone/atea/MongoDBAccess.java (modified)
	* other-projects/maori-lang-detection/src/org/greenstone/atea/WebPageURLsListing.java (modified)

	Have the 2 mongodb aggregate() calls working that

Thu, 30 Jan 2020 07:18:09 GMT ak19 [33878]
	* other-projects/maori-lang-detection/mongodb-data/tables.txt (modified)

	Better comment

Thu, 30 Jan 2020 07:07:59 GMT ak19 [33877]
	* other-projects/maori-lang-detection/mongodb-data/5counts_tentativeNonProductSites1.json (modified)

	Reordering to have proper descending order of counts

Wed, 29 Jan 2020 08:48:52 GMT ak19 [33876]
	* other-projects/maori-lang-detection/src/org/greenstone/atea/MongoDBAccess.java (modified)
	* other-projects/maori-lang-detection/src/org/greenstone/atea/WebPageURLsListing.java (modified)

	Some missteps, but have got complex collection.aggregate() working at ...

Wed, 29 Jan 2020 06:18:29 GMT ak19 [33875]
	* other-projects/maori-lang-detection/mongodb-data/6b_geojson-features_manualShortlist_numPagesContainingMRI.json (moved)
	* other-projects/maori-lang-detection/mongodb-data/6b_multipoint_manualShortlist_numPagesContainingMRI.json (moved)

	Renaming 2 more files correctly

Wed, 29 Jan 2020 06:15:29 GMT ak19 [33874]
	* other-projects/maori-lang-detection/mongodb-data/6a_geojson-features_manualShortlist_numPagesInMRI.json (moved)
	* other-projects/maori-lang-detection/mongodb-data/6a_multipoint_manualShortlist_numPagesInMRI.json (moved)

	Renaming 2 files correctly

Fri, 24 Jan 2020 08:49:44 GMT ak19 [33873]
	* other-projects/maori-lang-detection/src/org/greenstone/atea/WebPageURLsListing.java (added)

	Beginnings of WebPageURLsListing program whose purpose Dr Bainbridge ...

Fri, 24 Jan 2020 08:44:04 GMT ak19 [33872]
	* other-projects/maori-lang-detection/mongodb-data/4counts_tentativeNonProductSites.json (modified)
	* other-projects/maori-lang-detection/mongodb-data/5counts_tentativeNonProductSites1.json (modified)
	* other-projects/maori-lang-detection/mongodb-data/6table_nonProductSites1_manualShortlist.json (modified)
	* other-projects/maori-lang-detection/mongodb-data/random255_domainsNZ_IsMRI.txt (added)
	* other-projects/maori-lang-detection/mongodb-data/tables.txt (modified)

	1. Added the file containing the 255 random NZ page URLs to sample. ...

Fri, 24 Jan 2020 07:59:42 GMT ak19 [33871]
	* other-projects/maori-lang-detection/src/org/greenstone/atea/MongoDBAccess.java (modified)
	* other-projects/maori-lang-detection/src/org/greenstone/atea/RandomURLsForDomainGenerator.java (modified)

	Removed mostly duplicated older version of method but left the ...

Fri, 24 Jan 2020 07:48:17 GMT ak19 [33870]
	* other-projects/maori-lang-detection/src/org/greenstone/atea/MongoDBAccess.java (modified)
	* other-projects/maori-lang-detection/src/org/greenstone/atea/RandomURLsForDomainGenerator.java (modified)

	Got the mongodb query working in Java in 2 different ways: the fully ...

Thu, 23 Jan 2020 09:59:46 GMT ak19 [33869]
	* other-projects/maori-lang-detection/src/org/greenstone/atea/CountryCodeCountsMapData.java (modified)
	* other-projects/maori-lang-detection/src/org/greenstone/atea/MongoDBAccess.java (modified)
	* other-projects/maori-lang-detection/src/org/greenstone/atea/RandomURLsForDomainGenerator.java (added)

	First cut at the RandomURLsForDomainGenerator.java class and the ...

Thu, 23 Jan 2020 08:16:44 GMT ak19 [33868]
	* other-projects/maori-lang-detection/mongodb-data/6a_counts_geojson-features_manualShortlist_numPagesInMRI.json (added)
	* other-projects/maori-lang-detection/mongodb-data/6a_counts_multipoint_manualShortlist_numPagesInMRI.json (added)
	* other-projects/maori-lang-detection/mongodb-data/6a_map_numPagesInMRI_fromManualInspectedSites.png (added)
	* other-projects/maori-lang-detection/mongodb-data/6b_counts_geojson-features_manualShortlist_numPagesContainingMRI.json (added)
	* other-projects/maori-lang-detection/mongodb-data/6b_counts_multipoint_manualShortlist_numPagesContainingMRI.json (added)
	* other-projects/maori-lang-detection/mongodb-data/6b_map_numPagesContainingMRI_fromManualInspectedSites.png (added)
	* other-projects/maori-lang-detection/mongodb-data/6table_nonProductSites1_manualShortlist.json (modified)

	With the updated code for generating the maps from 6a and 6b manual ...

Thu, 23 Jan 2020 08:12:17 GMT ak19 [33867]
	* other-projects/maori-lang-detection/src/org/greenstone/atea/CountryCodeCountsMapData.java (modified)

	Moved the code handling of special case large rectangles and those ...

Thu, 23 Jan 2020 05:56:36 GMT ak19 [33866]
	* other-projects/the-macronizer/trunk/web/jsp/en/main.jsp (modified)
	* other-projects/the-macronizer/trunk/web/jsp/mi/main.jsp (modified)

	Dr Bainbridge's fix to Android mobile macronizer user (on Chrome ...

Thu, 23 Jan 2020 05:49:56 GMT ak19 [33865]
	* other-projects/the-macronizer/trunk/build.xml (modified)
	* other-projects/the-macronizer/trunk/web/macronizer.xml.in (modified)

	1. The gs3 context name changed from macronizer to macron- ...

Thu, 23 Jan 2020 01:09:31 GMT davidb [33864]
	* main/trunk/model-interfaces-dev/whakatohea/iframe/background-images/whakatohea-banner-narrow.jpg (added)
	* main/trunk/model-interfaces-dev/whakatohea/iframe/wmtb-header.html (modified)
	* main/trunk/model-interfaces-dev/whakatohea/transform/layouts/main.xsl (modified)

	Changes to make the Whakatohea banner narrower

Wed, 22 Jan 2020 22:32:38 GMT davidb [33863]
	* main/trunk/model-sites-dev/whakatohea/collect/waiata/GET-EXAMPLE-SOURCE-DOCS.sh (added)

	Script to get sample content for the DL collection

Wed, 22 Jan 2020 22:17:16 GMT davidb [33862]
	* main/trunk/model-sites-dev/whakatohea/collect/waiata/etc/collectionConfig.xml (modified)

	Change to specifying the About page text done through about.xml so it ...

Wed, 22 Jan 2020 22:16:36 GMT davidb [33861]
	* main/trunk/model-sites-dev/whakatohea/collect/waiata/transform (added)
	* main/trunk/model-sites-dev/whakatohea/collect/waiata/transform/pages (added)
	* main/trunk/model-sites-dev/whakatohea/collect/waiata/transform/pages/about.xsl (added)

	About page text done through about.xml so it can include xslt tags

Wed, 22 Jan 2020 21:22:28 GMT davidb [33860]
	* main/trunk/greenstone2/build-src/packages/Makefile (modified)
	* main/trunk/greenstone2/build-src/packages/Makefile.in (modified)

	Addition of 3 further CPAN packages, found to be needed on CentOS build

Wed, 22 Jan 2020 20:56:01 GMT davidb [33859]
	* main/trunk/greenstone2/build-src/packages/configure (modified)

	Additional CPAN Perl packages found to be needed when compiling up ...

Wed, 22 Jan 2020 06:31:09 GMT ak19 [33858]
	* other-projects/maori-lang-detection/src/org/greenstone/atea/CountryCodeCountsMapData.java (modified)

	Fixes to the code committed yesterday: correct calculation of the ...

Wed, 22 Jan 2020 03:49:59 GMT davidb [33857]
	* main/trunk/model-sites-dev/whakatohea/collect/waiata/etc/collectionConfig.xml (modified)

	Next iteration of the about text

Wed, 22 Jan 2020 03:33:31 GMT ak19 [33856]
	* other-projects/maori-lang-detection/journal-paper/CommonCrawl_flow.pdf (added)
	* other-projects/maori-lang-detection/journal-paper/CommonCrawl_flow.svg (modified)

	Forgot to commit. Last week, Dr Bainbridge had properly cropped the ...

Wed, 22 Jan 2020 02:03:19 GMT davidb [33855]
	* main/trunk/greenstone3/web/interfaces/default/transform/pages/depositor_home.xsl (modified)

	Code added to detect if the CGI parameter already specifies a ...

Tue, 21 Jan 2020 09:01:07 GMT ak19 [33854]
	* other-projects/maori-lang-detection/mongodb-data/6table_nonProductSites1_manualShortlist.json (modified)

	Manually gone over around 150 webpages of sample size of 255 webpages ...

Tue, 21 Jan 2020 08:58:29 GMT ak19 [33853]
	* other-projects/maori-lang-detection/src/org/greenstone/atea/CountryCodeCountsMapData.java (modified)

	Handling map coordinates that are horizontally excessive (beyond ...

Tue, 21 Jan 2020 00:37:49 GMT davidb [33852]
	* main/trunk/model-interfaces-dev/whakatohea/iframe/wmtb-header-css-js.PREP (moved)

	Unused. XSL filename extension potentially causing a problem with ...

Fri, 17 Jan 2020 09:38:24 GMT ak19 [33851]
	* other-projects/maori-lang-detection/mongodb-data/6a_map_manuallyInspected_numPagesInMRI.png (deleted)
	* other-projects/maori-lang-detection/mongodb-data/6b_map_manuallyInspected_numPagesContainingMRI.png (deleted)

	Deleting faulty maps. NZ numPages inMRI and containingMRI count is ...

Fri, 17 Jan 2020 09:38:00 GMT ak19 [33850]
	* other-projects/maori-lang-detection/mongodb-data/6a_map_manuallyInspected_numPagesInMRI.png (moved)
	* other-projects/maori-lang-detection/mongodb-data/6b_map_manuallyInspected_numPagesContainingMRI.png (moved)

	Renames before deleting faulty maps. NZ numPages inMRI and ...

Fri, 17 Jan 2020 09:22:18 GMT ak19 [33849]
	* other-projects/maori-lang-detection/MoreReading/mongodb.txt (modified)
	* other-projects/maori-lang-detection/journal-paper/writeup (modified)

	One less Australian site as it was an infographic containing Maori ...

Fri, 17 Jan 2020 09:21:14 GMT ak19 [33848]
	* other-projects/maori-lang-detection/mongodb-data/1a_counts_miInUrlPath.json (modified)
	* other-projects/maori-lang-detection/mongodb-data/1a_table_miInUrlPath.csv (added)
	* other-projects/maori-lang-detection/mongodb-data/1b_counts_noMiInUrlPath.json (modified)
	* other-projects/maori-lang-detection/mongodb-data/1b_table_noMiInUrlPath.csv (added)
	* other-projects/maori-lang-detection/mongodb-data/1table_allCrawledSites.csv (added)
	* other-projects/maori-lang-detection/mongodb-data/2table__sitesWithPagesInMRI.csv (added)
	* other-projects/maori-lang-detection/mongodb-data/3table_sitesWithPagesContainingMRI.csv (added)
	* other-projects/maori-lang-detection/mongodb-data/4table_tentativeNonProductSites.csv (added)
	* other-projects/maori-lang-detection/mongodb-data/5table_tentativeNonProductSites1.csv (added)
	* other-projects/maori-lang-detection/mongodb-data/6a_counts_manualShortlist_numPagesInMRI.json (added)
	* other-projects/maori-lang-detection/mongodb-data/6a_geojson-features_manualShortlist_numPagesInMRI.json (added)
	* other-projects/maori-lang-detection/mongodb-data/6a_manuallyInspected_numPagesInMRI.png (added)
	* other-projects/maori-lang-detection/mongodb-data/6a_multipoint_manualShortlist_numPagesInMRI.json (added)
	* other-projects/maori-lang-detection/mongodb-data/6b_counts_manualShortlist_numPagesContainingMRI.json (added)
	* other-projects/maori-lang-detection/mongodb-data/6b_geojson-features_manualShortlist_numPagesContainingMRI.json (added)
	* other-projects/maori-lang-detection/mongodb-data/6b_manuallyInspected_numPagesContainingMRI.png (added)
	* other-projects/maori-lang-detection/mongodb-data/6b_multipoint_manualShortlist_numPagesContainingMRI.json (added)
	* other-projects/maori-lang-detection/mongodb-data/6counts_nonProductSites1_manualShortlist.json (modified)
	* other-projects/maori-lang-detection/mongodb-data/6geojson-features_nonProductSites1_manualShortlist.json (modified)
	* other-projects/maori-lang-detection/mongodb-data/6map_exclAutotranslatedSites1_manualShortlist.png (modified)
	* other-projects/maori-lang-detection/mongodb-data/6table_nonProductSites1_manualShortlist.json (added)
	* other-projects/maori-lang-detection/mongodb-data/tables.txt (added)

	Tables of mongodb counts (1-5 table) and manual counts (6table). ...

Fri, 17 Jan 2020 06:32:16 GMT ak19 [33847]
	* other-projects/maori-lang-detection/MoreReading/mongodb.txt (modified)
	* other-projects/maori-lang-detection/mongodb-data/6counts_nonProductSites1_manualShortlist.json (modified)
	* other-projects/maori-lang-detection/mongodb-data/6geojson-features_nonProductSites1_manualShortlist.json (modified)
	* other-projects/maori-lang-detection/mongodb-data/6map_exclAutotranslatedSites1_manualShortlist.png (modified)

	indigenousblogs.com did have one page actually in Maori (an XML ...

Fri, 17 Jan 2020 03:49:05 GMT ak19 [33846]
	* other-projects/maori-lang-detection/mongodb-data/1map_allCrawledSites.png (modified)
	* other-projects/maori-lang-detection/mongodb-data/2map_sitesWithPagesInMRI.png (modified)
	* other-projects/maori-lang-detection/mongodb-data/3map_sitesWithPagesContainingMRI.png (modified)
	* other-projects/maori-lang-detection/mongodb-data/4map_exclTentativeAutotranslatedSites.png (modified)
	* other-projects/maori-lang-detection/mongodb-data/5map_exclTentativeAutotranslatedSites1.png (modified)
	* other-projects/maori-lang-detection/mongodb-data/6map_exclAutotranslatedSites1_manualShortlist.png (modified)

	Cropped out the json portion

Fri, 17 Jan 2020 03:34:11 GMT ak19 [33845]
	* other-projects/maori-lang-detection/mongodb-data/6map_exclAutotranslatedSites1_manualShortlist.png (modified)

	Cropped out the json portion

Fri, 17 Jan 2020 03:33:24 GMT ak19 [33844]
	* other-projects/maori-lang-detection/mongodb-data/6counts_nonProductSites1_manualShortlist.json (modified)
	* other-projects/maori-lang-detection/mongodb-data/6geojson-features_nonProductSites1_manualShortlist.json (modified)
	* other-projects/maori-lang-detection/mongodb-data/6map_exclAutotranslatedSites1_manualShortlist.png (modified)
	* other-projects/maori-lang-detection/mongodb-data/7miInURLPath_exclNZ_byCountryCode.json (added)

	Regenerated

Fri, 17 Jan 2020 03:24:28 GMT ak19 [33843]
	* other-projects/maori-lang-detection/MoreReading/mongodb.txt (modified)

	Counting the 3 non-NZ sites that had mi in the URL path that manual ...

Thu, 16 Jan 2020 09:30:09 GMT ak19 [33842]
	* other-projects/maori-lang-detection/journal-paper/writeup (modified)

	Jotted down some further paragraphs and notes of interest. ...

Thu, 16 Jan 2020 08:23:09 GMT ak19 [33841]
	* other-projects/maori-lang-detection/journal-paper/CommonCrawl_flow.svg (modified)

	Latest version of the flowchart of the process of getting Common ...

Thu, 16 Jan 2020 08:22:15 GMT ak19 [33840]
	* other-projects/maori-lang-detection/journal-paper/CommonCrawl_flow.svg (added)

	Older flowchart of the process of getting Common Crawl data into ...

Thu, 16 Jan 2020 08:18:43 GMT ak19 [33839]
	* other-projects/maori-lang-detection/journal-paper (added)
	* other-projects/maori-lang-detection/journal-paper/writeup (moved)

	Moving writeup text file into new folder so I can add the SVG ...

Thu, 16 Jan 2020 04:56:50 GMT ak19 [33838]
	* other-projects/maori-lang-detection/MoreReading/mongodb.txt (modified)

	Updated after checking non-NZ and non-nz TLD sites with mi in URL path

Wed, 15 Jan 2020 23:15:30 GMT davidb [33837]
	* main/trunk/model-sites-dev/whakatohea/README.txt (added)

	Local notes for the site

Tue, 14 Jan 2020 21:14:22 GMT davidb [33836]
	* main/trunk/model-sites-dev/whakatohea/siteConfig.xml (modified)

	Macron added

Tue, 14 Jan 2020 21:12:30 GMT davidb [33835]
	* main/trunk/model-interfaces-dev/whakatohea/iframe (added)
	* main/trunk/model-interfaces-dev/whakatohea/iframe/Tīpuna-WMTB_files (added)
	* main/trunk/model-interfaces-dev/whakatohea/iframe/Tīpuna-WMTB_files/3653928.jpg (added)
	* main/trunk/model-interfaces-dev/whakatohea/iframe/Tīpuna-WMTB_files/analytics.js (added)
	* main/trunk/model-interfaces-dev/whakatohea/iframe/Tīpuna-WMTB_files/commerce-core.js (added)
	* main/trunk/model-interfaces-dev/whakatohea/iframe/Tīpuna-WMTB_files/css.css (added)
	* main/trunk/model-interfaces-dev/whakatohea/iframe/Tīpuna-WMTB_files/css_002.css (added)
	* main/trunk/model-interfaces-dev/whakatohea/iframe/Tīpuna-WMTB_files/css_003.css (added)
	* main/trunk/model-interfaces-dev/whakatohea/iframe/Tīpuna-WMTB_files/custom.js (added)
	* main/trunk/model-interfaces-dev/whakatohea/iframe/Tīpuna-WMTB_files/fancybox.css (added)
	* main/trunk/model-interfaces-dev/whakatohea/iframe/Tīpuna-WMTB_files/ga.js (added)
	* main/trunk/model-interfaces-dev/whakatohea/iframe/Tīpuna-WMTB_files/gdprscript.js (added)
	* main/trunk/model-interfaces-dev/whakatohea/iframe/Tīpuna-WMTB_files/jquery.js (added)
	* main/trunk/model-interfaces-dev/whakatohea/iframe/Tīpuna-WMTB_files/jquery_002.js (added)
	* main/trunk/model-interfaces-dev/whakatohea/iframe/Tīpuna-WMTB_files/jquery_003.js (added)
	* main/trunk/model-interfaces-dev/whakatohea/iframe/Tīpuna-WMTB_files/jquery_004.js (added)
	* main/trunk/model-interfaces-dev/whakatohea/iframe/Tīpuna-WMTB_files/jquery_005.js (added)
	* main/trunk/model-interfaces-dev/whakatohea/iframe/Tīpuna-WMTB_files/main-commerce-browse.js (added)
	* main/trunk/model-interfaces-dev/whakatohea/iframe/Tīpuna-WMTB_files/main-customer-accounts-site.js (added)
	* main/trunk/model-interfaces-dev/whakatohea/iframe/Tīpuna-WMTB_files/main-membership-site.js (added)
	* main/trunk/model-interfaces-dev/whakatohea/iframe/Tīpuna-WMTB_files/main.js (added)
	* main/trunk/model-interfaces-dev/whakatohea/iframe/Tīpuna-WMTB_files/main_style.css (added)
	* main/trunk/model-interfaces-dev/whakatohea/iframe/Tīpuna-WMTB_files/plugins.js (added)
	* main/trunk/model-interfaces-dev/whakatohea/iframe/Tīpuna-WMTB_files/site_membership.css (added)
	* main/trunk/model-interfaces-dev/whakatohea/iframe/Tīpuna-WMTB_files/sites.css (added)
	* main/trunk/model-interfaces-dev/whakatohea/iframe/Tīpuna-WMTB_files/snowday262.js (added)
	* main/trunk/model-interfaces-dev/whakatohea/iframe/Tīpuna-WMTB_files/social-icons.css (added)
	* main/trunk/model-interfaces-dev/whakatohea/iframe/Tīpuna-WMTB_files/stl.js (added)
	* main/trunk/model-interfaces-dev/whakatohea/iframe/Tīpuna-WMTB_files/templateArtifacts.js (added)
	* main/trunk/model-interfaces-dev/whakatohea/iframe/Tīpuna-WMTB_files/whakatohea-01.jpg (added)
	* main/trunk/model-interfaces-dev/whakatohea/iframe/background-images (added)
	* main/trunk/model-interfaces-dev/whakatohea/iframe/background-images/1620214336.jpg (added)
	* main/trunk/model-interfaces-dev/whakatohea/iframe/index.html (added)
	* main/trunk/model-interfaces-dev/whakatohea/iframe/wmtb-footer.html
(added) * main/trunk/model-interfaces-dev/whakatohea/iframe/wmtb-header-css-js.xsl (added) * main/trunk/model-interfaces-dev/whakatohea/iframe/wmtb-header.html (added) * main/trunk/model-interfaces-dev/whakatohea/transform/layouts/main.xsl (modified) Supporting iframe files now located within interface area Tue, 14 Jan 2020 21:08:46 GMT davidb [33834] * main/trunk/model-sites-dev/whakatohea/collect/waiata/import/01 (added) * main/trunk/model-sites-dev/whakatohea/collect/waiata/import/01/metadata.xml (added) * main/trunk/model-sites-dev/whakatohea/collect/waiata/import/02 (added) * main/trunk/model-sites-dev/whakatohea/collect/waiata/import/02/metadata.xml (added) * main/trunk/model-sites-dev/whakatohea/collect/waiata/import/03 (added) * main/trunk/model-sites-dev/whakatohea/collect/waiata/import/03/metadata.xml (added) * main/trunk/model-sites-dev/whakatohea/collect/waiata/import/04 (added) * main/trunk/model-sites-dev/whakatohea/collect/waiata/import/04/metadata.xml (added) * main/trunk/model-sites-dev/whakatohea/collect/waiata/import/05 (added) * main/trunk/model-sites-dev/whakatohea/collect/waiata/import/05/metadata.xml (added) * main/trunk/model-sites-dev/whakatohea/collect/waiata/import/06 (added) * main/trunk/model-sites-dev/whakatohea/collect/waiata/import/06/metadata.xml (added) * main/trunk/model-sites-dev/whakatohea/collect/waiata/import/metadata.xml (added) Metadata shell ready for download of demonstration source content files