#
# ChangeLog for /
#
# Generated by Trac 1.4.2
# 2024-04-28T06:58:09+12:00

Sat, 14 Mar 2020 04:51:52 GMT davidb [34060]
	* main/trunk/model-interfaces-dev/atea-deprecated (moved)

	Deprecating this version of the Atea interface

Sat, 14 Mar 2020 04:48:29 GMT davidb [34059]
	* main/trunk/model-sites-dev/atea/resources/OBSOLETE/siteConfig_es.properties (moved)
	* main/trunk/model-sites-dev/atea/resources/OBSOLETE/siteConfig_fr.properties (moved)
	* main/trunk/model-sites-dev/atea/resources/OBSOLETE/siteConfig_gu.properties (moved)
	* main/trunk/model-sites-dev/atea/resources/OBSOLETE/siteConfig_ja.properties (moved)
	* main/trunk/model-sites-dev/atea/resources/OBSOLETE/siteConfig_pl.properties (moved)
	* main/trunk/model-sites-dev/atea/resources/siteConfig.properties (modified)

	Moving the original localhost _LL.properties files out of the way for ...

Sat, 14 Mar 2020 04:47:00 GMT davidb [34058]
	* main/trunk/model-sites-dev/atea/resources/OBSOLETE (added)

	To be used to hold the original localhost _LL.properties files

Sat, 14 Mar 2020 04:41:46 GMT davidb [34057]
	* main/trunk/model-sites-dev/atea/siteConfig.xml (modified)

	Updated reference to glTF.png icon

Sat, 14 Mar 2020 04:40:47 GMT davidb [34056]
	* main/trunk/model-sites-dev/atea/images (added)
	* main/trunk/model-sites-dev/atea/images/igltf.png (added)
	* main/trunk/model-sites-dev/atea/images/igltf64.png (added)

	Moved from interface location, as referenced in the siteConfig.xml file

Sat, 14 Mar 2020 04:39:58 GMT davidb [34055]
	* main/trunk/model-sites-dev/atea/collect/he-herenga-korero/etc (modified)

	files to ignore

Sat, 14 Mar 2020 04:38:15 GMT davidb [34054]
	* main/trunk/model-sites-dev/atea/collect/he-herenga-korero (modified)
	* main/trunk/model-sites-dev/atea/collect/he-herenga-korero/etc/collectionConfig.xml (modified)

	Directories to ignore

Sat, 14 Mar 2020 04:36:28 GMT davidb [34053]
	* main/trunk/model-sites-dev/atea/collect/global-digital-heritage/etc/oai-inf.jdb (deleted)
	* main/trunk/model-sites-dev/atea/collect/global-digital-heritage/etc/oai-inf.jdb.bak (deleted)
	* main/trunk/model-sites-dev/atea/collect/global-digital-heritage/etc/oai-inf.lg (deleted)

	Don't want these under SVN control in a collection that starts with ...

Sat, 14 Mar 2020 04:34:49 GMT davidb [34052]
	* main/trunk/model-sites-dev/atea/collect/global-digital-heritage/etc/collectionConfig.bak (deleted)

	Not working with GLIL, so having such a file will only increasingly ...

Sat, 14 Mar 2020 04:26:45 GMT davidb [34051]
	* main/trunk/model-sites-dev/atea/collect/global-digital-heritage/etc (modified)
	* main/trunk/model-sites-dev/atea/collect/global-digital-heritage/etc/oai-inf.jdb (modified)
	* main/trunk/model-sites-dev/atea/collect/global-digital-heritage/etc/oai-inf.jdb.bak (modified)

	Update on files to ignore

Sat, 14 Mar 2020 04:23:17 GMT davidb [34050]
	* main/trunk/model-sites-dev/atea/collect/global-digital-heritage/import (modified)

	Once the zip files are added in, do not want svn to report they files ...

Sat, 14 Mar 2020 04:19:26 GMT davidb [34049]
	* main/trunk/model-sites-dev/atea/collect/global-digital-heritage/etc (modified)

	files to ignore

Sat, 14 Mar 2020 04:18:37 GMT davidb [34048]
	* main/trunk/model-sites-dev/atea/collect/global-digital-heritage (modified)

	directories to ignore

Sat, 14 Mar 2020 04:15:10 GMT davidb [34047]
	* main/trunk/model-sites-dev/atea/collect/digital-nz (modified)

	Further directory to ignore

Sat, 14 Mar 2020 04:10:45 GMT davidb [34046]
	* main/trunk/model-sites-dev/atea/collect/he-herenga-korero-mp3s (modified)

	Some directories to ignore

Sat, 14 Mar 2020 04:09:56 GMT davidb [34045]
	* main/trunk/model-sites-dev/atea/collect/he-herenga-korero-mp3s/etc (modified)

	Some files to ignore

Sat, 14 Mar 2020 04:08:36 GMT davidb [34044]
	* main/trunk/model-sites-dev/atea/collect/he-herenga-korero-pdfs (modified)

	Some directories to ignore

Sat, 14 Mar 2020 04:07:25 GMT davidb [34043]
	* main/trunk/model-sites-dev/atea/collect/he-herenga-korero-pdfs/etc (modified)

	Some files to ignore

Sat, 14 Mar 2020 04:06:16 GMT davidb [34042]
	* main/trunk/model-sites-dev/atea/collect/he-herenga-korero-pdfs/ACTIVATE.sh (added)
	* main/trunk/model-sites-dev/atea/collect/he-herenga-korero-pdfs/BUILDCOL.sh (added)
	* main/trunk/model-sites-dev/atea/collect/he-herenga-korero-pdfs/CHECKOUT-PDFS.sh (added)
	* main/trunk/model-sites-dev/atea/collect/he-herenga-korero-pdfs/IMPORT.sh (added)
	* main/trunk/model-sites-dev/atea/collect/he-herenga-korero-pdfs/RECONFIGURE.sh (added)
	* main/trunk/model-sites-dev/atea/collect/he-herenga-korero-pdfs/etc (added)
	* main/trunk/model-sites-dev/atea/collect/he-herenga-korero-pdfs/etc/collectionConfig.xml (added)
	* main/trunk/model-sites-dev/atea/collect/he-herenga-korero-pdfs/etc/fail.log (added)
	* main/trunk/model-sites-dev/atea/collect/he-herenga-korero-pdfs/etc/hierarchy-cdnum-track.txt (added)
	* main/trunk/model-sites-dev/atea/collect/he-herenga-korero-pdfs/etc/hierarchy-volume-issue.txt (added)
	* main/trunk/model-sites-dev/atea/collect/he-herenga-korero-pdfs/etc/oai-inf.jdb (added)
	* main/trunk/model-sites-dev/atea/collect/he-herenga-korero-pdfs/etc/oai-inf.jdb.bak (added)
	* main/trunk/model-sites-dev/atea/collect/he-herenga-korero-pdfs/etc/oai-inf.lg (added)
	* main/trunk/model-sites-dev/atea/collect/he-herenga-korero-pdfs/images (added)
	* main/trunk/model-sites-dev/atea/collect/he-herenga-korero-pdfs/script (added)
	* main/trunk/model-sites-dev/atea/collect/he-herenga-korero-pdfs/style (added)
	* main/trunk/model-sites-dev/atea/collect/he-herenga-korero-pdfs/tmp (added)

	Initial set of files

Sat, 14 Mar 2020 04:06:00 GMT davidb [34041]
	* main/trunk/model-sites-dev/atea/collect/he-herenga-korero-mp3s/ACTIVATE.sh (added)
	* main/trunk/model-sites-dev/atea/collect/he-herenga-korero-mp3s/BUILDCOL.sh (added)
	* main/trunk/model-sites-dev/atea/collect/he-herenga-korero-mp3s/CHECKOUT-AUDIO.sh (added)
	* main/trunk/model-sites-dev/atea/collect/he-herenga-korero-mp3s/IMPORT.sh (added)
	* main/trunk/model-sites-dev/atea/collect/he-herenga-korero-mp3s/RECONFIGURE.sh (added)
	* main/trunk/model-sites-dev/atea/collect/he-herenga-korero-mp3s/etc (added)
	* main/trunk/model-sites-dev/atea/collect/he-herenga-korero-mp3s/etc/collectionConfig.xml (added)
	* main/trunk/model-sites-dev/atea/collect/he-herenga-korero-mp3s/etc/fail.log (added)
	* main/trunk/model-sites-dev/atea/collect/he-herenga-korero-mp3s/etc/hierarchy-cdnum-track.txt (added)
	* main/trunk/model-sites-dev/atea/collect/he-herenga-korero-mp3s/etc/hierarchy-volume-issue.txt (added)
	* main/trunk/model-sites-dev/atea/collect/he-herenga-korero-mp3s/etc/oai-inf.jdb (added)
	* main/trunk/model-sites-dev/atea/collect/he-herenga-korero-mp3s/etc/oai-inf.jdb.bak (added)
	* main/trunk/model-sites-dev/atea/collect/he-herenga-korero-mp3s/etc/oai-inf.lg (added)
	* main/trunk/model-sites-dev/atea/collect/he-herenga-korero-mp3s/images (added)
	* main/trunk/model-sites-dev/atea/collect/he-herenga-korero-mp3s/script (added)
	* main/trunk/model-sites-dev/atea/collect/he-herenga-korero-mp3s/style (added)

	Initial set of files

Sat, 14 Mar 2020 04:00:47 GMT davidb [34040]
	* main/trunk/model-sites-dev/atea/collect/he-herenga-korero-mp3s (added)

	Top-level folder for MP3s collection sourced from hemi-dl

Sat, 14 Mar 2020 04:00:22 GMT davidb [34039]
	* main/trunk/model-sites-dev/atea/collect/he-herenga-korero-pdfs (added)

	Top-level folder for PDF collection sourced from hemi-dl

Sat, 14 Mar 2020 03:54:00 GMT davidb [34038]
	* main/trunk/model-sites-dev/atea/collect/digital-nz/PREPARE.sh (added)

	Run this script to populate the import folder

Sat, 14 Mar 2020 03:52:19 GMT davidb [34037]
	* main/trunk/model-sites-dev/atea/collect/digital-nz/import (deleted)

	No longer needed as will be formed by untarring import.tar.gz

Sat, 14 Mar 2020 03:52:15 GMT davidb [34036]
	* main/trunk/model-sites-dev/atea/collect/digital-nz/import.tar.gz (added)

	import folder with the result of running the NZ Digital API searching ...

Sat, 14 Mar 2020 03:49:53 GMT davidb [34035]
	* main/trunk/model-sites-dev/atea/collect/digital-nz (modified)

	Directories to ignore

Sat, 14 Mar 2020 03:32:51 GMT davidb [34034]
	* main/trunk/model-sites-dev/atea/collect/digital-nz/etc (modified)

	Some files to ignore

Sat, 14 Mar 2020 03:32:08 GMT davidb [34033]
	* main/trunk/model-sites-dev/atea/collect/digital-nz/etc/conf/schema.xml (modified)

	Solr schema.xml changes that have flowed through from changes in the ...

Sat, 14 Mar 2020 03:25:49 GMT davidb [34032]
	* main/trunk/model-sites-dev/atea/collect/voxelvid/etc (modified)

	Files to ignore

Sat, 14 Mar 2020 03:24:30 GMT davidb [34031]
	* main/trunk/model-sites-dev/atea/collect/voxelvid/ACTIVATE.sh (added)
	* main/trunk/model-sites-dev/atea/collect/voxelvid/BUILDCOL.sh (added)
	* main/trunk/model-sites-dev/atea/collect/voxelvid/IMPORT.sh (added)

	Dedicated scripts for this collection to build and activate it

Sat, 14 Mar 2020 03:23:07 GMT davidb [34030]
	* main/trunk/model-sites-dev/atea/collect/voxelvid (modified)

	Ignore archives and index dirs

Sat, 14 Mar 2020 03:15:07 GMT davidb [34029]
	* main/trunk/model-interfaces-dev/alt-atea/images/igltf.png (deleted)
	* main/trunk/model-interfaces-dev/alt-atea/images/igltf64.png (deleted)

	Moved to atea site

Fri, 13 Mar 2020 10:19:46 GMT davidb [34028]
	* main/trunk/model-interfaces-dev/alt-atea/transform/layouts/main.xsl (modified)

	Tweaks to overall interface look-and-feel

Fri, 13 Mar 2020 10:19:12 GMT davidb [34027]
	* main/trunk/model-interfaces-dev/alt-atea/style/custom.css (modified)

	Tweaks to overall interface look-and-feel

Fri, 13 Mar 2020 10:18:36 GMT davidb [34026]
	* main/trunk/model-interfaces-dev/alt-atea/style/themes (added)
	* main/trunk/model-interfaces-dev/alt-atea/style/themes/Aristo (added)
	* main/trunk/model-interfaces-dev/alt-atea/style/themes/Aristo/Aristo.css (added)
	* main/trunk/model-interfaces-dev/alt-atea/style/themes/Aristo/images (added)
	* main/trunk/model-interfaces-dev/alt-atea/style/themes/Aristo/images/bg_fallback.png (added)
	* main/trunk/model-interfaces-dev/alt-atea/style/themes/Aristo/images/diamond-background.png (added)
	* main/trunk/model-interfaces-dev/alt-atea/style/themes/Aristo/images/icon_sprite.png (added)
	* main/trunk/model-interfaces-dev/alt-atea/style/themes/Aristo/images/noise-background.png (added)
	* main/trunk/model-interfaces-dev/alt-atea/style/themes/Aristo/images/progress_bar.gif (added)
	* main/trunk/model-interfaces-dev/alt-atea/style/themes/Aristo/images/slider_handles.png (added)
	* main/trunk/model-interfaces-dev/alt-atea/style/themes/Aristo/images/ui-icons_222222_256x240.png (added)
	* main/trunk/model-interfaces-dev/alt-atea/style/themes/Aristo/images/ui-icons_454545_256x240.png (added)

	Used to provide the gray jquery-ui theme to Atea

Fri, 13 Mar 2020 10:16:56 GMT davidb [34025]
	* main/trunk/model-interfaces-dev/alt-atea/images/igltf.png (added)
	* main/trunk/model-interfaces-dev/alt-atea/images/igltf64.png (added)

	Icon to glTF 3D model/zip files

Fri, 13 Mar 2020 10:15:26 GMT davidb [34024]
	* main/trunk/model-sites-dev/atea/collect/global-digital-heritage/README.txt (added)
	* main/trunk/model-sites-dev/atea/collect/global-digital-heritage/images (added)

	Couple of over-looked files for the initial set of files for Global ...

Fri, 13 Mar 2020 10:14:45 GMT davidb [34023]
	* main/trunk/model-sites-dev/atea/collect/global-digital-heritage/ACTIVATE.sh (added)
	* main/trunk/model-sites-dev/atea/collect/global-digital-heritage/BUILDCOL.sh (added)
	* main/trunk/model-sites-dev/atea/collect/global-digital-heritage/IMPORT.sh (added)
	* main/trunk/model-sites-dev/atea/collect/global-digital-heritage/PREPARE.sh (added)
	* main/trunk/model-sites-dev/atea/collect/global-digital-heritage/RECONFIGURE.sh (added)
	* main/trunk/model-sites-dev/atea/collect/global-digital-heritage/etc (added)
	* main/trunk/model-sites-dev/atea/collect/global-digital-heritage/etc/collectionConfig.bak (added)
	* main/trunk/model-sites-dev/atea/collect/global-digital-heritage/etc/collectionConfig.xml (added)
	* main/trunk/model-sites-dev/atea/collect/global-digital-heritage/etc/fail.log (added)
	* main/trunk/model-sites-dev/atea/collect/global-digital-heritage/etc/oai-inf.jdb (added)
	* main/trunk/model-sites-dev/atea/collect/global-digital-heritage/etc/oai-inf.jdb.bak (added)
	* main/trunk/model-sites-dev/atea/collect/global-digital-heritage/etc/oai-inf.lg (added)
	* main/trunk/model-sites-dev/atea/collect/global-digital-heritage/import (added)
	* main/trunk/model-sites-dev/atea/collect/global-digital-heritage/import/metadata.xml (added)
	* main/trunk/model-sites-dev/atea/collect/global-digital-heritage/metadata (added)
	* main/trunk/model-sites-dev/atea/collect/global-digital-heritage/metadata/dublin.mds (added)
	* main/trunk/model-sites-dev/atea/collect/global-digital-heritage/metadata/ex.mds (added)
	* main/trunk/model-sites-dev/atea/collect/global-digital-heritage/metadata/greenstone.mds (added)
	* main/trunk/model-sites-dev/atea/collect/global-digital-heritage/metadata/profile.xml (added)
	* main/trunk/model-sites-dev/atea/collect/global-digital-heritage/pre-import (added)
	* main/trunk/model-sites-dev/atea/collect/global-digital-heritage/pre-import/sketchfab (added)
	* main/trunk/model-sites-dev/atea/collect/global-digital-heritage/pre-import/sketchfab/README.txt (added)
	* main/trunk/model-sites-dev/atea/collect/global-digital-heritage/script (added)
	* main/trunk/model-sites-dev/atea/collect/global-digital-heritage/style (added)

	Initial set of files for Global Digital Heritage glTF demonstration ...

Fri, 13 Mar 2020 10:09:03 GMT davidb [34022]
	* main/trunk/model-sites-dev/atea/collect/global-digital-heritage (added)

	Collection for demonstration VR model artefacts

Thu, 12 Mar 2020 04:22:16 GMT davidb [34021]
	* main/trunk/greenstone3/build.xml (modified)

	Tidy up on help/usage message

Thu, 12 Mar 2020 04:20:12 GMT davidb [34020]
	* main/trunk/greenstone3/build.xml (modified)

	Changed to using newer version (8.5.51) of Tomcat

Thu, 12 Mar 2020 02:04:42 GMT kjdon [34019]
	* main/trunk/greenstone3/src/java/org/greenstone/gsdl3/collection/Collection.java (modified)

	replaced a couple of text strings

Thu, 12 Mar 2020 00:42:39 GMT kjdon [34018]
	* main/trunk/greenstone3/src/java/org/greenstone/gsdl3/action/SystemAction.java (modified)

	check for error element in response - add that in if present, instead ...

Thu, 12 Mar 2020 00:41:25 GMT kjdon [34017]
	* main/trunk/greenstone3/src/java/org/greenstone/gsdl3/core/MessageRouter.java (modified)

	add error element, don't just print a message to log, if we have ...

Thu, 12 Mar 2020 00:32:51 GMT kjdon [34016]
	* main/trunk/greenstone2/bin/script/explode_metadata_database.pl (modified)

	added cpan folder to @INC, as something is expecting to find JSON.pm ...

Tue, 10 Mar 2020 08:03:24 GMT davidb [34015]
	* main/trunk/model-interfaces-dev/alt-atea/transform/layouts/header.xsl (modified)
	* main/trunk/model-interfaces-dev/alt-atea/transform/layouts/main.xsl (modified)

	Further elimination of PJ related HTML/templates

Tue, 10 Mar 2020 08:02:24 GMT davidb [34014]
	* main/trunk/model-interfaces-dev/alt-atea/transform/pages/document.xsl (modified)

	Added in video player template; remove PJ templates

Tue, 10 Mar 2020 08:01:28 GMT davidb [34013]
	* main/trunk/model-interfaces-dev/alt-atea/transform/pages/home.xsl (modified)

	Added hr line to break up sections

Tue, 10 Mar 2020 07:53:26 GMT davidb [34012]
	* main/trunk/model-interfaces-dev/alt-atea/images (added)
	* main/trunk/model-interfaces-dev/alt-atea/images/Atea-background-topline.png (added)
	* main/trunk/model-interfaces-dev/alt-atea/images/Atea-homeBG.jpg (added)
	* main/trunk/model-interfaces-dev/alt-atea/images/Atea-homeBG.xcf (added)
	* main/trunk/model-interfaces-dev/alt-atea/images/Atea-logo.png (added)
	* main/trunk/model-interfaces-dev/alt-atea/images/Atea-logo.xcf (added)
	* main/trunk/model-interfaces-dev/alt-atea/images/background.jpg (added)

	Images for Atea alt interface

Tue, 10 Mar 2020 07:45:18 GMT ak19 [34011]
	* other-projects/maori-lang-detection/mongodb-data/pieChart4a_sitesPreparedForCrawling.png (added)
	* other-projects/maori-lang-detection/mongodb-data/pieChart4b_sitesPreparedForCrawling.svg (added)
	* other-projects/maori-lang-detection/mongodb-data/pieChart4c_screenshotSitesPreparedForCrawling.png (added)
	* other-projects/maori-lang-detection/mongodb-data/pieChart5a_sitesPreparedForCrawling.png (added)
	* other-projects/maori-lang-detection/mongodb-data/pieChart5b_sitesPreparedForCrawling.svg (added)
	* other-projects/maori-lang-detection/mongodb-data/pieChart5c_screenshotSitesPreparedForCrawling.png (added)
	* other-projects/maori-lang-detection/mongodb-data/piechart_data.txt (modified)
	* other-projects/maori-lang-detection/mongodb-data/piechart_data2.txt (modified)

	Piechart data for sites prepared for crawling and the piecharts for these

Tue, 10 Mar 2020 07:45:02 GMT davidb [34010]
	* main/trunk/greenstone3/web/interfaces/default/images/imp4.png (added)

	icon image for MP4 video

Tue, 10 Mar 2020 07:25:28 GMT davidb [34009]
	* main/trunk/model-interfaces-dev/alt-atea/interfaceConfig.xml (added)
	* main/trunk/model-interfaces-dev/alt-atea/style (added)
	* main/trunk/model-interfaces-dev/alt-atea/style/custom.css (added)
	* main/trunk/model-interfaces-dev/alt-atea/transform (added)
	* main/trunk/model-interfaces-dev/alt-atea/transform/layouts (added)
	* main/trunk/model-interfaces-dev/alt-atea/transform/layouts/header.xsl (added)
	* main/trunk/model-interfaces-dev/alt-atea/transform/layouts/main.xsl (added)
	* main/trunk/model-interfaces-dev/alt-atea/transform/pages (added)
	* main/trunk/model-interfaces-dev/alt-atea/transform/pages/classifier.xsl (added)
	* main/trunk/model-interfaces-dev/alt-atea/transform/pages/document.xsl (added)
	* main/trunk/model-interfaces-dev/alt-atea/transform/pages/home.xsl (added)
	* main/trunk/model-interfaces-dev/alt-atea/transform/pages/query.xsl (added)
	* main/trunk/model-interfaces-dev/alt-atea/wrapper-style (added)
	* main/trunk/model-interfaces-dev/alt-atea/wrapper-style/default.css (added)
	* main/trunk/model-interfaces-dev/alt-atea/wrapper-style/favicon.ico (added)
	* main/trunk/model-interfaces-dev/alt-atea/wrapper-style/images (added)
	* main/trunk/model-interfaces-dev/alt-atea/wrapper-style/images/Atea-homeBG.jpg (added)
	* main/trunk/model-interfaces-dev/alt-atea/wrapper-style/images/Atea-homeBG.xcf (added)
	* main/trunk/model-interfaces-dev/alt-atea/wrapper-style/images/bodyBG.jpg (added)
	* main/trunk/model-interfaces-dev/alt-atea/wrapper-style/large.css (added)
	* main/trunk/model-interfaces-dev/alt-atea/wrapper-style/medium.css (added)
	* main/trunk/model-interfaces-dev/alt-atea/wrapper-style/print.css (added)
	* main/trunk/model-interfaces-dev/alt-atea/wrapper-style/small.css (added)

	PJ based alternative interface for Atea

Tue, 10 Mar 2020 07:17:26 GMT davidb [34008]
	* main/trunk/model-interfaces-dev/alt-atea (added)

	Alternative interface look-and-feel for the Atea project

Tue, 10 Mar 2020 06:56:01 GMT ak19 [34007]
	* other-projects/maori-lang-detection/mongodb-data/pieChart2a_CrawledWebPages_EmptyVsInMongoDB.png (added)
	* other-projects/maori-lang-detection/mongodb-data/pieChart2b_CrawledWebPages_EmptyVsInMongoDB.svg (added)
	* other-projects/maori-lang-detection/mongodb-data/pieChart3a_SimplerCrawledWebPages_EmptyVsInMongoDB.png (added)
	* other-projects/maori-lang-detection/mongodb-data/pieChart3b_SimplerCrawledWebPages_EmptyVsInMongoDB.svg (added)
	* other-projects/maori-lang-detection/mongodb-data/pieChart3c_screemshot_SimplerCrawledWebPages_EmptyVsInMongoDB.png (added)
	* other-projects/maori-lang-detection/mongodb-data/piechart_data.txt (modified)
	* other-projects/maori-lang-detection/mongodb-data/piechart_data2.txt (modified)

	Prepared more data for the piecharts. This time for empty web pages ...

Tue, 10 Mar 2020 05:51:05 GMT ak19 [34006]
	* other-projects/maori-lang-detection/mongodb-data/pieChart01a_seedURLsForCrawling.png (added)
	* other-projects/maori-lang-detection/mongodb-data/pieChart01b_obtainingSeedURLs.png (added)
	* other-projects/maori-lang-detection/mongodb-data/pieChart01c_obtainingSeedURLs.svg (added)
	* other-projects/maori-lang-detection/mongodb-data/piechart_data.txt (modified)
	* other-projects/maori-lang-detection/mongodb-data/piechart_data2.txt (added)

	Committing more data I've collected for generating pie charts and the ...

Tue, 10 Mar 2020 04:33:20 GMT ak19 [34005]
	* other-projects/maori-lang-detection/src/org/greenstone/atea/NutchTextDumpToMongoDB.java (modified)

	InfoOnEmptyPagesNotInMongoDB.txt is now written out to a file, ...

Tue, 10 Mar 2020 04:27:07 GMT ak19 [34004]
	* other-projects/maori-lang-detection/mongodb-data/InfoOnEmptyPagesNotInMongoDB.csv (moved)
	* other-projects/maori-lang-detection/mongodb-data/piechart_data.txt (modified)

	Renaming csv file to have csv extension

Tue, 10 Mar 2020 04:26:45 GMT ak19 [34003]
	* other-projects/maori-lang-detection/mongodb-data/InfoOnEmptyPagesNotInMongoDB.txt (modified)

	Redid the file with info on empty URL web pages as a csv file with ...

Mon, 09 Mar 2020 23:09:23 GMT davidb [34002]
	* main/trunk/greenstone3/resources/tomcat/web8.xml.svn (modified)

	Comment-based changes resulting from: (i) merging in differences from ...

Mon, 09 Mar 2020 05:56:00 GMT ak19 [34001]
	* other-projects/maori-lang-detection/mongodb-data/piechart_data.txt (modified)

	Tentative total urls from common crawl 12 month crawl data.

Mon, 09 Mar 2020 05:55:01 GMT ak19 [34000]
	* other-projects/maori-lang-detection/src/org/greenstone/atea/AllDomainCount.java (modified)
	* other-projects/maori-lang-detection/src/org/greenstone/atea/CountryCodeCountsMapData.java (modified)
	* other-projects/maori-lang-detection/src/org/greenstone/atea/Utility.java (modified)

	Some debugging and other minor changes

Mon, 09 Mar 2020 04:34:10 GMT ak19 [33999]
	* other-projects/maori-lang-detection/mongodb-data/piechart_data.txt (modified)

	Common crawl 12 month urls and CC provided stats

Fri, 06 Mar 2020 04:49:49 GMT davidb [33998]
	* main/trunk/gli/src/org/greenstone/gatherer/feedback/CompListener.java (modified)
	* main/trunk/gli/src/org/greenstone/gatherer/feedback/ComponentInformation.java (modified)

	Removed import statement that is no longer used, and was stopping ...

Fri, 06 Mar 2020 02:55:44 GMT davidb [33997]
	* gs3-extensions/mars-src (added)
	* gs3-extensions/mars-src/trunk (added)

	Top-level folder for MARS related Greenstone3 code

Fri, 06 Mar 2020 02:18:11 GMT ak19 [33996]
	* main/trunk/release-kits/shared/linux/XML-Parser/64-bit/perl-5.18/auto/XML/Parser/Expat/Expat.so (added)
	* main/trunk/release-kits/shared/linux/XML-Parser/64-bit/perl-5.18/perllocal.pod (modified)

	Accidentally committed the wrong thing in previous commit. Attempting ...

Fri, 06 Mar 2020 02:14:51 GMT ak19 [33995]
	* main/trunk/release-kits/shared/linux/XML-Parser/64-bit/perl-5.18/perllocal.pod (modified)

	There was no Expat.so for perl 5.18 so am recompiling and committing that

Tue, 03 Mar 2020 01:42:14 GMT davidb [33994]
	* main/trunk/greenstone3/src/java/org/greenstone/gsdl3/util/Dictionary.java (modified)
	* main/trunk/greenstone3/src/java/org/greenstone/gsdl3/util/UTF8Control.java (added)

	The introduction of UTF8Control class means we can now work directly ...

Mon, 02 Mar 2020 01:10:20 GMT kjdon [33993]
	* main/trunk/greenstone3/src/java/org/greenstone/gsdl3/core/URLFilter.java (modified)

	when downloading a pdf, browsers seem to make more than one request - ...

Sun, 01 Mar 2020 03:41:35 GMT davidb [33992]
	* main/trunk/greenstone3/resources/tomcat/server_tomcat8.xml.svn (modified)

	Notes at start of file updated

Sun, 01 Mar 2020 03:35:09 GMT davidb [33991]
	* main/trunk/greenstone3/resources/tomcat/server_tomcat8.xml.svn (modified)

	A version of the tomcat/conf/server.xml file that is better aligned ...

Sun, 01 Mar 2020 03:29:34 GMT davidb [33990]
	* main/trunk/greenstone3/resources/tomcat/server_tomcat8.xml.svn (modified)

	Some white-space changes for consistency with newer ...

Sun, 01 Mar 2020 02:16:01 GMT davidb [33989]
	* main/trunk/greenstone3/resources/tomcat/server_tomcat8.xml.svn (modified)

	In a default setup, AJP is not used => so not needed. Commented out ...

Fri, 28 Feb 2020 09:09:15 GMT ak19 [33988]
	* other-projects/maori-lang-detection/src/org/greenstone/atea/NutchTextDumpToMongoDB.java (modified)

	1. Print out which web pages of which web site's dump.txt were empty. ...

Fri, 28 Feb 2020 09:08:08 GMT ak19 [33987]
	* other-projects/maori-lang-detection/mongodb-data/InfoOnEmptyPagesNotInMongoDB.txt (added)

	Output of re-running NutchTextDumpToMongoDB to print out which web ...

Fri, 28 Feb 2020 09:07:29 GMT ak19 [33986]
	* other-projects/maori-lang-detection/mongodb-data/piechart_data.txt (modified)

	Dr Bainbridge investigated the original data set more

Thu, 27 Feb 2020 08:49:00 GMT ak19 [33985]
	* other-projects/maori-lang-detection/mongodb-data/piechart_data.txt (added)

	Data to back the piechart I need to make that will illustrate how we ...

Thu, 27 Feb 2020 08:44:06 GMT ak19 [33984]
	* other-projects/maori-lang-detection/src/org/greenstone/atea/AllDomainCount.java (added)

	Simple class to summarise some basic counts of the input common crawl ...

Thu, 27 Feb 2020 07:26:53 GMT ak19 [33983]
	* other-projects/maori-lang-detection/src/org/greenstone/atea/CCWETProcessor.java (modified)
	* other-projects/maori-lang-detection/src/org/greenstone/atea/NutchTextDumpToMongoDB.java (modified)

	More sensible name for method which had too long kept its old name ...

Wed, 26 Feb 2020 08:59:55 GMT ak19 [33982]
	* other-projects/maori-lang-detection/src/org/greenstone/atea/SummaryTool.java (modified)

	SummaryTool.java now processed the handcrafted UNIQUE domains counts ...

Wed, 26 Feb 2020 08:19:23 GMT ak19 [33981]
	* other-projects/maori-lang-detection/src/org/greenstone/atea/CountryCodeCountsMapData.java (modified)
	* other-projects/maori-lang-detection/src/org/greenstone/atea/SummaryTool.java (modified)

	As Dr Bainbridge suggested, code now opens a new firefox tab with a ...

Wed, 26 Feb 2020 08:11:58 GMT ak19 [33980]
	* other-projects/maori-lang-detection/mongodb-data/6counts_sitesWithPagesContainingMRI_manualShortlist.json (modified)

	Additional comments

Wed, 26 Feb 2020 08:00:38 GMT ak19 [33979]
	* other-projects/maori-lang-detection/mongodb-data/6counts_sitesWithPagesContainingMRI_manualShortlist.json (modified)

	Clearly stating that counts are of unique domains

Wed, 26 Feb 2020 06:57:05 GMT ak19 [33978]
	* other-projects/maori-lang-detection/src/org/greenstone/atea/CountryCodeCountsMapData.java (modified)
	* other-projects/maori-lang-detection/src/org/greenstone/atea/SummaryTool.java (modified)

	Opens all geoJSON maps in new tabs instead of waiting for user to ...

Wed, 26 Feb 2020 05:37:08 GMT ak19 [33977]
	* other-projects/maori-lang-detection/mongodb-data/random260_results.txt (modified)

	Added something on precision vs recall being applicable to our ...

Wed, 26 Feb 2020 05:28:09 GMT ak19 [33976]
	* other-projects/maori-lang-detection/mongodb-data/random260_results.txt (modified)

	Adding in what I could remember of Dr Bainbridge's statement about ...

Tue, 25 Feb 2020 01:46:51 GMT kjdon [33975]
	* main/trunk/greenstone3/build.xml (modified)

	some mods to do with allowing multiple oaiservers. need OAIConfig- ...

Tue, 25 Feb 2020 01:14:52 GMT kjdon [33974]
	* main/trunk/greenstone3/build.properties.svn (modified)

	added in new oai.servlets field - if you want to run two oaiservlets, ...

Tue, 25 Feb 2020 01:01:18 GMT kjdon [33973]
	* main/trunk/greenstone3/web/WEB-INF/web.xml (modified)

	tidied up the file a bit. added new servlet_url param to oaiserver - ...

Tue, 25 Feb 2020 00:47:48 GMT kjdon [33972]
	* main/trunk/greenstone3/src/java/org/greenstone/gsdl3/service/OAIPMH.java (modified)

	fixed a typo in a comment

Tue, 25 Feb 2020 00:47:12 GMT kjdon [33971]
	* main/trunk/greenstone3/src/java/org/greenstone/gsdl3/OAIServer.java (modified)

	get servlet_url param and pass to getOAIConfigXML, as now the files ...

Tue, 25 Feb 2020 00:46:03 GMT kjdon [33970]
	* main/trunk/greenstone3/src/java/org/greenstone/gsdl3/util/OAIXML.java (modified)

	changed OAIConfig naming to OAIConfig-oaiserver.xml - so multiple ...

Tue, 25 Feb 2020 00:39:10 GMT kjdon [33969]
	* main/trunk/greenstone3/src/java/org/greenstone/gsdl3/util/OAIXML.java (modified)

	we no longer use OAIConfig.xml as the filename, now we use eg ...

Tue, 25 Feb 2020 00:37:20 GMT kjdon [33968]
	* main/trunk/greenstone3/src/java/org/greenstone/gsdl3/core/OAIMessageRouter.java (modified)

	pass in oai_config from server, rather than reading it in itself

Tue, 25 Feb 2020 00:36:08 GMT kjdon [33967]
	* main/trunk/greenstone3/src/java/org/greenstone/gsdl3/service/OAIPMH.java (modified)

	you might want to change the oaiserver url, eg if you have 2 oai ...

Fri, 21 Feb 2020 08:00:55 GMT ak19 [33966]
	* other-projects/maori-lang-detection/mongodb-data/random260.ods (added)
	* other-projects/maori-lang-detection/mongodb-data/random260_manualList_globalDomains_whereAPageContainsMRI.txt (modified)
	* other-projects/maori-lang-detection/mongodb-data/random260_results.txt (added)

	Added the origSequence and basicDomain columns to the random 260 web ...

Fri, 21 Feb 2020 07:59:07 GMT ak19 [33965]
	* other-projects/maori-lang-detection/src/org/greenstone/atea/ManualURLInspection.java (modified)

	1. Adding a basicDomain column (stripped of http/https and www ...

Fri, 21 Feb 2020 06:57:38 GMT ak19 [33964]
	* other-projects/maori-lang-detection/mongodb-data/random260_manualList_globalDomains_whereAPageContainsMRI.txt (modified)

	2 records were missing a value for the qualityLevel column.

Thu, 20 Feb 2020 09:12:43 GMT ak19 [33963]
	* other-projects/maori-lang-detection/src/org/greenstone/atea/ManualURLInspection.java (modified)
	* other-projects/maori-lang-detection/src/org/greenstone/atea/MongoDBQueryer.java (modified)

	Added a new helper method to MongoDBQueryer.java to add numPagesInMRI ...

Thu, 20 Feb 2020 09:07:20 GMT ak19 [33962]
	* other-projects/maori-lang-detection/mongodb-data/random260_manualList_globalDomains_whereAPageContainsMRI.txt (modified)

	2 fields changed, as one was missed out and the other incorrectly ...

Thu, 20 Feb 2020 07:24:19 GMT ak19 [33961]
	* other-projects/maori-lang-detection/src/org/greenstone/atea/ManualURLInspection.java (modified)

	New category, LINK_TEXT, introduced for the random web page URL samples.