# # ChangeLog for / # # Generated by Trac 1.4.2 # 2024-06-07T04:37:37+12:00 Thu, 18 Jun 2020 08:30:04 GMT ak19 [34208] * main/trunk/model-sites-dev/opotiki/collect/gs3tutorials/import/technical/metadata.xml (added) * main/trunk/model-sites-dev/opotiki/collect/gs3tutorials/import/videos/metadata.xml (modified) Forgot to commit the meta for technical docs. And metadata.xml for ... Thu, 18 Jun 2020 07:25:04 GMT ak19 [34207] * main/trunk/model-sites-dev/opotiki/collect/gs3tutorials (added) * main/trunk/model-sites-dev/opotiki/collect/gs3tutorials/etc (added) * main/trunk/model-sites-dev/opotiki/collect/gs3tutorials/etc/collectionConfig.xml (added) * main/trunk/model-sites-dev/opotiki/collect/gs3tutorials/etc/fail.log (added) * main/trunk/model-sites-dev/opotiki/collect/gs3tutorials/etc/oai-inf.jdb (added) * main/trunk/model-sites-dev/opotiki/collect/gs3tutorials/etc/oai-inf.jdb.bak (added) * main/trunk/model-sites-dev/opotiki/collect/gs3tutorials/etc/oai-inf.lg (added) * main/trunk/model-sites-dev/opotiki/collect/gs3tutorials/images (added) * main/trunk/model-sites-dev/opotiki/collect/gs3tutorials/import (added) * main/trunk/model-sites-dev/opotiki/collect/gs3tutorials/import/presentations (added) * main/trunk/model-sites-dev/opotiki/collect/gs3tutorials/import/presentations/metadata.xml (added) * main/trunk/model-sites-dev/opotiki/collect/gs3tutorials/import/technical (added) * main/trunk/model-sites-dev/opotiki/collect/gs3tutorials/import/videos (added) * main/trunk/model-sites-dev/opotiki/collect/gs3tutorials/import/videos/metadata.xml (added) * main/trunk/model-sites-dev/opotiki/collect/gs3tutorials/metadata (added) * main/trunk/model-sites-dev/opotiki/collect/gs3tutorials/metadata/dublin.mds (added) * main/trunk/model-sites-dev/opotiki/collect/gs3tutorials/metadata/ex.mds (added) * main/trunk/model-sites-dev/opotiki/collect/gs3tutorials/metadata/greenstone.mds (added) * main/trunk/model-sites-dev/opotiki/collect/gs3tutorials/script (added) * 
main/trunk/model-sites-dev/opotiki/collect/gs3tutorials/style (added) * main/trunk/model-sites-dev/opotiki/collect/gs3tutorials/tmp (added) Collection design and metadata for the demonstration/test but also ... Thu, 18 Jun 2020 07:22:34 GMT ak19 [34206] * main/trunk/model-sites-dev/opotiki/collect/imagesco/etc/oai-inf.jdb (modified) * main/trunk/model-sites-dev/opotiki/collect/imagesco/etc/oai-inf.jdb.bak (modified) * main/trunk/model-sites-dev/opotiki/collect/textdemo/etc/collectionConfig.xml (modified) * main/trunk/model-sites-dev/opotiki/collect/textdemo/etc/oai-inf.jdb (modified) * main/trunk/model-sites-dev/opotiki/collect/textdemo/etc/oai-inf.jdb.bak (modified) * main/trunk/model-sites-dev/opotiki/collect/waiatade/etc/collectionConfig.xml (modified) * main/trunk/model-sites-dev/opotiki/collect/waiatade/etc/fail.log (modified) * main/trunk/model-sites-dev/opotiki/collect/waiatade/etc/oai-inf.jdb (modified) * main/trunk/model-sites-dev/opotiki/collect/waiatade/etc/oai-inf.jdb.bak (modified) UnknownConverterPlugin configured to use Tika for doc processing Wed, 17 Jun 2020 12:06:23 GMT ak19 [34205] * main/trunk/gli/src/org/greenstone/gatherer/cdm/CollectionConfiguration.java (modified) * main/trunk/gli/src/org/greenstone/gatherer/cdm/CollectionDesignManager.java (modified) * main/trunk/gli/src/org/greenstone/gatherer/collection/CollectionManager.java (modified) * main/trunk/gli/src/org/greenstone/gatherer/gui/ConfigFileEditor.java (modified) * main/trunk/gli/src/org/greenstone/gatherer/gui/GUIManager.java (modified) Collection ConfigFileEditor related changes, bugfixes and ... Wed, 17 Jun 2020 09:20:48 GMT ak19 [34204] * gs2-extensions/tesseract/trunk/README.txt (modified) Another todo. Tue, 16 Jun 2020 08:01:17 GMT ak19 [34203] * gs2-extensions/tesseract/trunk/README.txt (modified) Reminder. 
Tue, 16 Jun 2020 07:43:39 GMT ak19 [34202] * gs2-extensions/tesseract/trunk/src/packages/tmp (deleted) I think the svn:externals is working, so removing the tmp folder Tue, 16 Jun 2020 07:42:06 GMT ak19 [34201] * gs2-extensions/tesseract/trunk/src/packages (modified) Have now attempted to set the svn:externals property on tesseract's ... Tue, 16 Jun 2020 07:27:32 GMT ak19 [34200] * gs2-extensions/tesseract/trunk/src/packages/tmp (added) * gs2-extensions/tesseract/trunk/src/packages/tmp/jasper-1.900.1.tar.gz (moved) * gs2-extensions/tesseract/trunk/src/packages/tmp/jpeg-8b.tar.gz (moved) * gs2-extensions/tesseract/trunk/src/packages/tmp/libpng-1.4.4.tar.gz (moved) * gs2-extensions/tesseract/trunk/src/packages/tmp/tiff-4.0.10.tar.gz (moved) * gs2-extensions/tesseract/trunk/src/packages/tmp/zlib-1.2.7.tar.gz (moved) Going to attempt to set svn externals to grab zlib, libpng, tiff, ... Tue, 16 Jun 2020 07:24:18 GMT ak19 [34199] * gs2-extensions/gstika/trunk/makedists.sh (added) * gs2-extensions/tesseract/trunk/README.txt (modified) * gs2-extensions/tesseract/trunk/makedists.sh (modified) A makedists.sh script for gstika to make the cutdown zip and tarball. ... Tue, 16 Jun 2020 06:59:14 GMT ak19 [34198] * gs2-extensions/tesseract/trunk/README.txt (modified) * gs2-extensions/tesseract/trunk/makedists.sh (added) * gs2-extensions/tesseract/trunk/tesseract-linux-x64.tar.gz (modified) * gs2-extensions/tesseract/trunk/tesseract-linux-x64.zip (added) 1. Added a script to generate the cut-down ('binary only') tesseract ... Tue, 16 Jun 2020 06:15:29 GMT ak19 [34197] * main/trunk/greenstone2/collect/modelcol/etc/collectionConfig.xml (modified) Name of Tika config file for ocr-ing pdfs has been updated. Tue, 16 Jun 2020 06:13:13 GMT ak19 [34196] * gs2-extensions/gstika/trunk/gstika.tar.gz (modified) * gs2-extensions/gstika/trunk/gstika.zip (modified) Updating gstika tarballs too with the latest changes to the tika ... 
Tue, 16 Jun 2020 06:05:13 GMT ak19 [34195] * gs2-extensions/gstika/trunk/java/no-ocr-config.xml (added) * gs2-extensions/gstika/trunk/java/ocr-pdfs-config.xml (moved) Renaming config files so one is configured for OCR-ing PDFs, the ... Tue, 16 Jun 2020 06:03:40 GMT ak19 [34194] * main/trunk/greenstone2/ext/tika/no-ocr-config.xml (added) * main/trunk/greenstone2/ext/tika/ocr-pdfs-config.xml (moved) Renaming config files so one is configured for OCR-ing PDFs, the ... Tue, 16 Jun 2020 05:54:09 GMT ak19 [34193] * gs2-extensions/gstika/trunk/java/tika-config.xml (modified) Further useful links before I rename the tika-config file Tue, 16 Jun 2020 05:53:04 GMT ak19 [34192] * main/trunk/greenstone2/ext/tika/tika-config.xml (modified) Further useful links before I rename the tika-config file Tue, 16 Jun 2020 05:44:17 GMT ak19 [34191] * main/trunk/greenstone2/collect/modelcol/etc/collectionConfig.xml (modified) Model collectionConfig.xml with commented out UnknownConverterPlugin ... Tue, 16 Jun 2020 05:20:50 GMT ak19 [34190] * gs2-extensions/tesseract/trunk/GETTING-OCR-SUPPORT-FOR-MORE-LANGS.txt (added) * gs2-extensions/tesseract/trunk/README.txt (modified) * gs2-extensions/tesseract/trunk/pdf05-notext-ocr-with-tikaTesseract.pdf (added) * gs2-extensions/tesseract/trunk/sample.jpg (deleted) * gs2-extensions/tesseract/trunk/sample.tif (added) * gs2-extensions/tesseract/trunk/src/packages/CASCADE-MAKE/TESSERACT.sh (modified) * gs2-extensions/tesseract/trunk/src/setup.bash (modified) * gs2-extensions/tesseract/trunk/src/setup.bat (modified) * gs2-extensions/tesseract/trunk/tesseract-linux-x64.tar.gz (modified) 1. The tessdata folder was being created when compiling tesseract, ... Tue, 16 Jun 2020 04:32:58 GMT ak19 [34189] * gs2-extensions/tesseract/trunk/src/packages/tessdata-langs.tar.gz (modified) Added osd (orientation and script detection) OCR language support. Said to be the ... 
Tue, 16 Jun 2020 03:23:46 GMT ak19 [34188] * main/trunk/greenstone2/ext/tika/tika-config.xml (added) Tika config file to get Tika+Tesseract to OCR PDFs. This file must be ... Tue, 16 Jun 2020 03:22:04 GMT ak19 [34187] * gs2-extensions/gstika/trunk/GS_TIKA_README.txt (modified) * gs2-extensions/gstika/trunk/gstika.tar.gz (modified) * gs2-extensions/gstika/trunk/gstika.zip (modified) * gs2-extensions/gstika/trunk/java/tika-config.xml (added) Committing the tika-config.xml that sets up Tika's PDFParser and ... Tue, 16 Jun 2020 03:00:39 GMT ak19 [34186] * gs2-extensions/tesseract/trunk/README.txt (modified) * gs2-extensions/tesseract/trunk/src/CASCADE-MAKE.sh (modified) * gs2-extensions/tesseract/trunk/src/packages/CASCADE-MAKE/TESSERACT.sh (modified) * gs2-extensions/tesseract/trunk/tesseract-linux-x64.tar.gz (modified) In order to get tika + tesseract to OCR PDFs (note that tesseract ... Mon, 15 Jun 2020 21:53:16 GMT kjdon [34185] * main/trunk/gli/src/org/greenstone/gatherer/cdm/CollectionConfigXMLReadWrite.java (modified) changed doClassifier to doClassifiers as it was misleading and bugged ... Mon, 15 Jun 2020 12:36:12 GMT ak19 [34184] * gs2-extensions/tesseract/trunk/README.txt (modified) * gs2-extensions/tesseract/trunk/src/packages/TESSERACT-APACHE-LICENSE.txt (added) * gs2-extensions/tesseract/trunk/tesseract-linux-x64.tar.gz (modified) The Leptonica license reminded me I had forgotten to look into the ... Mon, 15 Jun 2020 12:28:21 GMT ak19 [34183] * gs2-extensions/tesseract/trunk/src/packages/LEPTONICA-LICENSE.txt (added) * gs2-extensions/tesseract/trunk/tesseract-linux-x64.tar.gz (modified) Forgot to commit the distinct license of Leptonica. Mon, 15 Jun 2020 11:48:57 GMT ak19 [34182] * gs2-extensions/tesseract/trunk/sample.jpg (added) A sample image to test the built tesseract extension on. After ... 
Mon, 15 Jun 2020 11:40:17 GMT ak19 [34181] * gs2-extensions/tesseract/trunk/README.txt (added) * gs2-extensions/tesseract/trunk/tesseract-linux-x64.tar.gz (added) Committing the cut-down, binaries-only tesseract tarball for x64 ... Mon, 15 Jun 2020 11:16:27 GMT ak19 [34180] * gs2-extensions/tesseract/trunk/src/LinksAndNotesOnCompilingManually.txt (modified) * gs2-extensions/tesseract/trunk/src/devel.bash (modified) * gs2-extensions/tesseract/trunk/src/setup.bash (moved) * gs2-extensions/tesseract/trunk/src/setup.bat (moved) Gnome-lib has setup.bash_old and setup.bat_old, but imagemagick and ... Mon, 15 Jun 2020 10:51:14 GMT ak19 [34179] * gs2-extensions/tesseract/trunk/src (modified) Imitating the gnome-lib gs2-extension by setting the svn externals ... Mon, 15 Jun 2020 10:44:34 GMT ak19 [34178] * gs2-extensions/tesseract (added) * gs2-extensions/tesseract/trunk (added) * gs2-extensions/tesseract/trunk/src (added) * gs2-extensions/tesseract/trunk/src/CASCADE-MAKE.sh (added) * gs2-extensions/tesseract/trunk/src/LinksAndNotesOnCompilingManually.txt (added) * gs2-extensions/tesseract/trunk/src/devel.bash (added) * gs2-extensions/tesseract/trunk/src/packages (added) * gs2-extensions/tesseract/trunk/src/packages/CASCADE-MAKE (added) * gs2-extensions/tesseract/trunk/src/packages/CASCADE-MAKE.sh (added) * gs2-extensions/tesseract/trunk/src/packages/CASCADE-MAKE/JPEG.sh (added) * gs2-extensions/tesseract/trunk/src/packages/CASCADE-MAKE/JPEG2000.sh (added) * gs2-extensions/tesseract/trunk/src/packages/CASCADE-MAKE/LEPTONICA.sh (added) * gs2-extensions/tesseract/trunk/src/packages/CASCADE-MAKE/LIBPNG.sh (added) * gs2-extensions/tesseract/trunk/src/packages/CASCADE-MAKE/LIBTOOL.sh (added) * gs2-extensions/tesseract/trunk/src/packages/CASCADE-MAKE/LIBZ.sh (added) * gs2-extensions/tesseract/trunk/src/packages/CASCADE-MAKE/TESSERACT.sh (added) * gs2-extensions/tesseract/trunk/src/packages/CASCADE-MAKE/TIFF.sh (added) * 
gs2-extensions/tesseract/trunk/src/packages/jasper-1.900.1.tar.gz (added) * gs2-extensions/tesseract/trunk/src/packages/jpeg-8b.tar.gz (added) * gs2-extensions/tesseract/trunk/src/packages/leptonica-1.79.0.tar.gz (added) * gs2-extensions/tesseract/trunk/src/packages/libpng-1.4.4.tar.gz (added) * gs2-extensions/tesseract/trunk/src/packages/libtool-2.4.6.tar.gz (added) * gs2-extensions/tesseract/trunk/src/packages/tessdata-langs.tar.gz (added) * gs2-extensions/tesseract/trunk/src/packages/tesseract-5.0.0.tar.gz (added) * gs2-extensions/tesseract/trunk/src/packages/tiff-4.0.10.tar.gz (added) * gs2-extensions/tesseract/trunk/src/packages/zlib-1.2.7.tar.gz (added) * gs2-extensions/tesseract/trunk/src/setup.bash_old (added) * gs2-extensions/tesseract/trunk/src/setup.bat_old (added) CASCADE-MAKE for Tesseract, the OCR tool. I'm thinking of expanding ... Sun, 14 Jun 2020 15:51:13 GMT ak19 [34177] * gs2-extensions/gstika/trunk/GS_TIKA_README.txt (modified) Minor Sun, 14 Jun 2020 15:34:52 GMT ak19 [34176] * gs2-extensions/gstika/trunk/gstika.tar.gz (added) * gs2-extensions/gstika/trunk/gstika.zip (added) Zipping and tarring just the binary version of the extension Sun, 14 Jun 2020 15:28:28 GMT ak19 [34175] * gs2-extensions/gstika/trunk/GS_TIKA_README.txt (modified) Minor changes to folder names Sun, 14 Jun 2020 15:23:30 GMT ak19 [34174] * gs2-extensions/gstika (added) * gs2-extensions/gstika/trunk (added) * gs2-extensions/gstika/trunk/GS_TIKA_README.txt (added) * gs2-extensions/gstika/trunk/java (added) * gs2-extensions/gstika/trunk/java/GSTikaCLI.sh (added) * gs2-extensions/gstika/trunk/java/build (added) * gs2-extensions/gstika/trunk/java/build/LICENSE.txt (added) * gs2-extensions/gstika/trunk/java/build/NOTICE.txt (added) * gs2-extensions/gstika/trunk/java/build/org (added) * gs2-extensions/gstika/trunk/java/build/org/greenstone (added) * gs2-extensions/gstika/trunk/java/build/org/greenstone/tika (added) * 
gs2-extensions/gstika/trunk/java/build/org/greenstone/tika/GSTikaCLI$1.class (added) * gs2-extensions/gstika/trunk/java/build/org/greenstone/tika/GSTikaCLI$10.class (added) * gs2-extensions/gstika/trunk/java/build/org/greenstone/tika/GSTikaCLI$11.class (added) * gs2-extensions/gstika/trunk/java/build/org/greenstone/tika/GSTikaCLI$12.class (added) * gs2-extensions/gstika/trunk/java/build/org/greenstone/tika/GSTikaCLI$13.class (added) * gs2-extensions/gstika/trunk/java/build/org/greenstone/tika/GSTikaCLI$14.class (added) * gs2-extensions/gstika/trunk/java/build/org/greenstone/tika/GSTikaCLI$2.class (added) * gs2-extensions/gstika/trunk/java/build/org/greenstone/tika/GSTikaCLI$3.class (added) * gs2-extensions/gstika/trunk/java/build/org/greenstone/tika/GSTikaCLI$4.class (added) * gs2-extensions/gstika/trunk/java/build/org/greenstone/tika/GSTikaCLI$5.class (added) * gs2-extensions/gstika/trunk/java/build/org/greenstone/tika/GSTikaCLI$6.class (added) * gs2-extensions/gstika/trunk/java/build/org/greenstone/tika/GSTikaCLI$7.class (added) * gs2-extensions/gstika/trunk/java/build/org/greenstone/tika/GSTikaCLI$8.class (added) * gs2-extensions/gstika/trunk/java/build/org/greenstone/tika/GSTikaCLI$9$1.class (added) * gs2-extensions/gstika/trunk/java/build/org/greenstone/tika/GSTikaCLI$9.class (added) * gs2-extensions/gstika/trunk/java/build/org/greenstone/tika/GSTikaCLI$FileEmbeddedDocumentExtractor.class (added) * gs2-extensions/gstika/trunk/java/build/org/greenstone/tika/GSTikaCLI$NoDocumentJSONMetHandler.class (added) * gs2-extensions/gstika/trunk/java/build/org/greenstone/tika/GSTikaCLI$NoDocumentMetHandler.class (added) * gs2-extensions/gstika/trunk/java/build/org/greenstone/tika/GSTikaCLI$NoDocumentXMPMetaHandler.class (added) * gs2-extensions/gstika/trunk/java/build/org/greenstone/tika/GSTikaCLI$OutputType.class (added) * gs2-extensions/gstika/trunk/java/build/org/greenstone/tika/GSTikaCLI$SimplePasswordProvider.class (added) * 
gs2-extensions/gstika/trunk/java/build/org/greenstone/tika/GSTikaCLI.class (added) * gs2-extensions/gstika/trunk/java/gs3-setup.sh (added) * gs2-extensions/gstika/trunk/java/lib (added) * gs2-extensions/gstika/trunk/java/lib/LICENSE.txt (added) * gs2-extensions/gstika/trunk/java/lib/tika-app-1.24.1.jar (added) * gs2-extensions/gstika/trunk/java/makeGSTikaCLI.sh (added) * gs2-extensions/gstika/trunk/java/setup.bash (added) * gs2-extensions/gstika/trunk/java/src (added) * gs2-extensions/gstika/trunk/java/src/LICENSE.txt (added) * gs2-extensions/gstika/trunk/java/src/NOTICE.txt (added) * gs2-extensions/gstika/trunk/java/src/org (added) * gs2-extensions/gstika/trunk/java/src/org/greenstone (added) * gs2-extensions/gstika/trunk/java/src/org/greenstone/tika (added) * gs2-extensions/gstika/trunk/java/src/org/greenstone/tika/GSTikaCLI$1.class (added) * gs2-extensions/gstika/trunk/java/src/org/greenstone/tika/GSTikaCLI$10.class (added) * gs2-extensions/gstika/trunk/java/src/org/greenstone/tika/GSTikaCLI$11.class (added) * gs2-extensions/gstika/trunk/java/src/org/greenstone/tika/GSTikaCLI$12.class (added) * gs2-extensions/gstika/trunk/java/src/org/greenstone/tika/GSTikaCLI$13.class (added) * gs2-extensions/gstika/trunk/java/src/org/greenstone/tika/GSTikaCLI$14.class (added) * gs2-extensions/gstika/trunk/java/src/org/greenstone/tika/GSTikaCLI$2.class (added) * gs2-extensions/gstika/trunk/java/src/org/greenstone/tika/GSTikaCLI$3.class (added) * gs2-extensions/gstika/trunk/java/src/org/greenstone/tika/GSTikaCLI$4.class (added) * gs2-extensions/gstika/trunk/java/src/org/greenstone/tika/GSTikaCLI$5.class (added) * gs2-extensions/gstika/trunk/java/src/org/greenstone/tika/GSTikaCLI$6.class (added) * gs2-extensions/gstika/trunk/java/src/org/greenstone/tika/GSTikaCLI$7.class (added) * gs2-extensions/gstika/trunk/java/src/org/greenstone/tika/GSTikaCLI$8.class (added) * gs2-extensions/gstika/trunk/java/src/org/greenstone/tika/GSTikaCLI$9$1.class (added) * 
gs2-extensions/gstika/trunk/java/src/org/greenstone/tika/GSTikaCLI$9.class (added) * gs2-extensions/gstika/trunk/java/src/org/greenstone/tika/GSTikaCLI$FileEmbeddedDocumentExtractor.class (added) * gs2-extensions/gstika/trunk/java/src/org/greenstone/tika/GSTikaCLI$NoDocumentJSONMetHandler.class (added) * gs2-extensions/gstika/trunk/java/src/org/greenstone/tika/GSTikaCLI$NoDocumentMetHandler.class (added) * gs2-extensions/gstika/trunk/java/src/org/greenstone/tika/GSTikaCLI$NoDocumentXMPMetaHandler.class (added) * gs2-extensions/gstika/trunk/java/src/org/greenstone/tika/GSTikaCLI$OutputType.class (added) * gs2-extensions/gstika/trunk/java/src/org/greenstone/tika/GSTikaCLI$SimplePasswordProvider.class (added) * gs2-extensions/gstika/trunk/java/src/org/greenstone/tika/GSTikaCLI.class (added) * gs2-extensions/gstika/trunk/java/src/org/greenstone/tika/GSTikaCLI.java (added) 1. Created GSTikaCLI.java based off TikaCLI.java of the apache tika- ... Sun, 14 Jun 2020 13:34:57 GMT ak19 [34173] * main/trunk/greenstone2/collect/modelcol/etc/collectionConfig.xml (modified) The more general way of launching the apache tika-app jar file. This ... Sun, 14 Jun 2020 07:11:13 GMT ak19 [34172] * main/trunk/greenstone2/collect/modelcol/etc/collectionConfig.xml (modified) * main/trunk/greenstone2/ext/tika/README.txt (modified) Some minor improvements to the UnknownConverterPlugin settings for ... 
Sat, 13 Jun 2020 15:50:30 GMT ak19 [34171] * main/trunk/greenstone2/ext/tika/README.txt (modified) Minor Sat, 13 Jun 2020 15:46:19 GMT ak19 [34170] * main/trunk/greenstone2/perllib/strings.properties (modified) Helpful instruction Sat, 13 Jun 2020 15:40:21 GMT ak19 [34169] * main/trunk/greenstone2/collect/modelcol/etc/collectionConfig.xml (modified) * main/trunk/greenstone2/ext/tika (added) * main/trunk/greenstone2/ext/tika/LICENSE.txt (added) * main/trunk/greenstone2/ext/tika/NOTICE.txt (added) * main/trunk/greenstone2/ext/tika/README.txt (added) * main/trunk/greenstone2/ext/tika/tika-app-1.24.1.jar (added) All GS3 needs to convert docx files to basic html (no images) out of ... Sat, 13 Jun 2020 14:34:57 GMT ak19 [34168] * main/trunk/gli/src/org/greenstone/gatherer/collection/CollectionManager.java (modified) Stupid oversight on my part yesterday: when fixing up client-GLI so ... Sat, 13 Jun 2020 09:05:08 GMT ak19 [34167] * main/trunk/gs3-collection-configs (modified) Added svn externals properties to gs3colcfg module for the Italian ... Sat, 13 Jun 2020 08:52:59 GMT ak19 [34166] * gs3-extensions/solr/trunk/src/collect/solr-jdbm-demo/resources/collectionConfig_it.properties (added) * main/trunk/greenstone3/web/sites/localsite/collect/lucene-jdbm-demo/resources/collectionConfig_it.properties (added) * main/trunk/greenstone3/web/sites/localsite/resources/siteConfig_it.properties (added) Adding Italian language translations of the gs3colcfg module. Many ... Sat, 13 Jun 2020 08:40:30 GMT ak19 [34165] * main/trunk/greenstone2/macros/italian.dm (modified) * main/trunk/greenstone2/macros/italian2.dm (modified) * main/trunk/greenstone3/web/WEB-INF/classes/core_servlet_dictionary_it.properties (modified) * main/trunk/greenstone3/web/WEB-INF/classes/interface_default_it.properties (modified) * main/trunk/greenstone3/web/WEB-INF/classes/metadata_names_it.properties (modified) Italian language updates to GS2 core module and translations for GS3 ... 
Sat, 13 Jun 2020 06:13:31 GMT ak19 [34164] * main/trunk/greenstone3/src/java/org/greenstone/gsdl3/util/ServletRealmCheck.java (modified) Adding warning comments about where stderr messages n ... Sat, 13 Jun 2020 04:49:44 GMT ak19 [34163] * documentation/trunk/tutorials/xml-source/tutorial_en.xml (modified) Minor changes to the recent commit Sat, 13 Jun 2020 04:37:06 GMT ak19 [34162] * documentation/trunk/tutorials/xml-source/tutorial_en.xml (modified) Tutorial document now contains the crucial id=gs_content in the ... Fri, 12 Jun 2020 20:43:50 GMT ak19 [34161] * main/trunk/gli/src/org/greenstone/gatherer/remote/RemoteGreenstoneServerAction.java (modified) Fixed last of the client-gli/remote GS3 bugs discovered yesterday. ... Fri, 12 Jun 2020 18:50:06 GMT ak19 [34160] * main/trunk/gli/src/org/greenstone/gatherer/collection/CollectionManager.java (modified) * main/trunk/gli/src/org/greenstone/gatherer/remote/RemoteGreenstoneServer.java (modified) * main/trunk/greenstone2/common-src/cgi-bin/gliserver.pl (modified) * main/trunk/greenstone3/src/java/org/greenstone/gsdl3/util/ServletRealmCheck.java (modified) Completing TODO from Kathy's commit message for 34116 for ... Fri, 12 Jun 2020 18:35:29 GMT ak19 [34159] * main/trunk/greenstone2/common-src/cgi-bin/gliserver.pl (modified) Kathy had earlier requested that I recommit the gliserver.pl file she ... Fri, 12 Jun 2020 11:47:19 GMT ak19 [34158] * main/trunk/gli/src/org/greenstone/gatherer/collection/CollectionManager.java (modified) * main/trunk/greenstone2/bin/script/full-rebuild.pl (modified) * main/trunk/greenstone2/bin/script/incremental-rebuild.pl (modified) Fixing discovery of client-gli issues with previewing a different ... 
Fri, 12 Jun 2020 11:36:28 GMT ak19 [34157] * main/trunk/greenstone3/web/etc/usersDB/log/log.ctrl (modified) * main/trunk/greenstone3/web/etc/usersDB/log/log1.dat (modified) * main/trunk/greenstone3/web/etc/usersDB/log/logmirror.ctrl (modified) * main/trunk/greenstone3/web/etc/usersDB/seg0/c10.dat (modified) * main/trunk/greenstone3/web/etc/usersDB/seg0/c230.dat (modified) Undoing accidental commit of unintended files, part3 Fri, 12 Jun 2020 11:32:27 GMT ak19 [34156] * main/trunk/greenstone3/web/WEB-INF/servlets.xml (modified) Undoing accidental commit of unintended files, part2 Fri, 12 Jun 2020 11:30:24 GMT ak19 [34155] * main/trunk/greenstone3/web/sites/localsite/collect/lucene-jdbm-demo/etc/collectionConfig.xml (modified) Undoing accidental commit of unintended files Fri, 12 Jun 2020 11:23:04 GMT ak19 [34154] * main/trunk/greenstone3/src/java/org/greenstone/gsdl3/collection/Collection.java (modified) * main/trunk/greenstone3/web/WEB-INF/servlets.xml (modified) * main/trunk/greenstone3/web/etc/usersDB/log/log.ctrl (modified) * main/trunk/greenstone3/web/etc/usersDB/log/log1.dat (modified) * main/trunk/greenstone3/web/etc/usersDB/log/logmirror.ctrl (modified) * main/trunk/greenstone3/web/etc/usersDB/seg0/c10.dat (modified) * main/trunk/greenstone3/web/etc/usersDB/seg0/c230.dat (modified) * main/trunk/greenstone3/web/sites/localsite/collect/lucene-jdbm-demo/etc/collectionConfig.xml (modified) Useful debugging statement. Would have helped me solve a bug sooner ... Fri, 12 Jun 2020 09:23:42 GMT ak19 [34153] * main/trunk/gli/src/org/greenstone/gatherer/remote/ActionQueue.java (modified) While trying to debug client-gli to remote issues, found some more ... Fri, 12 Jun 2020 08:40:59 GMT ak19 [34152] * main/trunk/greenstone2/perllib/servercontrol.pm (modified) When sending a request to activate and deactivate, can request ... Thu, 11 Jun 2020 02:16:45 GMT ak19 [34151] * main/trunk/greenstone2/common-src/cgi-bin/CGIModule.tar.gz (added) Part 1. Untested. 
(The commit didn't go through previously for ... Thu, 11 Jun 2020 02:14:50 GMT ak19 [34150] * main/trunk/greenstone3/web/WEB-INF/cgi (modified) Part 2. Untested. Setting svn externals property to pull ... Thu, 11 Jun 2020 01:16:20 GMT ak19 [34149] * main/trunk/greenstone3/web/interfaces/default/transform/layouts/main.xsl (modified) Adding note to stress the importance of ensuring the div containing ... Wed, 10 Jun 2020 17:43:57 GMT ak19 [34148] * main/trunk/model-interfaces-dev/opotiki/transform/layouts/main.xsl (modified) Fix for broken remote greenstone server, which wouldn't load ... Wed, 10 Jun 2020 14:11:19 GMT ak19 [34147] * main/trunk/release-kits/shared/linux/XML-Parser/64-bit/perl-5.30/auto/XML/Parser/Expat/Expat.so (added) Related to prev commit. Part 2 of: The Expat.so I built on the uni ... Wed, 10 Jun 2020 14:08:35 GMT ak19 [34146] * main/trunk/release-kits/shared/linux/XML-Parser/64-bit/perl-5.30/auto/XML/Parser/Expat/Expat.notWorking (moved) The Expat.so I built on the uni machine against a perl 5.30 I ... Wed, 10 Jun 2020 14:01:14 GMT ak19 [34145] * main/trunk/model-interfaces-dev/opotiki/transform/layouts/main.xsl (modified) Copyright info and link to OS templates must remain intact as seen in ... Wed, 10 Jun 2020 02:57:31 GMT ak19 [34144] * main/trunk/model-interfaces-dev/opotiki/transform/layouts/main.xsl (modified) Adding in the Depositor link (visible only to logged in users) to the ... Wed, 10 Jun 2020 02:56:19 GMT ak19 [34143] * main/trunk/model-interfaces-dev/opotiki/styles/layout.css (modified) * main/trunk/model-interfaces-dev/opotiki/transform/layouts/main.xsl (modified) Some interface changes made by Dr Bainbridge and some by me on his ... Tue, 09 Jun 2020 03:15:19 GMT ak19 [34142] * main/trunk/release-kits/shared/linux/XML-Parser/64-bit/perl-5.30/auto/XML/Parser/Expat/Expat.so (added) Manually force-adding the Expat.so to svn which STILL didn't get ... 
Tue, 09 Jun 2020 03:11:26 GMT ak19 [34141] * main/trunk/release-kits/shared/linux/XML-Parser/64-bit/perl-5.30/XML (added) * main/trunk/release-kits/shared/linux/XML-Parser/64-bit/perl-5.30/XML/Parser (added) * main/trunk/release-kits/shared/linux/XML-Parser/64-bit/perl-5.30/XML/Parser.pm (added) * main/trunk/release-kits/shared/linux/XML-Parser/64-bit/perl-5.30/XML/Parser/Encodings (added) * main/trunk/release-kits/shared/linux/XML-Parser/64-bit/perl-5.30/XML/Parser/Encodings/Japanese_Encodings.msg (added) * main/trunk/release-kits/shared/linux/XML-Parser/64-bit/perl-5.30/XML/Parser/Encodings/README (added) * main/trunk/release-kits/shared/linux/XML-Parser/64-bit/perl-5.30/XML/Parser/Encodings/big5.enc (added) * main/trunk/release-kits/shared/linux/XML-Parser/64-bit/perl-5.30/XML/Parser/Encodings/euc-kr.enc (added) * main/trunk/release-kits/shared/linux/XML-Parser/64-bit/perl-5.30/XML/Parser/Encodings/ibm866.enc (added) * main/trunk/release-kits/shared/linux/XML-Parser/64-bit/perl-5.30/XML/Parser/Encodings/iso-8859-2.enc (added) * main/trunk/release-kits/shared/linux/XML-Parser/64-bit/perl-5.30/XML/Parser/Encodings/iso-8859-3.enc (added) * main/trunk/release-kits/shared/linux/XML-Parser/64-bit/perl-5.30/XML/Parser/Encodings/iso-8859-4.enc (added) * main/trunk/release-kits/shared/linux/XML-Parser/64-bit/perl-5.30/XML/Parser/Encodings/iso-8859-5.enc (added) * main/trunk/release-kits/shared/linux/XML-Parser/64-bit/perl-5.30/XML/Parser/Encodings/iso-8859-7.enc (added) * main/trunk/release-kits/shared/linux/XML-Parser/64-bit/perl-5.30/XML/Parser/Encodings/iso-8859-8.enc (added) * main/trunk/release-kits/shared/linux/XML-Parser/64-bit/perl-5.30/XML/Parser/Encodings/iso-8859-9.enc (added) * main/trunk/release-kits/shared/linux/XML-Parser/64-bit/perl-5.30/XML/Parser/Encodings/koi8-r.enc (added) * main/trunk/release-kits/shared/linux/XML-Parser/64-bit/perl-5.30/XML/Parser/Encodings/windows-1250.enc (added) * 
main/trunk/release-kits/shared/linux/XML-Parser/64-bit/perl-5.30/XML/Parser/Encodings/windows-1251.enc (added) * main/trunk/release-kits/shared/linux/XML-Parser/64-bit/perl-5.30/XML/Parser/Encodings/windows-1252.enc (added) * main/trunk/release-kits/shared/linux/XML-Parser/64-bit/perl-5.30/XML/Parser/Encodings/windows-1255.enc (added) * main/trunk/release-kits/shared/linux/XML-Parser/64-bit/perl-5.30/XML/Parser/Encodings/x-euc-jp-jisx0221.enc (added) * main/trunk/release-kits/shared/linux/XML-Parser/64-bit/perl-5.30/XML/Parser/Encodings/x-euc-jp-unicode.enc (added) * main/trunk/release-kits/shared/linux/XML-Parser/64-bit/perl-5.30/XML/Parser/Encodings/x-sjis-cp932.enc (added) * main/trunk/release-kits/shared/linux/XML-Parser/64-bit/perl-5.30/XML/Parser/Encodings/x-sjis-jdk117.enc (added) * main/trunk/release-kits/shared/linux/XML-Parser/64-bit/perl-5.30/XML/Parser/Encodings/x-sjis-jisx0221.enc (added) * main/trunk/release-kits/shared/linux/XML-Parser/64-bit/perl-5.30/XML/Parser/Encodings/x-sjis-unicode.enc (added) * main/trunk/release-kits/shared/linux/XML-Parser/64-bit/perl-5.30/XML/Parser/Expat.pm (added) * main/trunk/release-kits/shared/linux/XML-Parser/64-bit/perl-5.30/XML/Parser/LWPExternEnt.pl (added) * main/trunk/release-kits/shared/linux/XML-Parser/64-bit/perl-5.30/XML/Parser/Style (added) * main/trunk/release-kits/shared/linux/XML-Parser/64-bit/perl-5.30/XML/Parser/Style/Debug.pm (added) * main/trunk/release-kits/shared/linux/XML-Parser/64-bit/perl-5.30/XML/Parser/Style/Objects.pm (added) * main/trunk/release-kits/shared/linux/XML-Parser/64-bit/perl-5.30/XML/Parser/Style/Stream.pm (added) * main/trunk/release-kits/shared/linux/XML-Parser/64-bit/perl-5.30/XML/Parser/Style/Subs.pm (added) * main/trunk/release-kits/shared/linux/XML-Parser/64-bit/perl-5.30/XML/Parser/Style/Tree.pm (added) * main/trunk/release-kits/shared/linux/XML-Parser/64-bit/perl-5.30/auto (added) * main/trunk/release-kits/shared/linux/XML-Parser/64-bit/perl-5.30/auto/XML (added) * 
main/trunk/release-kits/shared/linux/XML-Parser/64-bit/perl-5.30/auto/XML/Parser (added) * main/trunk/release-kits/shared/linux/XML-Parser/64-bit/perl-5.30/auto/XML/Parser/.packlist (added) * main/trunk/release-kits/shared/linux/XML-Parser/64-bit/perl-5.30/auto/XML/Parser/Expat (added) * main/trunk/release-kits/shared/linux/XML-Parser/64-bit/perl-5.30/auto/XML/Parser/Expat/Expat.bs (added) Redoing commit for XML-Parser of perl-5.30 as the previous commit ... Tue, 09 Jun 2020 03:07:56 GMT ak19 [34140] * main/trunk/release-kits/shared/linux/XML-Parser/64-bit/perl-5.30/XML (deleted) * main/trunk/release-kits/shared/linux/XML-Parser/64-bit/perl-5.30/auto (deleted) Recommitting since there are all kinds of question marks about what ... Tue, 09 Jun 2020 01:43:48 GMT ak19 [34139] * main/trunk/release-kits/shared/linux/XML-Parser/64-bit/perl-5.22/auto/XML/Parser/Expat/Expat.so (added) Not sure why the perl-5.22's Expat folder was empty, adding in ... Mon, 08 Jun 2020 07:00:25 GMT ak19 [34138] * main/trunk/release-kits/shared/linux/XML-Parser/64-bit/perl-5.30 (added) * main/trunk/release-kits/shared/linux/XML-Parser/64-bit/perl-5.30/XML (added) * main/trunk/release-kits/shared/linux/XML-Parser/64-bit/perl-5.30/XML/Parser (added) * main/trunk/release-kits/shared/linux/XML-Parser/64-bit/perl-5.30/XML/Parser.pm (added) * main/trunk/release-kits/shared/linux/XML-Parser/64-bit/perl-5.30/XML/Parser/Encodings (added) * main/trunk/release-kits/shared/linux/XML-Parser/64-bit/perl-5.30/XML/Parser/Encodings/Japanese_Encodings.msg (added) * main/trunk/release-kits/shared/linux/XML-Parser/64-bit/perl-5.30/XML/Parser/Encodings/README (added) * main/trunk/release-kits/shared/linux/XML-Parser/64-bit/perl-5.30/XML/Parser/Encodings/big5.enc (added) * main/trunk/release-kits/shared/linux/XML-Parser/64-bit/perl-5.30/XML/Parser/Encodings/euc-kr.enc (added) * main/trunk/release-kits/shared/linux/XML-Parser/64-bit/perl-5.30/XML/Parser/Encodings/ibm866.enc (added) * 
main/trunk/release-kits/shared/linux/XML-Parser/64-bit/perl-5.30/XML/Parser/Encodings/iso-8859-2.enc (added) * main/trunk/release-kits/shared/linux/XML-Parser/64-bit/perl-5.30/XML/Parser/Encodings/iso-8859-3.enc (added) * main/trunk/release-kits/shared/linux/XML-Parser/64-bit/perl-5.30/XML/Parser/Encodings/iso-8859-4.enc (added) * main/trunk/release-kits/shared/linux/XML-Parser/64-bit/perl-5.30/XML/Parser/Encodings/iso-8859-5.enc (added) * main/trunk/release-kits/shared/linux/XML-Parser/64-bit/perl-5.30/XML/Parser/Encodings/iso-8859-7.enc (added) * main/trunk/release-kits/shared/linux/XML-Parser/64-bit/perl-5.30/XML/Parser/Encodings/iso-8859-8.enc (added) * main/trunk/release-kits/shared/linux/XML-Parser/64-bit/perl-5.30/XML/Parser/Encodings/iso-8859-9.enc (added) * main/trunk/release-kits/shared/linux/XML-Parser/64-bit/perl-5.30/XML/Parser/Encodings/koi8-r.enc (added) * main/trunk/release-kits/shared/linux/XML-Parser/64-bit/perl-5.30/XML/Parser/Encodings/windows-1250.enc (added) * main/trunk/release-kits/shared/linux/XML-Parser/64-bit/perl-5.30/XML/Parser/Encodings/windows-1251.enc (added) * main/trunk/release-kits/shared/linux/XML-Parser/64-bit/perl-5.30/XML/Parser/Encodings/windows-1252.enc (added) * main/trunk/release-kits/shared/linux/XML-Parser/64-bit/perl-5.30/XML/Parser/Encodings/windows-1255.enc (added) * main/trunk/release-kits/shared/linux/XML-Parser/64-bit/perl-5.30/XML/Parser/Encodings/x-euc-jp-jisx0221.enc (added) * main/trunk/release-kits/shared/linux/XML-Parser/64-bit/perl-5.30/XML/Parser/Encodings/x-euc-jp-unicode.enc (added) * main/trunk/release-kits/shared/linux/XML-Parser/64-bit/perl-5.30/XML/Parser/Encodings/x-sjis-cp932.enc (added) * main/trunk/release-kits/shared/linux/XML-Parser/64-bit/perl-5.30/XML/Parser/Encodings/x-sjis-jdk117.enc (added) * main/trunk/release-kits/shared/linux/XML-Parser/64-bit/perl-5.30/XML/Parser/Encodings/x-sjis-jisx0221.enc (added) * 
main/trunk/release-kits/shared/linux/XML-Parser/64-bit/perl-5.30/XML/Parser/Encodings/x-sjis-unicode.enc (added) * main/trunk/release-kits/shared/linux/XML-Parser/64-bit/perl-5.30/XML/Parser/Expat.pm (added) * main/trunk/release-kits/shared/linux/XML-Parser/64-bit/perl-5.30/XML/Parser/LWPExternEnt.pl (added) * main/trunk/release-kits/shared/linux/XML-Parser/64-bit/perl-5.30/XML/Parser/Style (added) * main/trunk/release-kits/shared/linux/XML-Parser/64-bit/perl-5.30/XML/Parser/Style/Debug.pm (added) * main/trunk/release-kits/shared/linux/XML-Parser/64-bit/perl-5.30/XML/Parser/Style/Objects.pm (added) * main/trunk/release-kits/shared/linux/XML-Parser/64-bit/perl-5.30/XML/Parser/Style/Stream.pm (added) * main/trunk/release-kits/shared/linux/XML-Parser/64-bit/perl-5.30/XML/Parser/Style/Subs.pm (added) * main/trunk/release-kits/shared/linux/XML-Parser/64-bit/perl-5.30/XML/Parser/Style/Tree.pm (added) * main/trunk/release-kits/shared/linux/XML-Parser/64-bit/perl-5.30/auto (added) * main/trunk/release-kits/shared/linux/XML-Parser/64-bit/perl-5.30/auto/XML (added) * main/trunk/release-kits/shared/linux/XML-Parser/64-bit/perl-5.30/auto/XML/Parser (added) * main/trunk/release-kits/shared/linux/XML-Parser/64-bit/perl-5.30/auto/XML/Parser/.packlist (added) * main/trunk/release-kits/shared/linux/XML-Parser/64-bit/perl-5.30/auto/XML/Parser/Expat (added) * main/trunk/release-kits/shared/linux/XML-Parser/64-bit/perl-5.30/auto/XML/Parser/Expat/Expat.bs (added) XML-Parser for perl version 5.30 (specifically perl 5.30.3 was ... Fri, 05 Jun 2020 07:51:38 GMT ak19 [34137] * main/trunk/greenstone2/perllib/plugins/NutchTextDumpPlugin.pm (modified) Have only been able to incorporate one of Dr Bainbridge's ... 
Wed, 03 Jun 2020 03:52:35 GMT ak19 [34136] * main/trunk/model-sites-dev/opotiki/collect/imagesco/etc/collectionConfig.xml (modified) * main/trunk/model-sites-dev/opotiki/collect/textdemo/etc/collectionConfig.xml (modified) * main/trunk/model-sites-dev/opotiki/collect/waiatade/etc/collectionConfig.xml (modified) * main/trunk/model-sites-dev/opotiki/resources/siteConfig.properties (modified) * main/trunk/model-sites-dev/opotiki/siteConfig.xml (modified) Incorporating Anita Kurei's improvements to display strings for ... Sat, 30 May 2020 04:42:54 GMT ak19 [34135] * main/trunk/model-sites-dev/commoncrawl/collect/allismri/etc/collectionConfig.xml (modified) Changed the name of a collection making it more descriptive and also ... Sat, 30 May 2020 04:15:04 GMT ak19 [34134] * main/trunk/model-sites-dev/commoncrawl/collect/allismri/untarSiteLevelImportTarballHere (added) Added an empty text file with instruction for the allismri collection too Sat, 30 May 2020 04:14:12 GMT ak19 [34133] * main/trunk/model-sites-dev/commoncrawl/collect/allIsMRIForDomainShortlist/untarSiteLevelImportTarballHere (added) Added an empty text file with instruction Sat, 30 May 2020 04:01:47 GMT ak19 [34132] * main/trunk/model-sites-dev/commoncrawl (added) * main/trunk/model-sites-dev/commoncrawl/collect (added) * main/trunk/model-sites-dev/commoncrawl/collect/allIsMRIForDomainShortlist (added) * main/trunk/model-sites-dev/commoncrawl/collect/allIsMRIForDomainShortlist/etc (added) * main/trunk/model-sites-dev/commoncrawl/collect/allIsMRIForDomainShortlist/etc/all_isMRIPages_forManual_containsMRIDomainListing.txt (added) * main/trunk/model-sites-dev/commoncrawl/collect/allIsMRIForDomainShortlist/etc/collectionConfig.bak (added) * main/trunk/model-sites-dev/commoncrawl/collect/allIsMRIForDomainShortlist/etc/collectionConfig.xml (added) * main/trunk/model-sites-dev/commoncrawl/collect/allIsMRIForDomainShortlist/etc/oai-inf.jdb (added) * 
main/trunk/model-sites-dev/commoncrawl/collect/allIsMRIForDomainShortlist/etc/oai-inf.jdb.bak (added) * main/trunk/model-sites-dev/commoncrawl/collect/allIsMRIForDomainShortlist/etc/oai-inf.lg (added) * main/trunk/model-sites-dev/commoncrawl/collect/allIsMRIForDomainShortlist/images (added) * main/trunk/model-sites-dev/commoncrawl/collect/allIsMRIForDomainShortlist/metadata (added) * main/trunk/model-sites-dev/commoncrawl/collect/allIsMRIForDomainShortlist/metadata/ex.mds (added) * main/trunk/model-sites-dev/commoncrawl/collect/allIsMRIForDomainShortlist/metadata/greenstone.mds (added) * main/trunk/model-sites-dev/commoncrawl/collect/allIsMRIForDomainShortlist/metadata/profile.xml (added) * main/trunk/model-sites-dev/commoncrawl/collect/allIsMRIForDomainShortlist/script (added) * main/trunk/model-sites-dev/commoncrawl/collect/allIsMRIForDomainShortlist/style (added) * main/trunk/model-sites-dev/commoncrawl/collect/allismri (added) * main/trunk/model-sites-dev/commoncrawl/collect/allismri/etc (added) * main/trunk/model-sites-dev/commoncrawl/collect/allismri/etc/collectionConfig.bak (added) * main/trunk/model-sites-dev/commoncrawl/collect/allismri/etc/collectionConfig.xml (added) * main/trunk/model-sites-dev/commoncrawl/collect/allismri/etc/isMRI_urls.txt (added) * main/trunk/model-sites-dev/commoncrawl/collect/allismri/etc/oai-inf.jdb (added) * main/trunk/model-sites-dev/commoncrawl/collect/allismri/etc/oai-inf.jdb.bak (added) * main/trunk/model-sites-dev/commoncrawl/collect/allismri/etc/oai-inf.lg (added) * main/trunk/model-sites-dev/commoncrawl/collect/allismri/images (added) * main/trunk/model-sites-dev/commoncrawl/collect/allismri/metadata (added) * main/trunk/model-sites-dev/commoncrawl/collect/allismri/metadata/ex.mds (added) * main/trunk/model-sites-dev/commoncrawl/collect/allismri/metadata/greenstone.mds (added) * main/trunk/model-sites-dev/commoncrawl/collect/allismri/metadata/profile.xml (added) * 
main/trunk/model-sites-dev/commoncrawl/collect/allismri/script (added) * main/trunk/model-sites-dev/commoncrawl/collect/allismri/style (added) * main/trunk/model-sites-dev/commoncrawl/etc (added) * main/trunk/model-sites-dev/commoncrawl/import_nutchDumpTxtsOfcrawledMRICC.tar.gz (added) * main/trunk/model-sites-dev/commoncrawl/moveDumpTxtFilesIntoImport.sh (added) * main/trunk/model-sites-dev/commoncrawl/resources (added) * main/trunk/model-sites-dev/commoncrawl/resources/siteConfig.properties (added) * main/trunk/model-sites-dev/commoncrawl/siteConfig.xml (added) Committing the commoncrawl site of Nutch recrawls of our CC data ... Sat, 30 May 2020 03:18:25 GMT ak19 [34131] * main/trunk/greenstone2/perllib/plugins/NutchTextDumpPlugin.pm (modified) Allowing input keep-urls-file to contain a comma followed by country ... Fri, 29 May 2020 13:27:03 GMT ak19 [34130] * main/trunk/greenstone2/perllib/plugins/NutchTextDumpPlugin.pm (modified) Some more tidying up while isMRI filtered collection rebuilding Fri, 29 May 2020 13:01:01 GMT ak19 [34129] * main/trunk/greenstone2/perllib/plugins/NutchTextDumpPlugin.pm (modified) Implemented Kathy's suggestions: 1. Explicit ex prefix to ex meta ... Wed, 27 May 2020 08:06:24 GMT ak19 [34128] * main/trunk/greenstone2/bin/script/full-rebuild.pl (modified) When rebuilding the opotiki site today, had noticed that full- ... Wed, 27 May 2020 07:43:03 GMT ak19 [34127] * other-projects/maori-lang-detection/mongodb-data/pieChart3c_screenshot_SimplerCrawledWebPages_EmptyVsInMongoDB.png (moved) Spelling correction in filename: screeMshot to screeNshot Wed, 27 May 2020 07:10:44 GMT ak19 [34126] * main/trunk/greenstone2/perllib/plugins/NutchTextDumpPlugin.pm (modified) When I'd modified the code to make the keep_urls_file non-compulsory, ... Wed, 27 May 2020 06:07:26 GMT ak19 [34125] * main/trunk/greenstone2/perllib/plugins/NutchTextDumpPlugin.pm (modified) Commit message went awry. Cleaned up some comments to recommit with ... 
Wed, 27 May 2020 06:03:58 GMT ak19 [34124] * main/trunk/greenstone2/perllib/plugins/NutchTextDumpPlugin.pm (modified) Decoding the title and text using the encoding seemed to have turned ... Mon, 25 May 2020 14:18:44 GMT ak19 [34123] * main/trunk/greenstone2/perllib/plugins/NutchTextDumpPlugin.pm (modified) Some more minor changes Mon, 25 May 2020 13:13:33 GMT ak19 [34122] * main/trunk/greenstone2/perllib/plugins/NutchTextDumpPlugin.pm (modified) 1. After some testing of building the complete commoncrawl ... Mon, 25 May 2020 11:53:29 GMT ak19 [34121] * main/trunk/greenstone2/perllib/plugins/NutchTextDumpPlugin.pm (added) * main/trunk/greenstone2/perllib/strings.properties (modified) * main/trunk/greenstone2/perllib/util.pm (modified) 1. Introducing NutchTextDumpPlugin to process the records ... Thu, 21 May 2020 05:47:46 GMT ak19 [34120] * other-projects/maori-lang-detection/mongodb-data/random260.csv (added) CSV version of .ods file, so openoffice isn't required Thu, 21 May 2020 05:28:45 GMT ak19 [34119] * other-projects/maori-lang-detection/mongodb-data-auto (added) * other-projects/maori-lang-detection/mongodb-data-auto/1table_allCrawledSites.csv (added) * other-projects/maori-lang-detection/mongodb-data-auto/1table_allCrawledSites.json (added) * other-projects/maori-lang-detection/mongodb-data-auto/1table_allCrawledSites.png (added) * other-projects/maori-lang-detection/mongodb-data-auto/2table_sitesWithPagesInMRI.csv (added) * other-projects/maori-lang-detection/mongodb-data-auto/2table_sitesWithPagesInMRI.json (added) * other-projects/maori-lang-detection/mongodb-data-auto/2table_sitesWithPagesInMRI.png (added) * other-projects/maori-lang-detection/mongodb-data-auto/3table_sitesWithPagesContainingMRI.csv (added) * other-projects/maori-lang-detection/mongodb-data-auto/3table_sitesWithPagesContainingMRI.json (added) * other-projects/maori-lang-detection/mongodb-data-auto/3table_sitesWithPagesContainingMRI.png (added) * 
other-projects/maori-lang-detection/mongodb-data-auto/4table_containsMRI_exclTentativeProductSites.csv (added) * other-projects/maori-lang-detection/mongodb-data-auto/4table_containsMRI_exclTentativeProductSites.json (added) * other-projects/maori-lang-detection/mongodb-data-auto/4table_containsMRI_exclTentativeProductSites.png (added) * other-projects/maori-lang-detection/mongodb-data-auto/5a_counts_tentativeNonAutotranslatedSites.json (added) * other-projects/maori-lang-detection/mongodb-data-auto/5b_counts_overseasSitesWithMiInPath.json (added) * other-projects/maori-lang-detection/mongodb-data-auto/5counts_containsMRISites_allNZGrouped.json (added) * other-projects/maori-lang-detection/mongodb-data-auto/5table_sitesWithPagesContainingMRI_allNZGrouped.csv (added) * other-projects/maori-lang-detection/mongodb-data-auto/5table_sitesWithPagesContainingMRI_allNZGrouped.json (added) * other-projects/maori-lang-detection/mongodb-data-auto/5table_sitesWithPagesContainingMRI_allNZGrouped.png (added) * other-projects/maori-lang-detection/mongodb-data-auto/5table_sitesWithPagesInMRI_allNZGrouped.csv (added) * other-projects/maori-lang-detection/mongodb-data-auto/5table_sitesWithPagesInMRI_allNZGrouped.json (added) * other-projects/maori-lang-detection/mongodb-data-auto/5table_sitesWithPagesInMRI_allNZGrouped.png (added) * other-projects/maori-lang-detection/mongodb-data-auto/6counts_sitesWithPagesContainingMRI_manualShortlist.json (added) * other-projects/maori-lang-detection/mongodb-data-auto/6counts_sitesWithPagesContainingMRI_manualShortlist.png (added) * other-projects/maori-lang-detection/mongodb-data-auto/geojson-features_1table_allCrawledSites.json (added) * other-projects/maori-lang-detection/mongodb-data-auto/geojson-features_2table_sitesWithPagesInMRI.json (added) * other-projects/maori-lang-detection/mongodb-data-auto/geojson-features_3table_sitesWithPagesContainingMRI.json (added) * 
other-projects/maori-lang-detection/mongodb-data-auto/geojson-features_4table_containsMRI_exclTentativeProductSites.json (added) * other-projects/maori-lang-detection/mongodb-data-auto/geojson-features_5table_sitesWithPagesContainingMRI_allNZGrouped.json (added) * other-projects/maori-lang-detection/mongodb-data-auto/geojson-features_5table_sitesWithPagesInMRI_allNZGrouped.json (added) * other-projects/maori-lang-detection/mongodb-data-auto/geojson-features_6counts_sitesWithPagesContainingMRI_manualShortlist.json (added) * other-projects/maori-lang-detection/mongodb-data-auto/isMRI_full_manualList_globalDomains_whereAPageContainsMRI.txt (added) * other-projects/maori-lang-detection/mongodb-data-auto/random260_manualList_globalDomains_whereAPageContainsMRI.txt (added) Committing the auto-generated analysis results folder, mongodb-data- ... Thu, 21 May 2020 02:16:12 GMT ak19 [34118] * main/trunk/greenstone2/common-src/cgi-bin/gliserver.pl (modified) Kathy's hard work for commit 34117 was done on a Windows machine ... Wed, 20 May 2020 03:53:56 GMT kjdon [34117] * main/trunk/greenstone2/common-src/cgi-bin/gliserver.pl (modified) tidied up the code. Moved a few commands that don't actually need ... Wed, 20 May 2020 02:44:53 GMT kjdon [34116] * main/trunk/greenstone3/src/java/org/greenstone/gsdl3/util/ServletRealmCheck.java (modified) use global.properties, not build.properties. therefore call this with ... Tue, 19 May 2020 03:03:21 GMT kjdon [34115] * main/trunk/gli/src/org/greenstone/gatherer/gui/Preferences.java (modified) a couple changes. 1 don't explicitly need to remove the lock file ... Tue, 19 May 2020 00:25:22 GMT kjdon [34114] * main/trunk/greenstone3/resources/cgi/gsdl3site.cfg.svn (modified) for gs3, gwcgi is the tomcat context, i.e. greenstone3 by default. If ... 
Mon, 18 May 2020 23:34:40 GMT ak19 [34113] * main/trunk/gli/src/org/greenstone/gatherer/Gatherer.java (modified) * main/trunk/gli/src/org/greenstone/gatherer/greenstone3/ProtocolPortProperties.java (added) * main/trunk/gli/src/org/greenstone/gatherer/gui/FedoraLogin.java (modified) 1. tomcat.port no longer exists in build.properties after https also ... Mon, 18 May 2020 01:40:55 GMT ak19 [34112] * main/trunk/greenstone3/src/java/org/greenstone/gsdl3/util/GSConstants.java (modified) * main/trunk/greenstone3/src/java/org/greenstone/gsdl3/util/XMLConverter.java (modified) * main/trunk/greenstone3/src/java/org/greenstone/gsdl3/util/XMLTransformer.java (modified) * main/trunk/greenstone3/src/java/org/greenstone/server/Server3Settings.java (modified) GS3 source code seems to already use FileInputStream with UTF-8 ... Sun, 17 May 2020 23:24:29 GMT ak19 [34111] * main/trunk/greenstone2/setup.bash (modified) * main/trunk/greenstone2/setup.bat (modified) * main/trunk/greenstone3/build.xml (modified) * main/trunk/greenstone3/gs3-setup.bat (modified) Undoing additions surrounding JAVA_TOOL_OPTIONS where file.encoding ... Tue, 05 May 2020 20:02:11 GMT kjdon [34110] * main/trunk/gli/src/org/greenstone/gatherer/greenstone/Classifiers.java (modified) modified a couple of error strings to be more helpful Sun, 03 May 2020 20:39:22 GMT kjdon [34109] * main/trunk/greenstone2/perllib/classify/DateList.pm (modified) tidied this up a bit. Now we leave in _textmonth00_ if the month is ...