

21:37 Changeset [27743] by ak19
Basic Word-PDF collection now has the same number of diffing errors on …
20:52 Changeset [27742] by ak19
Remove Windows carriage returns when Greenstone assigns titles, where it …
14:23 Changeset [27741] by jlwhisler
Adding in image-e dec collection so the simple image collection can base …


18:01 Changeset [27740] by ak19
Assocfile tutorial files are the same as for the word/pdf tutorials.
16:43 Changeset [27739] by davidb
Greenstone's video extension for 'ffmpeg'
16:38 Changeset [27738] by davidb
Greenstone's video extension for 'ffmpeg'
16:22 Changeset [27737] by davidb
Fixing up file structure to have a main trunk
16:21 Changeset [27736] by davidb
16:08 Changeset [27735] by davidb
Wrong spot for this folder. Was meant to be in the salamiEndpoint area
16:06 Changeset [27734] by davidb
Initial structure to the RDF Store for the Salami endpoint project
16:05 Changeset [27733] by davidb
Initial structure to the RDF Store for the Salami endpoint project
14:35 Changeset [27732] by jmt12
Nice the copy itself too


21:40 Changeset [27731] by ak19
Forgot to commit changes to release-kit that checkout the new imagemagick …
20:21 Changeset [27730] by ak19
More diffing issues detected when diffcol ran over the first Word and PDF …
19:07 Changeset [27729] by ak19
Recommitted rebuilt under new name
19:04 Changeset [27728] by ak19
Deleting to commit rebuilt under new name
18:59 Changeset [27727] by ak19
Renamed Word PDF Basic collection
18:54 Changeset [27726] by ak19
Word-PDF-Basic model collection
17:40 Changeset [27725] by ak19
Images can be different in size when generated by imagemagick on …
17:22 Ticket #861 (GS win binary: have a GS ready cmd console) created by ak19
Create a new Start menu shortcut for the Greenstone set of shortcuts on …
17:06 Ticket #860 (Diffcol ToDo) created by ak19
* investigate why images are different sizes between the linux-generated …


21:29 Changeset [27724] by ak19
Mac now gets its imagemagick binary from where linux gets it at svn, since …
20:20 Changeset [27723] by ak19
imagemagick pre-compiled binary for mac 10.5 (10.6), since the existing …
20:19 Changeset [27722] by ak19
The changes from a few commits back that would create symbolic links also …
19:20 Changeset [27721] by ak19
Committing 32 bit linux imagemagick binary. The changes are the same as in …
19:11 Changeset [27720] by ak19
libjpeg.so problem again: the fix in the previous commit was the wrong way …
17:01 Changeset [27719] by sjm84
Some depositor updates
16:59 Changeset [27718] by sjm84
Phase two of fixing collectionConfig templates being incorrect in Greenbug
16:58 Changeset [27717] by sjm84
A minor error check
16:57 Changeset [27716] by sjm84
Need to store xpath data from the collectionConfig for debug purposes
16:55 Changeset [27715] by sjm84
Fixed some potential perl path errors
16:53 Changeset [27714] by sjm84
Reverting an accidental change
16:52 Changeset [27713] by sjm84
Fixing collectionConfig templates being incorrect
16:49 Changeset [27712] by ak19
Committing recompiled 32 bit version of imagemagick linux binary. Commit …
16:35 Changeset [27711] by ak19
When running diffcol on the backdrop model collection, none of the images …


22:11 Changeset [27710] by ak19
Now checking out imagemagick binary (tested on linux) to work with image …
18:52 Changeset [27709] by ak19
Rebuilt backdrop with import options set correctly for diffcol, -OIDType …
18:39 Changeset [27708] by ak19
Need to rebuild backdrop collection with OID hash on full filenames and …
18:26 Changeset [27707] by ak19
Deleting some files in lomdemo that point to unresponsive urls which cause …
17:54 Changeset [27706] by ak19
Adding tutorial 2 backdrop as GS2 model collect
15:31 Changeset [27705] by sjm84
Reformatting this file
14:46 Changeset [27704] by ak19
The one change to make diffcol work on darwin.


16:36 Changeset [27703] by ak19
Dr Bainbridge fixed the final diffcol issue with Small-HTML on windows …
14:35 Changeset [27702] by ak19
I think the reports get generated and uploaded nightly, but they get …


22:43 Changeset [27701] by ak19
Fixed another very subtle issue to do with the case of the TASK_HOME env var set …
18:42 Changeset [27700] by ak19
Second part of previous commit. Rebuilt model collection Small-HTML with …
18:39 Changeset [27699] by ak19
Rebuilt model collection Small-HTML with the new '-sort' to …
17:33 Changeset [27698] by ak19
import.pl/export.pl now issues a reminder that sortmeta needs to be paired …
17:23 Changeset [27697] by ak19
Dr Bainbridge fixed it so that the gdb files generated on Windows for …


22:58 Changeset [27696] by ak19
Windows-specific import fixed for linux.
22:54 Changeset [27695] by ak19
Better diffing on Windows. If either the test or model collection was …
17:56 Changeset [27694] by ak19
Fixing up previous windows commit for linux
17:50 Changeset [27693] by davidb
Mods after latest round of development
17:50 Changeset [27692] by davidb
Handling images folder within css folder for background images
17:49 Changeset [27691] by davidb
Funding org logos
17:48 Changeset [27690] by davidb
Switched to using rsync to copy rather than cp, so .svn directories can be …
17:48 Changeset [27689] by davidb
Improvements to README instructions
17:35 Changeset [27688] by ak19
Bin folder with Readme for blat. No blat binaries though, since we're not …
17:32 Changeset [27687] by ak19
Fixes for task.pl: html diffcol report failed to upload properly to puka …
15:27 Ticket #859 (GS3 outstanding (installer and more)) created by ak19
- installer: The following ant calls in installer ends up under the …
12:25 Changeset [27686] by jmt12
A little more progress comments
12:24 Changeset [27685] by jmt12
in the case of multiple attempts you need to retain the information about …
12:22 Changeset [27684] by jmt12
Adding natural sorting into report generation - so also needed to add INC …
12:20 Changeset [27683] by jmt12
moving a few more headings around to help with information block layout
12:19 Changeset [27682] by jmt12
Copying makeAllDirectories() from vanilla FileUtils.pm
00:21 Changeset [27681] by davidb
Fine-tuning of domain name and URL prefix to better cope with being behind …
00:19 Changeset [27680] by davidb
Embellishment of the web content
00:19 Changeset [27679] by davidb
Embellishment of the web content


21:31 Changeset [27678] by ak19
Uploading report to caveat-emptor page now works on windows
16:51 Changeset [27677] by ak19
Envi passes an extra arg, the env_verbosity, to all tasks now. It's passed …
16:51 Changeset [27676] by ak19
Envi passes an extra arg, the env_verbosity, to all tasks now. It's passed …
16:13 Changeset [27675] by ak19
Tidying up my previous 2 commits on this file.
16:06 Changeset [27674] by sjm84
Added the missing GSDL3SRCHOME environment variable into the Perl …
14:53 Changeset [27673] by kjdon
need to return the set name!
14:02 Changeset [27672] by kjdon
adding new functionality to identify request
13:16 Changeset [27671] by kjdon
added a couple of methods to get some extra xml for the identify request
12:07 Changeset [27670] by kjdon
added some more fields to bring identify response in line with gs2 …
09:26 Changeset [27669] by jmt12
Sort compute nodes naturally before labelling them with incremental worker …


22:00 Changeset [27668] by kjdon
task.pl works for windows now, except for the upload/emailing function. …
21:44 Changeset [27667] by ak19
Another change to not be linux/mac specific, so that the diffcol task will …
21:37 Changeset [27666] by kjdon
Using perl rather than bash to test if file is binary, so that this test …
20:51 Changeset [27665] by kjdon
Needed to add a second change to the previous commit
20:45 Changeset [27664] by kjdon
Fixed an issue with building on Windows where a regex in an eval failed …
16:42 Changeset [27663] by ak19
A TODO listing
16:37 Changeset [27662] by ak19
Dr Bainbridge fixed the dec regex problem and other regex issues.
15:36 Changeset [27661] by sjm84
1. Undoing one of the commits made yesterday because 2013.06 matches the …
15:06 Changeset [27660] by kjdon
Outdated build of wordpf collection
15:05 Changeset [27659] by kjdon
Outdated build of demo collection


20:40 Changeset [27658] by ak19
Although task(.sh) is now deprecated, committing a local change
20:37 Changeset [27657] by ak19
Delete the upload_dir and recreate it before moving the generated reports …
19:35 Changeset [27656] by ak19
Fix for files not getting copied over into the upload_dir on the 32 bit VM
17:51 Changeset [27655] by ak19
Fix for an idiosyncrasy dependent on this month's date (2013.06.) matching …
10:59 Changeset [27654] by jmt12
Add the ability to stagger the starting of Mappers by placing a 'delay.me' …
10:52 Changeset [27653] by jmt12
Forgot to pull self off the head of arguments
10:51 Changeset [27652] by jmt12
Changing buffer to 128K (slightly faster) and adding a comment explaining …
10:50 Changeset [27651] by jmt12
10:49 Changeset [27650] by jmt12
10:48 Changeset [27649] by jmt12
No longer in SVN control
10:48 Changeset [27648] by jmt12
Template for setup.bash - a user will have to populate Hadoop fields
10:37 Changeset [27647] by jmt12
Add some more testing to ensure any local copy of a media file is the same …
10:34 Changeset [27646] by jmt12
Adding an option to allow me to suppress RSS file writing, -no_rss, as it …
10:31 Changeset [27645] by jmt12
10:31 Changeset [27644] by jmt12
Extended to support HDFS-access via NFS. This applies to both the call to …
10:30 Changeset [27643] by jmt12
Changed the script generator so it can recurse through directories and …
10:28 Changeset [27642] by jmt12
A script I downloaded that successfully splits video files - something I …
10:12 Changeset [27641] by jmt12
Altered order of arguments and allow archives dir to be passed as argument …
10:11 Changeset [27640] by jmt12
10:10 Changeset [27639] by jmt12
Change it so failure to open a filehandle isn't fatal - leave it up to the …
10:09 Changeset [27638] by jmt12
Change it so failure to open a filehandle isn't fatal - leave it up to the …


22:05 Changeset [27637] by ak19
Added section which does uploading to nzdl (though that will only work on …
20:26 Changeset [27636] by ak19
In order for the DEC's GS2 to compile, it needs gnome-lib. So build.xml …
17:58 Changeset [27635] by ak19
Checking return status of compilation so it stops on error.
17:47 Changeset [27634] by ak19
Changed order of @INC 'unshifts' due to clash over Greenstone own …
17:23 Changeset [27633] by sjm84
Under linux, test for linux/magic.h causes a compilation error on older …
16:15 Changeset [27632] by ak19
Added external reference to makegs2.sh (useful for derk release kit).
12:36 Changeset [27631] by jmt12
A proxy to allow NFS access to HDFS


20:22 Changeset [27630] by ak19
Minor adjustments
20:17 Changeset [27629] by ak19
Build scripts run via function again, as the function now calls system() …
19:47 Changeset [27628] by ak19
Added main() behaviour and sending mail attachment and fixed issues in …
17:32 Changeset [27627] by ak19
File lock code currently experimental, so not in the main greenstone build …
16:59 Changeset [27626] by ak19
Dr Bainbridge hopes including sys/types.h may help compiling filelock.cpp …
16:55 Changeset [27625] by ak19
Correcting comment
16:53 Changeset [27624] by ak19
renamed winlock to filelock as it's not windows specific.
15:50 Changeset [27623] by ak19
Using FileUtils::FileExists in place of minus-e for the same test.
15:49 Changeset [27622] by ak19
Added in an automatic compilation mode, which you can use by passing in …


20:55 Changeset [27621] by ak19
Ported most of the task.sh functionality across to task.pl. The email and …
17:00 Changeset [27620] by ak19
Rebuilt dist on 32 bit machine. Dr Bainbridge removed the absolute paths …
16:37 Changeset [27619] by ak19
Dr Bainbridge removed the absolute paths in symbolic links to new bz …
16:37 Changeset [27618] by ak19
Dr Bainbridge removed the absolute paths in symbolic links to new bz …
13:13 Changeset [27617] by sjm84
Various improvements and fixes mostly to do with adding depositor …


22:55 Changeset [27616] by davidb
Fine tuning
22:21 Changeset [27615] by davidb
22:18 Changeset [27614] by davidb
Now generated from '.in' file, and no longer under SVN control
22:17 Changeset [27613] by davidb
Create AFR-SETUP.sh from its '.in' counterpart if it does not already …
22:16 Changeset [27612] by davidb
.in file from which a local copy is made (not under SVN control) that can …
20:46 Changeset [27611] by davidb
Improved wording
20:40 Changeset [27610] by davidb
Bespoke script for the SALAMI script
20:39 Changeset [27609] by davidb
Setting needed by 'cluster1'
20:39 Changeset [27608] by davidb
Tidy up
20:02 Changeset [27607] by davidb
Due to unusual configure/compile script, tarclean tested for earlier on …
20:00 Changeset [27606] by davidb
Yasm needed by cascade make, but found not to be present (by default) on …
19:59 Changeset [27605] by davidb
Yasm needed by cascade make, but found not to be present (by default) on …
18:43 Changeset [27604] by ak19
Fixing up diffcol process so it works better. Current state finds no …
17:33 Changeset [27603] by ak19
Updating indexes after adding a sort on keys for archive gdb files in …
17:23 Changeset [27602] by ak19
Adding sorting on keys. Particularly necessary for diffcol.pl (automated …
17:19 Changeset [27601] by ak19
Updating indexes after adding a sort on keys for archive gdb files in …
17:11 Changeset [27600] by ak19
Two things 1. Moving John's windows (un)locking to new file winlock.cpp …
12:01 Changeset [27599] by davidb
Update to text message
11:52 Changeset [27598] by davidb
Used to control the hostname and port services run on
09:32 Changeset [27597] by davidb
Additional header file included -- to help with finding the Unix mkdir …


18:24 Changeset [27596] by ak19
Committing archiveinf-doc after build.


17:10 Changeset [27595] by jmt12
Updating list of untarred directories to ignore
17:09 Changeset [27594] by jmt12
Extend hadoop_import.pl to be able to start and stop the Thrift server(s)
16:50 Changeset [27593] by jmt12
Need Class Accessor for Thrift client under Rocks
16:34 Changeset [27592] by jmt12
Adding in a script to allow a daemon version of Thrift to be started (and …
16:32 Changeset [27591] by jmt12
Ensure Thrift will, by default, attempt to connect to the local machine …
16:27 Changeset [27590] by jmt12
Adding statistics about data locality, and highlighting tasks where file …
14:19 Changeset [27589] by jmt12
Fixing up some minor bugs in regexes
14:12 Changeset [27588] by jmt12
Extend parser to support jobs that are split over several logs. Also …
11:29 Changeset [27587] by jmt12
Allow debug mode to be enabled from the command line
11:15 Changeset [27586] by jmt12
Updating script to take date of hadoop job into account when searching for …
10:25 Changeset [27585] by jmt12
The perl on Medusa won't let you immediately treat a returned array in a …
10:23 Changeset [27584] by jmt12
I wasn't doing -r when attempting to clear directories left in /tmp by …
10:22 Changeset [27583] by jmt12
Adding code to differentiate between workers in a cluster - all of which …


21:00 Changeset [27582] by ak19
New imagemagick distribution for 32 BIT LINUX that includes zlib (libz …
20:47 Changeset [27581] by ak19
New imagemagick distribution for linux 64 bit that includes zlib (libz …
20:01 Changeset [27580] by ak19
Adding libbz2 (bzip2) and its cascade-make file from gnome-lib, adjusted …
18:49 Changeset [27579] by ak19
A few more date fields need to be ignored when diffing.
18:07 Changeset [27578] by ak19
Doing a sort on all occurrences of readdir, so readdir lists dir contents …
18:05 Changeset [27577] by ak19
Updating index after sort to DirectoryPlugin's use of readdir
17:53 Changeset [27576] by ak19
Setting OIDtype to stable hash_on_full_filename in the collect.cfg itself
17:50 Changeset [27575] by ak19
Sorting directories
13:54 Changeset [27574] by ak19
Replacing with new index and archives folders.
13:53 Changeset [27573] by ak19
Replacing with new index and archives folders.
13:53 Changeset [27572] by ak19
Replacing with new index and archives folders.
11:27 Changeset [27571] by jmt12
increase timeout to 4 hours per map
10:53 Changeset [27570] by jmt12
Make the warning about binmode() not being applicable more meaningful, and …
10:48 Changeset [27569] by jmt12
Trying to streamline the error messages from failing to link (otherwise I …
10:24 Changeset [27568] by jmt12
Testing on Medusa suggests optimal buffer size around 128K
10:20 Changeset [27567] by jmt12
Found a printWarning that I hadn't changed to use the FileUtils version


16:21 Changeset [27566] by jmt12
Making the getcpu optional - as it isn't available on Medusa (but then I …
12:02 Changeset [27565] by kjdon
ignore special keywords which should be only in indexes list, and ignore …
12:01 Changeset [27564] by kjdon
check if defined before setting sortfields, as there may not be any
11:29 Changeset [27563] by kjdon
implementing the new build option sections_sort_on_document_metadata
11:28 Changeset [27562] by kjdon
added new build option sections_sort_on_document_metadata. same as …
11:23 Changeset [27561] by jmt12
Adding very basic compile file for getcpu - can't be bothered going …
11:16 Changeset [27560] by jmt12
Fixing typo in regexp that meant filenames sometimes ignored
11:15 Changeset [27559] by jmt12
Changed mime-type away from binary - I hope. Meanwhile, generate …
11:11 Changeset [27558] by jmt12
Forgot that Hadoop Map processes no longer have the environment …


21:38 Changeset [27557] by ak19
Beginnings of changes to make the diffcol task use a standalone …
19:57 Changeset [27556] by ak19
Adding the missing task.pl for envi to invoke
18:50 Changeset [27555] by ak19
Redid the Small-HTML collection so it uses the correct name from the …
18:45 Changeset [27554] by ak19
Deleting to replace with new version built from scratch and with new …
14:42 Changeset [27553] by ak19
Function needed to return a bool in order to compile.
13:08 Changeset [27552] by jmt12
Altering the debug comments to provide IO boundary timings a little more …
13:07 Changeset [27551] by jmt12
Altered so that it expects to be given a CSV containing parallel …
13:06 Changeset [27550] by jmt12
Ensure the hostname is added to the Hadoop logs so we can identify the …
13:04 Changeset [27549] by jmt12
Extract information from the logs generated by parallel Greenstone using …
13:04 Changeset [27548] by jmt12
Extract information from the logs generated by parallel Greenstone using …
13:03 Changeset [27547] by jmt12
Rejigging some processing comments
13:02 Changeset [27546] by jmt12
Adding the ability for the Hadoop Mapper to determine what CPU number it …
13:00 Changeset [27545] by jmt12
Ignoring just the compiled file (for now)
13:00 Changeset [27544] by jmt12
A tiny C script to guesstimate the CPU the calling Process is on
11:53 Changeset [27543] by jmt12
Adding generate_gantt.pl script in its original form - i.e. directly reads …


19:42 Changeset [27542] by ak19
Minor correction to commit just made.
19:35 Changeset [27541] by ak19
Message being mailed now includes the html version of the report as an …
18:23 Changeset [27540] by ak19
1. Reports better sent to the greenstone mail id 2. Need to import with …
17:31 Changeset [27539] by ak19
Cosmetic change: fixing spelling error, to help locate other issues.
17:22 Changeset [27538] by ak19
Using FileUtils::removeFiles in place of utils::rm
16:46 Changeset [27537] by ak19
Bugfix: should be testing strOutputFormat is set to xml, not strOutput
16:29 Changeset [27536] by ak19
FileUtils functions instead of util.pm
16:09 Changeset [27535] by ak19
Using the recommended FileUtils' subroutines for the deprecated utils.pm …
15:56 Changeset [27534] by kjdon
more changes for super collection stuff. Now can handle having collections …
15:52 Changeset [27533] by kjdon
added comments about new oaisupercollection configuration command
11:12 Changeset [27532] by jmt12
Add the ability to configure the Thrift connector using a 'thrift.conf' …
11:11 Changeset [27531] by jmt12
Only output the message about using copy instead of hard/soft link once
11:08 Changeset [27530] by jmt12
Clear out old logs, and adding more comments about what the script is …
11:07 Changeset [27529] by jmt12
Fixing a bug (HDFS drivers not being recognized due to sometimes being …
11:05 Changeset [27528] by kjdon
implemented oaisupercollection. add to oai.cfg and the server will make a …
09:45 Changeset [27527] by jmt12
Calling the isHDFS() in FileUtils rather than the non-existent one in …
09:28 Changeset [27526] by jmt12
Adding in a 'isHDFS()' function so that some plugins (SimpleVideoPlug) can …
09:27 Changeset [27525] by jmt12
Adding in a 'isHDFS()' function so that some plugins (SimpleVideoPlug) can …
