|
|
@30354
|
8 years |
jmt12 |
Extending manifest v2 support to allow for directories to be listed in …
|
|
|
@30353
|
8 years |
jmt12 |
Replacing old file util calls with new FileUtil ones - including …
|
|
|
@30352
|
8 years |
jmt12 |
Adding in a function to test whether a File driver has a certain …
|
|
|
@30351
|
8 years |
jmt12 |
Restructured readDirectory to not die if directory isn't readable
|
|
|
@30350
|
8 years |
jmt12 |
Initial commit of rewrite so DBDrivers are object-oriented like …
|
|
|
@30308
|
8 years |
jmt12 |
Another CPAN module required by the Gantt Chart generation script
|
|
|
@30306
|
8 years |
jmt12 |
Making the setup of CPAN path more robust based on the better control …
|
|
|
@30305
|
8 years |
jmt12 |
replacing deprecated function calls to newer ones in FileUtils
|
|
|
@30302
|
8 years |
jmt12 |
Altering os-specific installed path and trying to improve clean to …
|
|
|
@30301
|
8 years |
jmt12 |
Missed displaying one variable in a fprintf statement (threw a …
|
|
|
@30300
|
8 years |
jmt12 |
Bunch of changes (including whitespace safety ones) due to different …
|
|
|
@30299
|
8 years |
jmt12 |
Making the removal of the jar file conditional on it actually being …
|
|
|
@30298
|
8 years |
jmt12 |
Correcting path to manifest files for use in Greenstone3. Started to …
|
|
|
@30297
|
8 years |
jmt12 |
Altering the Makefile.in to determine whether it is in GSDL2 or GSDL3 …
|
|
|
@30296
|
8 years |
jmt12 |
extending to support GSDL3 as well
|
|
|
@30295
|
8 years |
jmt12 |
Using the proper environment variable, GSDL3SRCHOME, rather than GSDL3HOME
|
|
|
@30294
|
8 years |
jmt12 |
Typo would have prevented the generated configure script from working …
|
|
|
@30293
|
8 years |
jmt12 |
The missing script was, ironically, missing
|
|
|
@30292
|
8 years |
jmt12 |
Removing reference to debugging module Devel::Peek
|
|
|
@30291
|
8 years |
jmt12 |
Minor changes in generated configs mostly to do with whitespace safety
|
|
|
@30290
|
8 years |
jmt12 |
Extended build script to make Hadoop support optional. If no …
|
|
|
@30289
|
8 years |
jmt12 |
Significant changes to read() function - essentially split in half …
|
|
|
@30288
|
8 years |
jmt12 |
No longer different than the vanilla Greenstone version
|
|
|
@30287
|
8 years |
jmt12 |
Extending error messages a bit to differentiate between linking that …
|
|
|
@30286
|
8 years |
jmt12 |
Adding a customized version of inexport.pm allowing us to handle …
|
|
|
@30285
|
8 years |
jmt12 |
Adding in a call to untar/compile/install Hadoop support package
|
|
|
@30284
|
8 years |
jmt12 |
updated svnignore
|
|
|
@30283
|
8 years |
jmt12 |
Ignoring the unpacked versions of a couple of new packages used to …
|
|
|
@30282
|
8 years |
jmt12 |
Ensure the perl/cpan install directories exist before trying to copy …
|
|
|
@30281
|
8 years |
jmt12 |
Cascade-Make file to provide Hadoop functionality
|
|
|
@30280
|
8 years |
jmt12 |
Ensure the platform specific directory for built files exists. It may …
|
|
|
@30278
|
8 years |
jmt12 |
Might as well add this, with default setting for Hadoop, to SVN... …
|
|
|
@29663
|
9 years |
jmt12 |
Supporting grayscale printing, fixing mismatched tags and speechmarks, …
|
|
|
@29662
|
9 years |
jmt12 |
Now removes building and index directories if found
|
|
|
@29661
|
9 years |
jmt12 |
A helper script to clean-up the bogus directories sometimes created by …
|
|
|
@29660
|
9 years |
jmt12 |
making the debug variable global... can't remember why though
|
|
|
@29649
|
9 years |
jmt12 |
Perseus was an attempt to add functionality to automatically and …
|
|
|
@29276
|
9 years |
jmt12 |
I need to measure the time spent on generating the initial manifest, …
|
|
|
@29261
|
9 years |
jmt12 |
Removing some of the extraneous IO from high cpu importing... altering …
|
|
|
@29260
|
9 years |
jmt12 |
Replacing the obsolete call to util::file_lastmodified() with the …
|
|
|
@29259
|
9 years |
jmt12 |
Kea override allowing for fixed processor affinity if necessary …
|
|
|
@29258
|
9 years |
jmt12 |
Initial checkin of a new TDB infodb that allows each worker thread in …
|
|
|
@29257
|
9 years |
jmt12 |
Allow for collection configuration to be passed down to parallel …
|
|
|
@29243
|
9 years |
jmt12 |
Allowing for file linking to be disabled
|
|
|
@29162
|
9 years |
jmt12 |
The Lingua module for detecting syllables - used when determining …
|
|
|
@29161
|
9 years |
jmt12 |
Some modules aren't available on cluster... add test and include path …
|
|
|
@29160
|
9 years |
jmt12 |
Adding blowfish encryption package to give text processing some work to do
|
|
|
@29158
|
9 years |
jmt12 |
Initial checkin of script to convert a number of Greenstone|| logs …
|
|
|
@29106
|
9 years |
jmt12 |
Check-in of script to symlink lorem files to matching files in another …
|
|
|
@29104
|
9 years |
jmt12 |
A script for extracting textual metrics from a collection of text …
|
|
|
@29103
|
9 years |
jmt12 |
updated - not any more efficient (Schlemiel the painter performance) …
|
|
|
@28779
|
10 years |
jmt12 |
Making timing message all sorts of purty
|
|
|
@28778
|
10 years |
jmt12 |
Typo - underscore where I meant hyphen
|
|
|
@28777
|
10 years |
jmt12 |
Need to include path to mpiimport on Medusa
|
|
|
@28771
|
10 years |
jmt12 |
A version of BasePlugout where the RSS feed update attempts to write …
|
|
|
@28770
|
10 years |
jmt12 |
Adding microtiming... a little tricky what with TDBServer taking …
|
|
|
@28769
|
10 years |
jmt12 |
No longer used. import.pl now smart enough to dynamically load …
|
|
|
@28768
|
10 years |
jmt12 |
Initially added microtime to this script, but then remembered it isn't …
|
|
|
@28767
|
10 years |
jmt12 |
Drastically increased the script to allow 1) battery of imports backed …
|
|
|
@28766
|
10 years |
jmt12 |
Removing an occasional few characters of garbage that turn up in the …
|
|
|
@28764
|
10 years |
jmt12 |
Adding microsecond timing messages
|
|
|
@28666
|
10 years |
jmt12 |
A script to transform a strace.out into a Tab separated file worthy of …
|
|
|
@28665
|
10 years |
jmt12 |
Latest changes to workaround resumed syscalls massive duration problem
|
|
|
@28654
|
10 years |
jmt12 |
Removed recordEarliestDatestamp() function as that now lurks in the …
|
|
|
@28653
|
10 years |
jmt12 |
Changed the way a require was 'eval'd - but I have no idea why
|
|
|
@28652
|
10 years |
jmt12 |
Changes to support running the reports over logs produced from …
|
|
|
@28649
|
10 years |
jmt12 |
A version of a Textfile reading plugin that has a configurable load …
|
|
|
@28648
|
10 years |
jmt12 |
Adding a short delay after writing to the flush_cache file just to …
|
|
|
@28647
|
10 years |
jmt12 |
Adding progress messages and making a debug message optional
|
|
|
@28646
|
10 years |
jmt12 |
A script that uses strace to produce IO metrics of a Greenstone import
|
|
|
@28645
|
10 years |
jmt12 |
Script to generate a report on data locality from GreenstoneHadoop logs
|
|
|
@28358
|
10 years |
jmt12 |
Replacing my earlier decision to only have data locality information …
|
|
|
@28357
|
10 years |
jmt12 |
used to update the data_locality.csv file in the case where other …
|
|
|
@28356
|
10 years |
jmt12 |
Support the legacy version of taskno in the data_locality.csv file (we …
|
|
|
@28312
|
10 years |
jmt12 |
Working on finer control over data locality - so I can configure a run …
|
|
|
@28192
|
10 years |
jmt12 |
Need to still output Greenstone messages to log otherwise I can't …
|
|
|
@28191
|
10 years |
jmt12 |
Removing redundant error stream redirect - this wasn't causing the …
|
|
|
@28190
|
10 years |
jmt12 |
Had accidentally hardcoded the max replication number - allow it to be …
|
|
|
@28189
|
10 years |
jmt12 |
Replace the newer (and faster) while(@file) loop with the older (and …
|
|
|
@28188
|
10 years |
jmt12 |
Minor fix to allow for tasks that start in the same second (now each …
|
|
|
@28187
|
10 years |
jmt12 |
A customized version of Kea.pm that looks in the correct place for …
|
|
|
@28186
|
10 years |
jmt12 |
A (failed) attempt to use the unix iotop tool to determine IO percentage
|
|
|
@28018
|
10 years |
jmt12 |
Try really hard to capture the output from 'time' function as Medusa …
|
|
|
@28017
|
10 years |
jmt12 |
Forgot to add processing comment before call to hadoop_import.pl
|
|
|
@28016
|
10 years |
jmt12 |
Allow the hadoop report generator to parse start and end times …
|
|
|
@28015
|
10 years |
jmt12 |
Add an extra option that allows me to pass in the directory to write …
|
|
|
@28014
|
10 years |
jmt12 |
Remove tasks that have had data locality established from the array of …
|
|
|
@28013
|
10 years |
jmt12 |
A new script to run a battery of Hadoop ingests at varying replication …
|
|
|
@28012
|
10 years |
jmt12 |
Express start time as a double as well
|
|
|
@28011
|
10 years |
jmt12 |
Turn off debugging in the copy in SVN
|
|
|
@28010
|
10 years |
jmt12 |
Correctly set up the environment for calls to txt2tdb and also replace …
|
|
|
@28001
|
10 years |
jmt12 |
Write datestamp using dbutil if applicable
|
|
|
@27996
|
10 years |
jmt12 |
A new version of the archive with minor changes to log4j configuration
|
|
|
@27995
|
10 years |
jmt12 |
Just adding some code comments
|
|
|
@27915
|
10 years |
jmt12 |
A new PlugOut that doesn't write any intermediate files (bar those …
|
|
|
@27914
|
10 years |
jmt12 |
Trying to get around a couple of divide-by-zero issues when generating …
|
|
|
@27913
|
10 years |
jmt12 |
Made the ingester to be used (version 1 without reduce phase, or …
|
|
|
@27912
|
10 years |
jmt12 |
Modified the compilation to include the new ingester and its co-requisites.
|
|
|
@27911
|
10 years |
jmt12 |
Modified the compilation to include the new ingester and its co-requisites
|
|
|
@27910
|
10 years |
jmt12 |
Extended the existing HadoopGreenstoneIngest with proper Reduce phase …
|
|
|