source:

Revision Log Mode:


Legend:

Added
Modified
Copied or renamed
Diff Rev Age Author Log Message
(edit) @27598   11 years davidb Used to control the hostname and port services run on
(edit) @27597   11 years davidb Additional header file included -- to help with finding the Unix mkdir …
(edit) @27596   11 years ak19 Committing archiveinf-doc after build.
(edit) @27595   11 years jmt12 Updating list of untarred directories to ignore
(edit) @27594   11 years jmt12 Extend hadoop_import.pl to be able to start and stop the Thrift server(s)
(edit) @27593   11 years jmt12 Need Class Accessor for Thrift client under Rocks
(edit) @27592   11 years jmt12 Adding in a script to allow a daemon version of Thrift to be started …
(edit) @27591   11 years jmt12 Ensure Thrift will, be default, attempt to connect to the local …
(edit) @27590   11 years jmt12 Adding statistics about data locality, and highlighting tasks where …
(edit) @27589   11 years jmt12 Fixing up some minor bugs in regex's
(edit) @27588   11 years jmt12 Extend parser to support jobs that are split over several logs. Also …
(edit) @27587   11 years jmt12 Allow debug mode to be enabled from the command line
(edit) @27586   11 years jmt12 Updating script to date date of hadoop job into account when searching …
(edit) @27585   11 years jmt12 The perl on Medusa won't let you immediately treat a returned array in …
(edit) @27584   11 years jmt12 I wasn't doing -r when attempting to clear directories left in /tmp by …
(edit) @27583   11 years jmt12 Adding code to differentiate between workers in a cluster - all of …
(edit) @27582   11 years ak19 New imagemagick distribution for 32 BIT LINUX that includes zlib (libz …
(edit) @27581   11 years ak19 New imagemagick distribution for linux 64 bit that includes zlib (libz …
(edit) @27580   11 years ak19 Adding libbz2 (bzip2) and its cascade-make file from gnome-lin, …
(edit) @27579   11 years ak19 A few more date fields need to be ignored when diffing.
(edit) @27578   11 years ak19 Doing a sort on all occurrences of readdir, so readdir lists dir …
(edit) @27577   11 years ak19 Updating index after sort to DirectoryPlugin's use of readdir
(edit) @27576   11 years ak19 Setting OIDtype to stable hash_on_full_filename in the collect.cfg itself
(edit) @27575   11 years ak19 Sorting directories
(edit) @27574   11 years ak19 Replacing with new index and archives folders.
(edit) @27573   11 years ak19 Replacing with new index and archives folders.
(edit) @27572   11 years ak19 Replacing with new index and archives folders.
(edit) @27571   11 years jmt12 increase timeout to 4 hours per map
(edit) @27570   11 years jmt12 Make the warning about binmode() not being applicable more meaningful, …
(edit) @27569   11 years jmt12 Trying to streamline the error messages from failing to link …
(edit) @27568   11 years jmt12 Testing on Medusa suggests optimal buffer size around 128K
(edit) @27567   11 years jmt12 Found a printWarning that I handed changed to use the FileUtils version
(edit) @27566   11 years jmt12 Making the getcpu optional - as it isn't available on Medusa (but then …
(edit) @27565   11 years kjdon ignore special keywords which should be only in indexes list, and …
(edit) @27564   11 years kjdon check if defined before setting sortfields, as there may not be any
(edit) @27563   11 years kjdon implementing the new build option sections_sort_on_document_metadata
(edit) @27562   11 years kjdon added new build option sections_sort_on_document_metadata. same as …
(edit) @27561   11 years jmt12 Adding very basic compile file for getcpu - can't be bothered going …
(edit) @27560   11 years jmt12 Fixing typo in regexp that meant filenames sometimes ignored
(edit) @27559   11 years jmt12 Changed mime-type away from binary - I hope. Meanwhile, generate …
(edit) @27558   11 years jmt12 Forgot that Hadoop Map processes no longer have the environment …
(edit) @27557   11 years ak19 Beginnings of changes to make the diffcol task use a standalone …
(edit) @27556   11 years ak19 Adding the missing task.pl for envi to invoke
(edit) @27555   11 years ak19 Redid the Small-HTML collection so it uses the correct name from the …
(edit) @27554   11 years ak19 Deleting to replace with new version built from scratch and with new …
(edit) @27553   11 years ak19 Function needed to return a bool in order to compile.
(edit) @27552   11 years jmt12 Altering the debug comments to provide IO boundary timings a little …
(edit) @27551   11 years jmt12 Altered so that it expects to be given a CSV containing parallel …
(edit) @27550   11 years jmt12 Ensure the hostname is added to the Hadoop logs so we can identify the …
(edit) @27549   11 years jmt12 Extract information from the logs generated by parallel Greenstone …
(edit) @27548   11 years jmt12 Extract information from the logs generated by parallel Greenstone …
(edit) @27547   11 years jmt12 Rejigging some processing comments
(edit) @27546   11 years jmt12 Adding the ability for the Hadoop Mapper to determine what CPU number …
(edit) @27545   11 years jmt12 Ignoring just the compiled file (for now)
(edit) @27544   11 years jmt12 A tiny C script to guesstimate the CPU the calling Process is on
(edit) @27543   11 years jmt12 Adding generate_gantt.pl script in its original form - i.e. directly …
(edit) @27542   11 years ak19 Minor correction to commit just made.
(edit) @27541   11 years ak19 Message being mailed now includes the html version of the report as an …
(edit) @27540   11 years ak19 1. Reports better sent to the greenstone mail id 2. Need to import …
(edit) @27539   11 years ak19 Cosmetic change: fixing spelling error, to help locate other issues.
(edit) @27538   11 years ak19 Using FileUtils::removeFiles in place of utils::rm
(edit) @27537   11 years ak19 Bugfix: should be testing strOutputFormat is set to xml, not strOutput
(edit) @27536   11 years ak19 FileUtils functions instead of util.pm
(edit) @27535   11 years ak19 Using the recommended FileUtils' subroutines for the deprecated …
(edit) @27534   11 years kjdon more changes for super collection stuff. Now can handle having …
(edit) @27533   11 years kjdon added comments about new oaisupercollection configuration command
(edit) @27532   11 years jmt12 Add the ability to configure the Thrift connector using a …
(edit) @27531   11 years jmt12 Only output the message about using copy instead of hard/soft link once
(edit) @27530   11 years jmt12 Clear out old logs, and adding more comments about what the script is …
(edit) @27529   11 years jmt12 Fixing a bug (HDFS drivers not being recognized due to sometimes …
(edit) @27528   11 years kjdon implemented oaisupercollection. add to oai.cfg and the server will …
(edit) @27527   11 years jmt12 Calling the isHDFS() in FileUtils rather than the non-existant one in utils
(edit) @27526   11 years jmt12 Adding in a 'isHDFS()' function so that some plugins (SimpleVideoPlug) …
(edit) @27525   11 years jmt12 Adding in a 'isHDFS()' function so that some plugins (SimpleVideoPlug) …
(edit) @27524   11 years ak19 Haven't run diffcol yet, but making the basic changes necessary to get …
(edit) @27523   11 years ak19 Committing the first of the pre-built tutorials (built on CentOS, 64 …
(edit) @27522   11 years ak19 Correcting some minor bugs during build.
(edit) @27521   11 years ak19 Resetting gsdlhome (reset-gsdlhome command) should not just update the …
(edit) @27520   11 years ak19 Undoing commit to FileUtils::closeFileHandle since John thinks the …
(edit) @27519   11 years ak19 Using the recommended FileUtils.pm equivalents for util.pm subroutines.
(edit) @27518   11 years ak19 Completing the listing of functions in FileUtils.pm
(edit) @27517   11 years jmt12 Noticed and replaced a couple of -e's (that should have been -d's …
(edit) @27516   11 years jmt12 Matching a call to the new FileUtils::openFileHandle() (in …
(edit) @27515   11 years jmt12 Making the file used durig buffertes be configurable
(edit) @27514   11 years jmt12 Altering code to allow configurable length of read/write buffer when …
(edit) @27513   11 years jmt12 Restoring the original logic around working_info (although still not …
(edit) @27512   11 years jmt12 Adding in a special test for measuring the effect of altering ThriftFS …
(edit) @27511   11 years kjdon pass in file handle as a reference
(edit) @27510   11 years ak19 Using the recommended FileUtils.pm methods in place of the deprecated …
(edit) @27509   11 years ak19 Using the recommended FileUtils.pm methods in place of the deprecated …
(edit) @27508   11 years ak19 closeFileHandle() should deal with the case of the file not existing.
(edit) @27507   11 years jmt12 Ensuring the downloadable versions of the XML exports are stored in a …
(edit) @27506   11 years jmt12 Adding a (hopefully) safe recursive delete, and removed some debugging …
(edit) @27505   11 years jmt12 Closing the RSS filehandle with the new function in FileUtils too
(edit) @27504   11 years jmt12 Changing the wat get_new_doc_dir() works so that it creates the new …
(edit) @27503   11 years kjdon modified to handle files with just a single record. So no collection …
(edit) @27502   11 years kjdon trying to fix double encoding issue for isis files. not sure that I …
(edit) @27501   11 years jmt12 Missed (another) old style file open that instead needs to go through …
(edit) @27500   11 years jmt12 Missed an old style file open that instead needs to go through …
(edit) @27499   11 years jmt12 New configuration options to control the creation of directories in …
Note: See TracRevisionLog for help on using the revision log.