|
|
@28653
|
[28653]
|
6 years |
jmt12 |
Changed the way a require was 'eval'd - but I have no idea why
|
|
|
@28652
|
[28652]
|
6 years |
jmt12 |
Changes to support running the reports over logs produced from multicore …
|
|
|
@28649
|
[28649]
|
6 years |
jmt12 |
A version of a Textfile reading plugin that has a configurable load …
|
|
|
@28648
|
[28648]
|
6 years |
jmt12 |
Adding a short delay after writing to the flush_cache file just to ensure …
|
|
|
@28647
|
[28647]
|
6 years |
jmt12 |
Adding progress messages and making a debug message optional
|
|
|
@28646
|
[28646]
|
6 years |
jmt12 |
A script that uses strace to produce IO metrics of a Greenstone import
|
|
|
@28645
|
[28645]
|
6 years |
jmt12 |
Script to generate a report on data locality from GreenstoneHadoop logs
|
|
|
@28358
|
[28358]
|
6 years |
jmt12 |
Replacing my earlier decision to only have data locality information …
|
|
|
@28357
|
[28357]
|
6 years |
jmt12 |
used to update the data_locality.csv file in the case where other …
|
|
|
@28356
|
[28356]
|
6 years |
jmt12 |
Support the legacy version of taskno in the data_locality.csv file (we now …
|
|
|
@28312
|
[28312]
|
6 years |
jmt12 |
Working on finer control over data locality - so I can configure a run …
|
|
|
@28192
|
[28192]
|
6 years |
jmt12 |
Need to still output Greenstone messages to log otherwise I can't …
|
|
|
@28191
|
[28191]
|
6 years |
jmt12 |
Removing redundant error stream redirect - this wasn't causing the issue I …
|
|
|
@28190
|
[28190]
|
6 years |
jmt12 |
Had accidentally hardcoded the max replication number - allow it to be …
|
|
|
@28189
|
[28189]
|
6 years |
jmt12 |
Replace the newer (and faster) while(@file) loop with the older (and more …
|
|
|
@28188
|
[28188]
|
6 years |
jmt12 |
Minor fix to allow for tasks that start in the same second (now each …
|
|
|
@28187
|
[28187]
|
6 years |
jmt12 |
A customized version of Kea.pm that looks in the correct place for newer …
|
|
|
@28186
|
[28186]
|
6 years |
jmt12 |
A (failed) attempt to use the unix iotop tool to determine IO percentage
|
|
|
@28018
|
[28018]
|
6 years |
jmt12 |
Try really hard to capture the output from 'time' function as Medusa lets …
|
|
|
@28017
|
[28017]
|
6 years |
jmt12 |
Forgot to add processing comment before call to hadoop_import.pl
|
|
|
@28016
|
[28016]
|
6 years |
jmt12 |
Allow the hadoop report generator to parse start and end times expressed …
|
|
|
@28015
|
[28015]
|
6 years |
jmt12 |
Add an extra option that allows me to pass in the directory to write log …
|
|
|
@28014
|
[28014]
|
6 years |
jmt12 |
Remove tasks that have had data locality established from the array of …
|
|
|
@28013
|
[28013]
|
6 years |
jmt12 |
A new script to run a battery of Hadoop ingests at varying replication …
|
|
|
@28012
|
[28012]
|
6 years |
jmt12 |
Express start time as a double as well
|
|
|
@28011
|
[28011]
|
6 years |
jmt12 |
Turn off debugging in the copy in SVN
|
|
|
@28010
|
[28010]
|
6 years |
jmt12 |
Correctly set up the environment for calls to txt2tdb and also replace …
|
|
|
@28001
|
[28001]
|
6 years |
jmt12 |
Write datestamp using dbutil if applicable
|
|
|
@27996
|
[27996]
|
6 years |
jmt12 |
A new version of the archive with minor changes to log4j configuration
|
|
|
@27995
|
[27995]
|
6 years |
jmt12 |
Just adding some code comments
|
|
|
@27915
|
[27915]
|
6 years |
jmt12 |
A new PlugOut that doesn't write any intermediate files (bar those …
|
|
|
@27914
|
[27914]
|
6 years |
jmt12 |
Trying to get around a couple of divide-by-zero issues when generating …
|
|
|
@27913
|
[27913]
|
6 years |
jmt12 |
Made the ingester to be used (version 1 without reduce phase, or version 2 …
|
|
|
@27912
|
[27912]
|
6 years |
jmt12 |
Modified the compilation to include the new ingester and its …
|
|
|
@27911
|
[27911]
|
6 years |
jmt12 |
Modified the compilation to include the new ingester and its co-requisites
|
|
|
@27910
|
[27910]
|
6 years |
jmt12 |
Extended the existing HadoopGreenstoneIngest with proper Reduce phase - …
|
|
|
@27753
|
[27753]
|
6 years |
jmt12 |
Adding Handbrake's percentage complete to report - although this is …
|
|
|
@27752
|
[27752]
|
6 years |
jmt12 |
Data locality file not being found is no longer fatal (HDFS-NFS-Proxy …
|
|
|
@27732
|
[27732]
|
6 years |
jmt12 |
Nice the copy itself too
|
|
|
@27686
|
[27686]
|
6 years |
jmt12 |
A few more progress comments
|
|
|
@27685
|
[27685]
|
6 years |
jmt12 |
in the case of multiple attempts you need to retain the information about …
|
|
|
@27684
|
[27684]
|
6 years |
jmt12 |
Adding natural sorting into report generation - so also needed to add INC …
|
|
|
@27683
|
[27683]
|
6 years |
jmt12 |
moving a few more headings around to help with information block layout
|
|
|
@27682
|
[27682]
|
6 years |
jmt12 |
Copying makeAllDirectories() from vanilla FileUtils.pm
|
|
|
@27669
|
[27669]
|
6 years |
jmt12 |
Sort compute nodes naturally before labelling them with incremental worker …
|
|
|
@27654
|
[27654]
|
6 years |
jmt12 |
Add the ability to stagger the starting of Mappers by placing a 'delay.me' …
|
|
|
@27653
|
[27653]
|
6 years |
jmt12 |
Forgot to pull self off the head of arguments
|
|
|
@27652
|
[27652]
|
6 years |
jmt12 |
Changing buffer to 128K (slightly faster) and adding a comment explaining …
|
|
|
@27651
|
[27651]
|
6 years |
jmt12 |
|
|
|
@27650
|
[27650]
|
6 years |
jmt12 |
|
|
|
@27649
|
[27649]
|
6 years |
jmt12 |
No longer in SVN control
|
|
|
@27648
|
[27648]
|
6 years |
jmt12 |
Template for setup.bash - a user will have to populate Hadoop fields
|
|
|
@27645
|
[27645]
|
6 years |
jmt12 |
|
|
|
@27644
|
[27644]
|
6 years |
jmt12 |
Extended to support HDFS-access via NFS. This applies to both the call to …
|
|
|
@27643
|
[27643]
|
6 years |
jmt12 |
Changed the script generator so it can recurse through directories and …
|
|
|
@27642
|
[27642]
|
6 years |
jmt12 |
A script I downloaded that successfully splits video files - something I …
|
|
|
@27641
|
[27641]
|
6 years |
jmt12 |
Altered order of arguments and allow archives dir to be passed as argument …
|
|
|
@27640
|
[27640]
|
6 years |
jmt12 |
|
|
|
@27638
|
[27638]
|
6 years |
jmt12 |
Change it so failure to open a filehandle isn't fatal - leave it up to the …
|
|
|
@27631
|
[27631]
|
6 years |
jmt12 |
A proxy to allow NFS access to HDFS
|
|
|
@27595
|
[27595]
|
7 years |
jmt12 |
Updating list of untarred directories to ignore
|
|
|
@27594
|
[27594]
|
7 years |
jmt12 |
Extend hadoop_import.pl to be able to start and stop the Thrift server(s)
|
|
|
@27593
|
[27593]
|
7 years |
jmt12 |
Need Class Accessor for Thrift client under Rocks
|
|
|
@27592
|
[27592]
|
7 years |
jmt12 |
Adding in a script to allow a daemon version of Thrift to be started (and …
|
|
|
@27591
|
[27591]
|
7 years |
jmt12 |
Ensure Thrift will, by default, attempt to connect to the local machine …
|
|
|
@27590
|
[27590]
|
7 years |
jmt12 |
Adding statistics about data locality, and highlighting tasks where file …
|
|
|
@27589
|
[27589]
|
7 years |
jmt12 |
Fixing up some minor bugs in regexes
|
|
|
@27588
|
[27588]
|
7 years |
jmt12 |
Extend parser to support jobs that are split over several logs. Also …
|
|
|
@27587
|
[27587]
|
7 years |
jmt12 |
Allow debug mode to be enabled from the command line
|
|
|
@27586
|
[27586]
|
7 years |
jmt12 |
Updating script to take date of hadoop job into account when searching for …
|
|
|
@27585
|
[27585]
|
7 years |
jmt12 |
The perl on Medusa won't let you immediately treat a returned array in a …
|
|
|
@27584
|
[27584]
|
7 years |
jmt12 |
I wasn't doing -r when attempting to clear directories left in /tmp by …
|
|
|
@27583
|
[27583]
|
7 years |
jmt12 |
Adding code to differentiate between workers in a cluster - all of which …
|
|
|
@27571
|
[27571]
|
7 years |
jmt12 |
increase timeout to 4 hours per map
|
|
|
@27570
|
[27570]
|
7 years |
jmt12 |
Make the warning about binmode() not being applicable more meaningful, and …
|
|
|
@27569
|
[27569]
|
7 years |
jmt12 |
Trying to streamline the error messages from failing to link (otherwise I …
|
|
|
@27568
|
[27568]
|
7 years |
jmt12 |
Testing on Medusa suggests optimal buffer size around 128K
|
|
|
@27567
|
[27567]
|
7 years |
jmt12 |
Found a printWarning that I hadn't changed to use the FileUtils version
|
|
|
@27566
|
[27566]
|
7 years |
jmt12 |
Making the getcpu optional - as it isn't available on Medusa (but then I …
|
|
|
@27561
|
[27561]
|
7 years |
jmt12 |
Adding very basic compile file for getcpu - can't be bothered going …
|
|
|
@27560
|
[27560]
|
7 years |
jmt12 |
Fixing typo in regexp that meant filenames sometimes ignored
|
|
|
@27559
|
[27559]
|
7 years |
jmt12 |
Changed mime-type away from binary - I hope. Meanwhile, generate …
|
|
|
@27558
|
[27558]
|
7 years |
jmt12 |
Forgot that Hadoop Map processes no longer have the environment …
|
|
|
@27551
|
[27551]
|
7 years |
jmt12 |
Altered so that it expects to be given a CSV containing parallel …
|
|
|
@27550
|
[27550]
|
7 years |
jmt12 |
Ensure the hostname is added to the Hadoop logs so we can identify the …
|
|
|
@27549
|
[27549]
|
7 years |
jmt12 |
Extract information from the logs generated by parallel Greenstone using …
|
|
|
@27548
|
[27548]
|
7 years |
jmt12 |
Extract information from the logs generated by parallel Greenstone using …
|
|
|
@27547
|
[27547]
|
7 years |
jmt12 |
Rejigging some processing comments
|
|
|
@27546
|
[27546]
|
7 years |
jmt12 |
Adding the ability for the Hadoop Mapper to determine what CPU number it …
|
|
|
@27545
|
[27545]
|
7 years |
jmt12 |
Ignoring just the compiled file (for now)
|
|
|
@27544
|
[27544]
|
7 years |
jmt12 |
A tiny C script to guesstimate the CPU the calling Process is on
|
|
|
@27543
|
[27543]
|
7 years |
jmt12 |
Adding generate_gantt.pl script in its original form - i.e. directly reads …
|
|
|
@27532
|
[27532]
|
7 years |
jmt12 |
Add the ability to configure the Thrift connector using a 'thrift.conf' …
|
|
|
@27531
|
[27531]
|
7 years |
jmt12 |
Only output the message about using copy instead of hard/soft link once
|
|
|
@27530
|
[27530]
|
7 years |
jmt12 |
Clear out old logs, and adding more comments about what the script is …
|
|
|
@27526
|
[27526]
|
7 years |
jmt12 |
Adding in an 'isHDFS()' function so that some plugins (SimpleVideoPlug) can …
|
|
|
@27525
|
[27525]
|
7 years |
jmt12 |
Adding in an 'isHDFS()' function so that some plugins (SimpleVideoPlug) can …
|
|
|
@27515
|
[27515]
|
7 years |
jmt12 |
Making the file used during buffer tests be configurable
|
|
|
@27514
|
[27514]
|
7 years |
jmt12 |
Altering code to allow configurable length of read/write buffer when …
|
|
|
@27512
|
[27512]
|
7 years |
jmt12 |
Adding in a special test for measuring the effect of altering ThriftFS …
|
|
|