|
|
@30306
|
9 years |
jmt12 |
Making the setup of CPAN path more robust based on the better control …
|
|
|
@29663
|
9 years |
jmt12 |
Supporting grayscale printing, fixing mismatched tags and speechmarks, …
|
|
|
@29662
|
9 years |
jmt12 |
Now removes building and index directories if found
|
|
|
@29661
|
9 years |
jmt12 |
A helper script to clean-up the bogus directories sometimes created by …
|
|
|
@29158
|
10 years |
jmt12 |
Initial checkin of script to convert a number of Greenstone|| logs …
|
|
|
@29106
|
10 years |
jmt12 |
Check-in of script to symlink lorem files to matching files in another …
|
|
|
@29104
|
10 years |
jmt12 |
A script for extracting textual metrics from a collection of text …
|
|
|
@29103
|
10 years |
jmt12 |
updated - not any more efficient (Schlemiel the painter performance) …
|
|
|
@28769
|
10 years |
jmt12 |
No longer used. import.pl now smart enough to dynamically load …
|
|
|
@28768
|
10 years |
jmt12 |
Initially added microtime to this script, but then remembered it isn't …
|
|
|
@28767
|
10 years |
jmt12 |
Drastically increased the script to allow 1) battery of imports backed …
|
|
|
@28766
|
10 years |
jmt12 |
Removing an occasional few characters of garbage that turn up in the …
|
|
|
@28764
|
10 years |
jmt12 |
Adding microsecond timing messages
|
|
|
@28666
|
11 years |
jmt12 |
A script to transform a strace.out into a Tab separated file worthy of …
|
|
|
@28665
|
11 years |
jmt12 |
Latest changes to workaround resumed syscalls massive duration problem
|
|
|
@28652
|
11 years |
jmt12 |
Changes to support running the reports over logs produced from …
|
|
|
@28648
|
11 years |
jmt12 |
Adding a short delay after writing to the flush_cache file just to …
|
|
|
@28647
|
11 years |
jmt12 |
Adding progress messages and making a debug message optional
|
|
|
@28646
|
11 years |
jmt12 |
A script that uses strace to produce IO metrics of a Greenstone import
|
|
|
@28645
|
11 years |
jmt12 |
Script to generate a report on data locality from GreenstoneHadoop logs
|
|
|
@28358
|
11 years |
jmt12 |
Replacing my earlier decision to only have data locality information …
|
|
|
@28357
|
11 years |
jmt12 |
used to update the data_locality.csv file in the case where other …
|
|
|
@28356
|
11 years |
jmt12 |
Support the legacy version of taskno in the data_locality.csv file (we …
|
|
|
@28191
|
11 years |
jmt12 |
Removing redundant error stream redirect - this wasn't causing the …
|
|
|
@28190
|
11 years |
jmt12 |
Had accidently hardcoded the max replication number - allow it to be …
|
|
|
@28189
|
11 years |
jmt12 |
Replace the newer (and faster) while(@file) loop with the older (and …
|
|
|
@28188
|
11 years |
jmt12 |
Minor fix to allow for tasks that start in the same second (now each …
|
|
|
@28186
|
11 years |
jmt12 |
A (failed) attempt to use the unix iotop tool to determine IO percentage
|
|
|
@28018
|
11 years |
jmt12 |
Try really hard to capture the output from 'time' function as Medusa …
|
|
|
@28017
|
11 years |
jmt12 |
Forgot to add processing comment before call to hadoop_import.pl
|
|
|
@28016
|
11 years |
jmt12 |
Allow the hadoop report generator to parse start and end times …
|
|
|
@28015
|
11 years |
jmt12 |
Add an extra option that allows me to pass in the directory to write …
|
|
|
@28014
|
11 years |
jmt12 |
Remove tasks that have had data locality established from the array of …
|
|
|
@28013
|
11 years |
jmt12 |
A new script to run a battery of Hadoop ingests at varying replication …
|
|
|
@27914
|
11 years |
jmt12 |
Trying to get around a couple of divide-by-zero issues when generating …
|
|
|
@27913
|
11 years |
jmt12 |
Made the ingester to be used (version 1 without reduce phase, or …
|
|
|
@27753
|
11 years |
jmt12 |
Adding Handbrake's percentage complete to report - although this is …
|
|
|
@27752
|
11 years |
jmt12 |
Data locality file not being found is no longer fatal (HDFS-NFS-Proxy …
|
|
|
@27732
|
11 years |
jmt12 |
Nice the copy itself too
|
|
|
@27686
|
11 years |
jmt12 |
A little more progress comments
|
|
|
@27685
|
11 years |
jmt12 |
in the case of multiple attempts you need to retain the information …
|
|
|
@27684
|
11 years |
jmt12 |
Adding natural sorting into report generation - so also needed to add …
|
|
|
@27683
|
11 years |
jmt12 |
moving a few more headings around to help with information block layout
|
|
|
@27669
|
11 years |
jmt12 |
Sort compute nodes naturally before labelling them with incremental …
|
|
|
@27654
|
11 years |
jmt12 |
Add the ability to stagger the starting of Mappers by placing a …
|
|
|
@27644
|
11 years |
jmt12 |
Extended to support HDFS-access via NFS. This applies to both the call …
|
|
|
@27643
|
11 years |
jmt12 |
Changed the script generator so it can recurse through directories and …
|
|
|
@27642
|
11 years |
jmt12 |
A script I downloaded that successfully splits video files - something …
|
|
|
@27594
|
11 years |
jmt12 |
Extend hadoop_import.pl to be able to start and stop the Thrift server(s)
|
|
|
@27590
|
11 years |
jmt12 |
Adding statistics about data locality, and highlighting tasks where …
|
|
|
@27589
|
11 years |
jmt12 |
Fixing up some minor bugs in regex's
|
|
|
@27588
|
11 years |
jmt12 |
Extend parser to support jobs that are split over several logs. Also …
|
|
|
@27587
|
11 years |
jmt12 |
Allow debug mode to be enabled from the command line
|
|
|
@27586
|
11 years |
jmt12 |
Updating script to date date of hadoop job into account when searching …
|
|
|
@27585
|
11 years |
jmt12 |
The perl on Medusa won't let you immediately treat a returned array in …
|
|
|
@27584
|
11 years |
jmt12 |
I wasn't doing -r when attempting to clear directories left in /tmp by …
|
|
|
@27583
|
11 years |
jmt12 |
Adding code to differentiate between workers in a cluster - all of …
|
|
|
@27560
|
11 years |
jmt12 |
Fixing typo in regexp that meant filenames sometimes ignored
|
|
|
@27559
|
11 years |
jmt12 |
Changed mime-type away from binary - I hope. Meanwhile, generate …
|
|
|
@27551
|
11 years |
jmt12 |
Altered so that it expects to be given a CSV containing parallel …
|
|
|
@27550
|
11 years |
jmt12 |
Ensure the hostname is added to the Hadoop logs so we can identify the …
|
|
|
@27549
|
11 years |
jmt12 |
Extract information from the logs generated by parallel Greenstone …
|
|
|
@27548
|
11 years |
jmt12 |
Extract information from the logs generated by parallel Greenstone …
|
|
|
@27543
|
11 years |
jmt12 |
Adding generate_gantt.pl script in its original form - i.e. directly …
|
|
|
@27530
|
11 years |
jmt12 |
Clear out old logs, and adding more comments about what the script is …
|
|
|
@27515
|
11 years |
jmt12 |
Making the file used durig buffertes be configurable
|
|
|
@27512
|
11 years |
jmt12 |
Adding in a special test for measuring the effect of altering ThriftFS …
|
|
|
@27495
|
11 years |
jmt12 |
removing doubled up debug comments and putting some paths in …
|
|
|
@27481
|
11 years |
jmt12 |
Adding makeAllDirectories() (which I'd only implemented in LocalFS) to …
|
|
|
@27480
|
11 years |
jmt12 |
Removing DateTime dependency (so HDFSShell will always fail …
|
|
|
@27436
|
11 years |
jmt12 |
Adding the actual script - rather than a symlink to my dropbox. doh
|
|
|
@27435
|
11 years |
jmt12 |
Gah - only a symbolic link
|
|
|
@27414
|
11 years |
jmt12 |
Allowing more processing arguments to be configured at the call, and …
|
|
|
@27412
|
11 years |
jmt12 |
I obviously hadn't run this script on Karearea before - assumed all …
|
|
|
@27409
|
11 years |
jmt12 |
Unit test like testing for the FileUtils class and LocalFS, HDFSShell, …
|
|
|
@27408
|
11 years |
jmt12 |
A symbolic link to the actual script in the packages directory
|
|
|
@27378
|
11 years |
jmt12 |
Parallel processing support now added (via buildcolutil subclass) to …
|
|
|
@27126
|
11 years |
jmt12 |
Extra clean up commands (like removing cached versions of video …
|
|
|
@27125
|
11 years |
jmt12 |
A script to try and flush all caches - I'm certain it's flushing disk …
|
|
|
@27124
|
11 years |
jmt12 |
Use the new perl version script to extract the version number - so as …
|
|
|
@27119
|
11 years |
jmt12 |
Merging version finder from Medusa with the one lurking on Karearea
|
|
|
@27058
|
11 years |
jmt12 |
Adding data locality report generation to Hadoop greenstone imports
|
|
|
@27052
|
11 years |
jmt12 |
Turns out the Perl on Medusa doesn't support $V, so I've had to …
|
|
|
@27041
|
11 years |
jmt12 |
INC path now includes the installed extensions perl path (including …
|
|
|
@27040
|
11 years |
jmt12 |
A simple script that returns just the version number of Perl
|
|
|
@27036
|
11 years |
jmt12 |
A script to extract data locality and other task information from the …
|
|
|
@27006
|
11 years |
jmt12 |
A companion script to stop-hadoop-processes that just reports running …
|
|
|
@27005
|
11 years |
jmt12 |
Similar to stop-impt.pl, this script uses kill to stop runaway Hadoop …
|
|
|
@27004
|
11 years |
jmt12 |
A script to stop (using kill) a runaway import process and any related …
|
|
|
@27001
|
11 years |
jmt12 |
Passing more environment variables (HADOOPPREFIX, HDFSHOST, HDFSPORT) …
|
|
|
@26999
|
11 years |
jmt12 |
Ensuring MPI binds to correct interface, and passing through …
|
|
|
@26998
|
11 years |
jmt12 |
Adding maxdocs variable, lots of debug comments, added some tests for …
|
|
|
@26953
|
11 years |
jmt12 |
Checking in the script rather than a symbolic link to the script :P
|
|
|
@26952
|
11 years |
jmt12 |
Accidentally checked in symbolic link rather than script
|
|
|
@26949
|
11 years |
jmt12 |
Parallel import using Hadoop
|
|
|
@26930
|
11 years |
jmt12 |
Randomized order of files, and added the ability to specify a maximum …
|
|
|
@26929
|
11 years |
jmt12 |
A script to comprehensively clean up a collection between imports... …
|
|
|
@26923
|
11 years |
jmt12 |
Generates a specficied-size subset of a larger import directory
|
|
|
@26242
|
12 years |
jmt12 |
Modifications to progress messages to improve extracting information …
|
|
|
@26187
|
12 years |
jmt12 |
Adding the rest of parallel processing support for Terrier into SVN. …
|
|
|