Extract information from the logs generated by parallel Greenstone using Hadoop, and generate a CSV suitable for handing to generate_gantt.pl