Timestamp:
2013-07-05T22:55:09+12:00 (11 years ago)
Author:
ak19
Message:

Fixes to previous commit: the names of the intermediate html files that PDFBox creates are generated with random numbers. As a result, the HASH OIDs of these intermediate source files, although constant between the model and test collections, can appear in a different order in the two collections when both are ordered alphabetically in the index/col.gdb file.
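
A minimal sketch of the ordering problem described in the message. The data and the helper `oids_sorted_by_name` are hypothetical, and the sketch assumes that entries end up sorted on a key containing the random tmp file name; the actual layout of index/col.gdb is not shown in this changeset.

```perl
use strict;
use warnings;

# Hypothetical data: tmp file name => OID. Only the random file names
# differ between the two builds; the OIDs themselves are identical.
my %model = ('aaa111.html' => 'HASH01', 'zzz999.html' => 'HASH02');
my %test  = ('yyy888.html' => 'HASH01', 'bbb222.html' => 'HASH02');

# Return the OIDs in the order produced by sorting on the file name
# (assumption: this mimics the alphabetical ordering in col.gdb).
sub oids_sorted_by_name {
    my ($entries) = @_;
    return map { $entries->{$_} } sort keys %$entries;
}

my @model_order = oids_sorted_by_name(\%model);
my @test_order  = oids_sorted_by_name(\%test);

print "model: @model_order\n";   # model: HASH01 HASH02
print "test:  @test_order\n";    # test:  HASH02 HASH01
```

Both builds contain the same OIDs, but a line-by-line comparison would report a difference because the order is not the same.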

File:
1 edited

  • other-projects/nightly-tasks/diffcol/trunk/diffcol/diffcol.pl

    r27766  r27767
    679     679     # The following block of code is necessary to deal with tmp (html) source files generated when using PDFBox
    680     680     # These tmpdirs are located inside the toplevel *greenstone* directory
    681             (my $gsdlhome_re = $ENV{'GSDLHOME'}) =~ s@\\@\/@g;     
            681     (my $gsdlhome_re = $ENV{'GSDLHOME'}) =~ s@\\@\/@g;
            682     $gsdlhome_re = ".*" unless $$ENV{'GSDLHOME'};
    682     683     my $tmpfile_regex = "<Metadata name=\"URL\">http://$gsdlhome_re/tmp/([^\.]*)(\..{3,4})</Metadata>"; # $gsdlhome/tmp/randomfilename.html, file ext can be 3 or 4 chars long 
    683     684     if($test_contents =~ m@$tmpfile_regex@) {           
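
A self-contained sketch of how the pattern above behaves, under stated assumptions: `build_tmpfile_regex` is a hypothetical wrapper that does not exist in diffcol.pl, the sample metadata line is invented, and the fallback tests the plain value of `GSDLHOME` (the diff writes `$$ENV{'GSDLHOME'}`, which may or may not be what was intended).

```perl
use strict;
use warnings;

# Hypothetical helper: build the tmp-file pattern from a GSDLHOME value.
sub build_tmpfile_regex {
    my ($gsdlhome) = @_;
    my $gsdlhome_re = defined $gsdlhome ? $gsdlhome : "";
    $gsdlhome_re =~ s@\\@/@g;                # Windows backslashes -> forward slashes
    $gsdlhome_re = ".*" unless $gsdlhome_re; # assumed intent: match any prefix if GSDLHOME is unset
    # Same double-quoted string as in the diff: group 1 is the random
    # file name, group 2 is the file extension.
    return "<Metadata name=\"URL\">http://$gsdlhome_re/tmp/([^\.]*)(\..{3,4})</Metadata>";
}

# Invented sample line of the kind the pattern is meant to catch.
my $line = '<Metadata name="URL">http://C:/gs/tmp/k3j9x2.html</Metadata>';

my $re = build_tmpfile_regex("C:\\gs");
if ($line =~ /$re/) {
    print "random name: $1, extension: $2\n";
}
```

Because the pattern is built in a double-quoted string, `\.` reaches the regex engine as a plain `.`, so the extension group matches any character in the position of the dot. The pattern still matches lines such as the sample, but it is looser than the comment suggests.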