Timestamp:
2019-09-13T17:44:41+12:00
Author:
ak19
Message:

Improved the code to use a static block that loads the needed properties from config.properties and initialises some static final ints from them. The code now uses the logger for debugging. Added new properties to config.properties. Returned to using a counter, recordCount, that is re-zeroed for each WETProcessor: the count is only used to generate unique filenames, and filename prefixes are already unique for each warc.wet file. Combining these prefixes with a per-warc.wet-file record count assigns each WET record written out to a file a unique filename, so a running total of all WET records across the processed warc.wet files is no longer needed to ensure unique filenames. Everything still appears to work as in the previous commit when creating the discard and keep subfolders.
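
The two ideas in this message (static-block property loading, and a per-file record counter combined with a unique prefix) can be sketched roughly as follows. This is only an illustration under stated assumptions, not the committed code: the class names WetNamingSketch and RecordNamer, the property key example.min.content.length, its default of 100, and the "prefix-count.txt" filename pattern are all hypothetical.

```java
import java.io.IOException;
import java.io.InputStream;
import java.util.Properties;

// Hypothetical sketch; names, property key and filename pattern are assumptions.
class WetNamingSketch {

    // A static final int initialised once from config.properties.
    static final int MIN_CONTENT_LENGTH;

    static {
        Properties props = new Properties();
        // Look for config.properties on the classpath; fall back to defaults if absent.
        try (InputStream in = WetNamingSketch.class.getClassLoader()
                .getResourceAsStream("config.properties")) {
            if (in != null) {
                props.load(in);
            }
        } catch (IOException ex) {
            System.err.println("Could not read config.properties: " + ex.getMessage());
        }
        // Hypothetical key, with a default so the class still loads without the file.
        MIN_CONTENT_LENGTH = Integer.parseInt(
                props.getProperty("example.min.content.length", "100").trim());
    }

    // One instance per warc.wet file: the counter starts at zero for each file.
    static final class RecordNamer {
        private final String filePrefix; // assumed unique per warc.wet file
        private int recordCount = 0;

        RecordNamer(String filePrefix) {
            this.filePrefix = filePrefix;
        }

        // Prefix plus per-file count yields a name unique across all files.
        String nextFilename() {
            recordCount++;
            return filePrefix + "-" + recordCount + ".txt";
        }
    }
}
```

Because each prefix is unique to its warc.wet file, two RecordNamer instances can both hand out count 1 without their filenames colliding, which is why no running total across files is needed.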

File:
1 edited

  • gs3-extensions/maori-lang-detection/src/org/greenstone/atea/Utility.java

--- gs3-extensions/maori-lang-detection/src/org/greenstone/atea/Utility.java (r33466)
+++ gs3-extensions/maori-lang-detection/src/org/greenstone/atea/Utility.java (r33467)
@@ -4,5 +4,8 @@
 import java.util.zip.GZIPInputStream;
 
+import org.apache.log4j.Logger;
+
 public class Utility {
+    private static Logger logger = Logger.getLogger(org.greenstone.atea.Utility.class.getName());
     
     // Run gunzip
@@ -26,5 +29,5 @@
         //out.close();
         
-        //log("Unzipped " + inZipFile + " to " + outFile);
+        logger.debug("Unzipped " + inZipFile + " to " + outFile);
         
     } catch(IOException ex) {