Ignore:
Timestamp:
2019-11-05T21:04:09+13:00 (4 years ago)
Author:
ak19
Message:
  1. Incorporated Dr Nichols earlier suggestion of storing page modified time and char-encoding metadata if present in the crawl dump output. Have done so, but neither modifiedTime nor fetchTime metadata of the dump file appear to be a webpage's actual modified time, as they're from 2019 and set around the period we've been crawling. 2. Moved getDomainFromURL() function from CCWETProcessor.java to Utility.java since it's been reused. 3. MongoDBAccess class successfully connects (at least, no exceptions) and uses the newly added properties in config.properties to make the connection.
File:
1 edited

Legend:

Unmodified
Added
Removed
  • gs3-extensions/maori-lang-detection/conf/config.properties

    r33615 r33623  
    2323
    2424
    25 mongodb.user=admin
    26 mongodb.pwd=pinky
    27 #CHANGEME
    28 
     25mongodb.user=anupama
     26mongodb.pwd=chang3m3
     27# default mongodb port is 27017. Don't change the port unless you really have configured
     28# your mongodb server to listen at some other port
     29mongodb.port=27017
     30mongodb.host=mongodb.cms.waikato.ac.nz
     31mongodb.dbname=ateacrawldata
Note: See TracChangeset for help on using the changeset viewer.