Ignore:
Timestamp:
2019-12-17T19:31:28+13:00 (4 years ago)
Author:
ak19
Message:

Storing not just whether /mi(/) suffix is in path, but also whether http(s):mi. is in path, as storing these can help reducing number of auto-translated sites too in a similar way.

File:
1 edited

Legend:

Unmodified
Added
Removed
  • other-projects/maori-lang-detection/src/org/greenstone/atea/morphia/WebsiteInfo.java

    r33801 r33808  
    2121   
    2222    public final String geoLocationCountryCode;
    23     public final boolean urlContainsLangCodeInPath;
     23    public final boolean urlContainsLangCodeInPathSuffix;
     24    public final boolean urlContainsLangCodeInPathPrefix;
    2425   
    2526    public WebsiteInfo(/*int siteCount,*/ String siteFolderName, String domainOfSite,
     
    2728               int numPagesInMRI, int numPagesContainingMRI,
    2829               long siteCrawledTimestamp, boolean siteCrawlUnfinished, boolean redoCrawl,
    29                String geoLocationCountryCode, boolean urlContainsLangCodeInPath)
     30               String geoLocationCountryCode, boolean urlContainsLangCodeInPathSuffix, boolean urlContainsLangCodeInPathPrefix)
    3031    {
    3132    //this.id = siteCount;
     
    4445   
    4546    this.geoLocationCountryCode = geoLocationCountryCode;
    46     this.urlContainsLangCodeInPath = urlContainsLangCodeInPath;
     47    this.urlContainsLangCodeInPathSuffix = urlContainsLangCodeInPathSuffix;
     48    this.urlContainsLangCodeInPathPrefix = urlContainsLangCodeInPathPrefix;
    4749    }
    4850}
Note: See TracChangeset for help on using the changeset viewer.