Changeset 33469
- Timestamp: 2019-09-13T21:46:09+12:00
- File: 1 edited
gs3-extensions/maori-lang-detection/src/org/greenstone/atea/WETProcessor.java
--- r33468
+++ r33469
@@ -200,6 +200,15 @@
 
     File parentFolder = null;
-
-    if(lineCount >= MIN_LINE_COUNT && contentLength >= MIN_CONTENT_LENGTH) {
+
+    // want to match "product(s)" but not "production"
+
+    //if(recordURI.matches(".*/?product[^a-rt-z].*")) {//if(recordURI.matches(".*/?products?/?.*")) {
+    if(recordURI.contains("product") && !recordURI.contains("production")) {
+
+        // don't want a "translated" product site/online store
+        // These curiously often tend to have "product(s)" in the URL
+        parentFolder = WETProcessor.discardFolder;
+    }
+    else if(lineCount >= MIN_LINE_COUNT && contentLength >= MIN_CONTENT_LENGTH) {
         parentFolder = WETProcessor.keepFolder;
         System.err.println("@@@KEEPING");
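The stated intent of this change is to match "product(s)" in the record URI but not "production". As a minimal sketch of that intent (not part of the changeset), the committed substring check can be compared with a regex that uses a negative lookahead. The class name `ProductUrlFilter`, both method names, and the example URLs are hypothetical and exist only for illustration.

```java
import java.util.regex.Pattern;

// Hypothetical helper, not part of WETProcessor.java.
class ProductUrlFilter {

    // "product" or "products" that is NOT followed by another letter,
    // so "production" and "productivity" do not match.
    private static final Pattern PRODUCT =
        Pattern.compile("products?(?![a-z])", Pattern.CASE_INSENSITIVE);

    // The same test the changeset applies to recordURI.
    static boolean substringCheck(String uri) {
        return uri.contains("product") && !uri.contains("production");
    }

    // Assumed alternative: a lookahead-based test of the same intent.
    static boolean regexCheck(String uri) {
        return PRODUCT.matcher(uri).find();
    }

    public static void main(String[] args) {
        // Made-up URLs for illustration only.
        String[] uris = {
            "http://example.com/products/123",
            "http://example.com/production/news",
            "http://example.com/production/products/123"
        };
        for (String uri : uris) {
            System.out.println(uri
                + " substring=" + substringCheck(uri)
                + " regex=" + regexCheck(uri));
        }
    }
}
```

The two checks agree on the first two URLs and differ only when a URI contains both "production" and "products": the substring check then lets the page through, while the lookahead version still flags it. Whether that case matters for the crawl data is an open question; the sketch only makes the difference visible.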