source: other-projects/maori-lang-detection/conf/config.properties.in@ 36317

Last change on this file since 36317 was 33643, checked in by ak19, 5 years ago

Brought the template log4j.properties.in back up to speed. I forgot it existed and had committed the local log4j.properties and was solely modifying that.

File size: 1.3 KB
# https://www.linuxjournal.com/content/downloading-entire-web-site-wget
# https://linuxreviews.org/Wget:_download_whole_or_parts_of_websites_with_ease
# https://www.webhostface.com/kb/knowledgebase/examples-using-wget/
# "You can replicate the HTML content of a website with the --mirror option (or -m for short)
# wget -m http://domain.com"
# https://www.linuxquestions.org/questions/linux-server-73/wget-how-to-download-more-than-one-file-at-once-instead-of-file-after-file-704693/
wget.mirror.cmd=wget -Q10m -m %%BASE_URL%%

# for downloading a single file
wget.file.cmd=wget %%FILE_URL%%

# Arbitrary cutoff values for WETProcessor.java
WETprocessor.min.content.length=100
WETprocessor.min.line.count=2
WETprocessor.min.content.length.wrapped.line=500
WETprocessor.min.spaces.per.wrapped.line=10

# Arbitrary cutoff values for WETProcessor.java
# for determining whether a WET record has sufficient and sensible content
WETprocessor.max.word.length=15
WETprocessor.min.num.words=20
WETprocessor.max.words.camelcase=10


mongodb.user=anupama
mongodb.pwd=chang3m3
# The default MongoDB port is 27017. Don't change the port unless you really have configured
# your MongoDB server to listen on some other port
mongodb.port=27017
mongodb.host=mongodb.cms.waikato.ac.nz
#mongodb.dbname=ateacrawldata
mongodb.dbname=anupama
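The settings above are plain Java properties, and the wget commands carry %%NAME%% placeholders. How the project's own Java code reads them is not shown on this page, so the following is only a minimal sketch under assumptions: the class name ConfigSketch and the helpers parse, fill and intSetting are hypothetical, and literal string replacement is assumed for the placeholders.

```java
import java.io.IOException;
import java.io.StringReader;
import java.io.UncheckedIOException;
import java.util.Properties;

// Hypothetical helper class; none of these names come from the project itself.
final class ConfigSketch {

    // Parses properties-format text. A real caller would more likely
    // load conf/config.properties from disk with a FileReader.
    static Properties parse(String text) {
        Properties props = new Properties();
        try {
            props.load(new StringReader(text));
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
        return props;
    }

    // Replaces one %%NAME%% placeholder in a command template (assumed convention:
    // literal replacement, no escaping of the substituted value).
    static String fill(String template, String name, String value) {
        return template.replace("%%" + name + "%%", value);
    }

    // Reads an integer cutoff, falling back to a default when the key is absent.
    static int intSetting(Properties props, String key, int fallback) {
        String raw = props.getProperty(key);
        return raw == null ? fallback : Integer.parseInt(raw.trim());
    }
}
```

For example, fill(props.getProperty("wget.mirror.cmd"), "BASE_URL", "http://example.com") would yield a command line ready to be split into arguments, and intSetting(props, "WETprocessor.min.num.words", 0) would return the configured cutoff. Because the value is substituted verbatim, a caller that passes the result to a shell would need to quote or validate the URL first.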