Last change on this file since 36317 was 33643, checked in by ak19, 5 years ago:
Brought the template log4j.properties.in back up to speed. I forgot it existed and had committed the local log4j.properties and was solely modifying that.
File size: 1.3 KB

# https://www.linuxjournal.com/content/downloading-entire-web-site-wget
# https://linuxreviews.org/Wget:_download_whole_or_parts_of_websites_with_ease
# https://www.webhostface.com/kb/knowledgebase/examples-using-wget/
# "You can replicate the HTML content of a website with the --mirror option (or -m for short)
# wget -m http://domain.com"
# https://www.linuxquestions.org/questions/linux-server-73/wget-how-to-download-more-than-one-file-at-once-instead-of-file-after-file-704693/
wget.mirror.cmd=wget -Q10m -m %%BASE_URL%%

# for downloading a single file
wget.file.cmd=wget %%FILE_URL%%

# Arbitrary cutoff values for WETProcessor.java
WETprocessor.min.content.length=100
WETprocessor.min.line.count=2
WETprocessor.min.content.length.wrapped.line=500
WETprocessor.min.spaces.per.wrapped.line=10

# Arbitrary cutoff values for WETProcessor.java
# for determining whether a WET record has sufficient and sensible content
WETprocessor.max.word.length=15
WETprocessor.min.num.words=20
WETprocessor.max.words.camelcase=10


mongodb.user=anupama
mongodb.pwd=chang3m3
# default mongodb port is 27017. Don't change the port unless you really have configured
# your mongodb server to listen at some other port
mongodb.port=27017
mongodb.host=mongodb.cms.waikato.ac.nz
#mongodb.dbname=ateacrawldata
mongodb.dbname=anupama
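
The two wget.*.cmd entries are command templates: `-m` turns on mirroring, `-Q10m` sets a 10 MB download quota, and the `%%BASE_URL%%` / `%%FILE_URL%%` markers are filled in at run time. How the project's own code performs that substitution is not shown in this file. The following is a minimal Java sketch, assuming a plain per-token string replacement; the class name `WgetCommandSketch`, the method `buildArgv`, and the example URL are hypothetical.

```java
import java.io.IOException;
import java.io.StringReader;
import java.util.ArrayList;
import java.util.List;
import java.util.Properties;

// Hypothetical helper, not part of the repository.
class WgetCommandSketch {

    /**
     * Splits the template on whitespace first, then substitutes the
     * %%PLACEHOLDER%% marker inside each token, so the substituted value
     * always ends up as a single argument.
     */
    static List<String> buildArgv(String template, String placeholder, String value) {
        String marker = "%%" + placeholder + "%%";
        List<String> argv = new ArrayList<>();
        for (String token : template.trim().split("\\s+")) {
            argv.add(token.replace(marker, value));
        }
        return argv;
    }

    public static void main(String[] args) throws IOException {
        // Inline copy of one entry from the file, so the sketch runs on its own.
        Properties props = new Properties();
        props.load(new StringReader("wget.mirror.cmd=wget -Q10m -m %%BASE_URL%%\n"));

        List<String> argv = buildArgv(
                props.getProperty("wget.mirror.cmd"), "BASE_URL", "http://example.com/");
        System.out.println(argv); // prints [wget, -Q10m, -m, http://example.com/]
        // The list could then be handed to new ProcessBuilder(argv).
    }
}
```

Building an argument list rather than one shell string avoids passing the URL through a shell, which matters if the URLs come from crawled data.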
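
The WETprocessor.* entries are integer cutoffs, described in the file only as "arbitrary" values for deciding whether a WET record has sufficient and sensible content. The actual checks live in WETProcessor.java and are not shown here. The sketch below is one assumed reading of four of the keys (minimum length, minimum line count, and a minimum number of words no longer than the maximum word length); the class `WetCutoffSketch` and its methods are hypothetical, and the remaining keys are deliberately left out because their meaning cannot be inferred from this file alone.

```java
import java.io.IOException;
import java.io.StringReader;
import java.io.UncheckedIOException;
import java.util.Properties;

// Hypothetical illustration; the real WETProcessor.java may apply the cutoffs differently.
class WetCutoffSketch {
    final int minContentLength;
    final int minLineCount;
    final int maxWordLength;
    final int minNumWords;

    WetCutoffSketch(Properties p) {
        minContentLength = intProp(p, "WETprocessor.min.content.length");
        minLineCount = intProp(p, "WETprocessor.min.line.count");
        maxWordLength = intProp(p, "WETprocessor.max.word.length");
        minNumWords = intProp(p, "WETprocessor.min.num.words");
    }

    /** Reads a required integer property and fails loudly if it is missing. */
    static int intProp(Properties p, String key) {
        String raw = p.getProperty(key);
        if (raw == null) {
            throw new IllegalArgumentException("Missing property: " + key);
        }
        return Integer.parseInt(raw.trim());
    }

    /** Parses properties text, wrapping the checked IOException for convenience. */
    static WetCutoffSketch fromText(String propertiesText) {
        Properties p = new Properties();
        try {
            p.load(new StringReader(propertiesText));
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
        return new WetCutoffSketch(p);
    }

    /** Assumed interpretation: long enough, enough lines, enough plausible words. */
    boolean hasSufficientContent(String text) {
        if (text.length() < minContentLength) {
            return false;
        }
        if (text.split("\\R").length < minLineCount) {
            return false;
        }
        int plausibleWords = 0;
        for (String word : text.trim().split("\\s+")) {
            if (!word.isEmpty() && word.length() <= maxWordLength) {
                plausibleWords++;
            }
        }
        return plausibleWords >= minNumWords;
    }
}
```

Keeping the thresholds in the properties file rather than in code means they can be tuned between crawl runs without recompiling.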
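
The mongodb.* entries carry everything needed for a standard MongoDB connection string of the form `mongodb://user:password@host:port/database`. Whether the project assembles such a URI or passes the values to a driver individually is not shown in this file. The following is a minimal sketch using only the Java standard library; the class `MongoUriSketch` and its methods are hypothetical, and it assumes the user authenticates against the database named in mongodb.dbname.

```java
import java.io.UnsupportedEncodingException;
import java.net.URLEncoder;
import java.util.Properties;

// Hypothetical helper, not part of the repository.
class MongoUriSketch {

    /** Percent-encodes credentials so characters such as '@' or '/' cannot break the URI. */
    static String encode(String s) {
        try {
            // URLEncoder produces form encoding (space becomes '+'); convert to %20 for URI use.
            return URLEncoder.encode(s, "UTF-8").replace("+", "%20");
        } catch (UnsupportedEncodingException e) {
            throw new IllegalStateException("UTF-8 is always supported", e);
        }
    }

    /** Returns a required property or fails with a clear message. */
    static String require(Properties p, String key) {
        String value = p.getProperty(key);
        if (value == null || value.trim().isEmpty()) {
            throw new IllegalArgumentException("Missing property: " + key);
        }
        return value.trim();
    }

    static String buildUri(Properties p) {
        String user = require(p, "mongodb.user");
        String pwd = require(p, "mongodb.pwd");
        String host = require(p, "mongodb.host");
        // Fall back to MongoDB's default port when mongodb.port is absent.
        int port = Integer.parseInt(p.getProperty("mongodb.port", "27017").trim());
        String db = require(p, "mongodb.dbname");
        return "mongodb://" + encode(user) + ":" + encode(pwd)
                + "@" + host + ":" + port + "/" + db;
    }

    public static void main(String[] args) {
        Properties p = new Properties();
        p.setProperty("mongodb.user", "anupama");
        p.setProperty("mongodb.pwd", "chang3m3");
        p.setProperty("mongodb.port", "27017");
        p.setProperty("mongodb.host", "mongodb.cms.waikato.ac.nz");
        p.setProperty("mongodb.dbname", "anupama");
        System.out.println(buildUri(p));
        // prints mongodb://anupama:chang3m3@mongodb.cms.waikato.ac.nz:27017/anupama
    }
}
```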