Timeline


and .

30.05.2020:

16:42 Changeset [34135] by ak19
Changed the name of a collection making it more descriptive and also …
16:15 Changeset [34134] by ak19
Added an empty text file with instruction for the allismri collection too
16:14 Changeset [34133] by ak19
Added an empty text file with instruction
16:01 Changeset [34132] by ak19
Committing the commoncrawl site of Nutch recrawls of our CC data where …
15:18 Changeset [34131] by ak19
Allowing input keep-urls-file to contain a comma followed by country code …
01:27 Changeset [34130] by ak19
Some more tidying up while isMRI filtered collection rebuilding
01:01 Changeset [34129] by ak19
Implemented Kathy's suggestions: 1. Explicit ex prefix to ex meta removed, …

27.05.2020:

20:06 Changeset [34128] by ak19
When rebuilding the opotiki site today, had noticed that full-rebuild's …
19:43 Changeset [34127] by ak19
Spelling correction in filename: screeMshot to screeNshot
19:10 Changeset [34126] by ak19
When I'd modified the code to make the keep_urls_file non-compulsory, …
18:07 Changeset [34125] by ak19
Commit message went awry. Cleaned up some comments to recommit with proper …
18:03 Changeset [34124] by ak19
Decoding the title and text using the encoding seemed to have turned into …

26.05.2020:

02:18 Changeset [34123] by ak19
Some more minor changes
01:13 Changeset [34122] by ak19
1. After some testing of building the complete commoncrawl collection, …

25.05.2020:

23:53 Changeset [34121] by ak19
1. Introducing NutchTextDumpPlugin? to process the records (representing …

21.05.2020:

17:47 Changeset [34120] by ak19
CSV version of .ods file, so openoffice isn't required
17:28 Changeset [34119] by ak19
Committing the auto-generated analysis results folder, mongodb-data-auto. …
14:16 Changeset [34118] by ak19
Kathy's hard work for commit 34117 was done on a Windows machine where a …

20.05.2020:

15:53 Changeset [34117] by kjdon
tidied up the code. Moved a few commands that don't actually need site or …
14:44 Changeset [34116] by kjdon
use global.properties, not build.properties. therefore call this with …

19.05.2020:

15:03 Changeset [34115] by kjdon
a couple changes. 1 don't explicitly need to remove the lock file from …
13:22 Ticket #945 (GS3 needs to allow for https URLs) closed by ak19
fixed: GLI updated to use ProtocolPortProperties? when GS3 to work out port and …
12:25 Changeset [34114] by kjdon
for gs3, gwcgi is the tomcat context, i.e. greenstone3 by default. If you …
11:34 Changeset [34113] by ak19
1. tomcat.port no longer exists in build.properties after https also …

18.05.2020:

13:40 Changeset [34112] by ak19
GS3 source code seems to already use FileInputStream? with UTF-8 encoding …
11:24 Changeset [34111] by ak19
Undoing additions surrounding JAVA_TOOL_OPTIONS where file.encoding is set …

06.05.2020:

08:02 Changeset [34110] by kjdon
modified a couple of error strings to be more helpful

04.05.2020:

08:39 Changeset [34109] by kjdon
tidied this up a bit. Now we leave in _textmonth00_ if the month is …
08:27 Changeset [34108] by kjdon
added in a replace item for textmonth00, used by Datelist when month is …
08:26 Changeset [34107] by kjdon
added in a definition for textmonth00, used by DateList? when the month is …
Note: See TracTimeline for information about the timeline view.