Timeline



2019-12-04:

21:26 Changeset [33754] by davidb
Correcting spelling error in code
21:22 Changeset [33753] by davidb
Made the change Dr Bainbridge wanted: in the two locations that …
21:14 Changeset [33752] by davidb
Followed Dr Bainbridge's suggestion to correct hnz.Identifiers that …
17:56 Changeset [33751] by ak19
Related to previous commit. Dr Bainbridge came up with a better …
16:47 Changeset [33750] by ak19
Fixed a NullPointerException without stacktrace, noticed with …
15:58 Changeset [33749] by ak19
Still on the bugfix for GLI with non-ascii filenames assigned …

2019-12-03:

21:06 Changeset [33748] by ak19
Linux bugfixes to recent commits to do with getting file-level meta …
17:50 Changeset [33747] by ak19
Tidying up code some more and moving unused (but reusable and possibly …
17:31 Changeset [33746] by ak19
1. Bugfix for dealing with + in filenames: file-level metadata now …
16:38 Changeset [33745] by ak19
Fix to function decodeStringContainingHexEntities that I recently …
15:04 Changeset [33744] by ak19
Refactored code to do more inside functions rather than make callers …
12:15 Changeset [33743] by kjdon
added extra info to depositor line
12:14 Changeset [33742] by kjdon
get depositor name from dictionary
11:29 Changeset [33741] by kjdon
changed a comment
11:19 Changeset [33740] by kjdon
added a format statement to Titles classifier. This uses gsf:metadata …

2019-12-02:

23:15 Changeset [33739] by ak19
Bugfix to GLI for being able to parse metadata.xml files containing & …
20:43 Changeset [33738] by ak19
Got the filenameToURLEncoding(String) variant that reuses …
20:03 Changeset [33737] by ak19
A larger fix but not complete fix to the problem of attaching and …
13:54 Changeset [33736] by kjdon
fixed a spelling mistake
13:44 Changeset [33735] by kjdon
replaced text strings with dictionary lookups, and if the …
13:44 Changeset [33734] by kjdon
replaced a string with dictionary lookup
13:43 Changeset [33733] by kjdon
on the first page of depositor, only show the list of collections the …
13:31 Changeset [33732] by kjdon
added some depositor strings
12:59 Changeset [33731] by kjdon
now that userContext has the right info, we don't need to check …

2019-11-29:

23:40 Changeset [33730] by ak19
Finally, got the code back to achieving the same thing as the partial …
22:27 Changeset [33729] by ak19
Fixes to variants of debug function printCaller().
21:46 Changeset [33728] by ak19
Introducing method that I've tested separately to decode a string that …

2019-11-28:

22:17 Changeset [33727] by ak19
Experimental encoding related bugfix to GLI. In GLI, meta assigned at …
16:23 Changeset [33726] by ak19
Changing lowercase utf-8 parameter to uppercase UTF-8 in case that …
14:10 Changeset [33725] by kjdon
tidied this up a bit with regards to solr core activation and …

2019-11-27:

23:41 Changeset [33724] by ak19
1. A bugfix to Base64.decode(String s) to handle null strings returned …
17:44 Changeset [33723] by ak19
On linux 64 bit, the additional wrap command did not work because the …

2019-11-25:

21:29 Changeset [33722] by ak19
Adding in additional instructions in mongodb.txt, before I forgot how …
20:15 Changeset [33721] by ak19
Inactive but committing to svn: Newer Locale.pm file, and introducing …
20:08 Changeset [33720] by ak19
Implemented Dr Bainbridge's suggestions based on Kathy's solution to …
10:46 Changeset [33719] by kjdon
fixed up processRedirectRequest method. for some reason parts had been …

2019-11-22:

10:44 Changeset [33718] by davidb
Useful top-level script
10:44 Changeset [33717] by davidb
Code tidy up; better error checking on running Java cmd; …
10:43 Changeset [33716] by davidb
Changed to also write out resource id as file, based on inputFileName

2019-11-21:

21:10 Changeset [33715] by ak19
Much shorter Eclipse project file, currently including just the jar …
14:40 Changeset [33714] by ak19
Updating Eclipse .classpath file's reference to jar lib files correct …
14:37 Changeset [33713] by kjdon
refactoring LibraryServlet. runSecurityChecks was happening too late. …
14:23 Changeset [33712] by kjdon
modified createBasicRequest to use userContext methods, and to include …
14:20 Changeset [33711] by kjdon
added editEnabled into UserContext

2019-11-20:

23:23 Changeset [33710] by ak19
Working queries and map coords for geojson.tools (ironically, Lat and …
18:49 Changeset [33709] by ak19
Forgot to commit the zip file before deleting it
11:30 Changeset [33708] by davidb
Changed code so api key can be in separate file, and passed in on the …
11:27 Changeset [33707] by davidb
Adds in maven to path if untarred in java/packages area
11:14 Changeset [33706] by davidb
Template file

2019-11-19:

14:08 Changeset [33705] by kjdon
reindented the file, no code changes
14:04 Changeset [33704] by kjdon
added some authentication error strings. some are used by the …
14:03 Changeset [33703] by kjdon
added more breadcrumbs for ease of finding your way back to the start
14:00 Changeset [33702] by kjdon
added depositorTitleAndLink template. TODO - get depositor text from …
13:59 Changeset [33701] by kjdon
added more breadcrumbs to the page. And now it displays an error if …
13:56 Changeset [33700] by kjdon
starting to put some of the strings into a dictionary - using …
13:53 Changeset [33699] by kjdon
first stab at requiring a user to be logged in to use the depositor, …

2019-11-15:

23:14 Changeset [33698] by ak19
Links to more reading
23:10 Changeset [33697] by ak19
Changes to the README to provide instructions on making the fewest …
22:09 Changeset [33696] by ak19
Moved the individual READMEs into the top level too along with …
20:29 Changeset [33695] by ak19
Minor corrections and file rename
20:03 Changeset [33694] by ak19
interfaces\images folder structure can be customised for a site as an …
20:01 Changeset [33693] by ak19
Forgot we needed a toplevel collect folder
19:57 Changeset [33692] by ak19
Math collection toplevel folder restructure
19:56 Changeset [33691] by ak19
Math collection renames and moving things about
19:54 Changeset [33690] by ak19
Moving the science collection to the top level
19:52 Changeset [33689] by ak19
Rename again
19:51 Changeset [33688] by ak19
Moving the science collection related README, screenshot and …
19:49 Changeset [33687] by ak19
Renaming science collection to not have period mark in its name, on …
19:46 Changeset [33686] by ak19
18:50 Changeset [33685] by davidb
Upstream change related to Solr ext
18:49 Changeset [33684] by davidb
Changes made around the time of the launch
18:46 Changeset [33683] by davidb
Updated to process latest version of spreadsheet
18:45 Changeset [33682] by davidb
Changes made around the time of the launch
18:44 Changeset [33681] by davidb
Added in flock technique to avoid multiple people running the same script
18:43 Changeset [33680] by davidb
Greenstone3 is fixed, so don't need to print out message about running …
18:38 Changeset [33679] by davidb
Folder for working on updates (PDFs to del, PDFs to add) from Kiri
17:57 Changeset [33678] by davidb
setup for greenstone ext
17:57 Changeset [33677] by davidb
Intro text
17:55 Changeset [33676] by davidb
Some initial work getting a plugin going that calls Alex's VirusTotal
00:22 Changeset [33675] by ak19
Committing the newer query results (but from before today's …
00:21 Changeset [33674] by ak19
Changes to support the top 5 predicted langcodes and their confidence …
00:17 Changeset [33673] by ak19
Waikato Education Department's Science Activities and Maths Activities …

2019-11-14:

14:14 Changeset [33672] by kjdon
modified slightly so that the error messages come from the dictionary …
14:12 Changeset [33671] by kjdon
added a static getTextString method - currently this is in Action.java …
14:10 Changeset [33670] by kjdon
added editEnabled att string
14:10 Changeset [33669] by kjdon
removed an annoying debug message
10:03 Changeset [33668] by kjdon
a few changes to debuginfo texts
09:55 Changeset [33667] by kjdon
preProcess.xsl renamed to expand-gslib.xsl to better indicate what it does

2019-11-13:

23:08 Changeset [33666] by ak19
Having finished sending all the crawl data to mongodb 1. Recrawled the …
17:18 Changeset [33665] by davidb
Fixed jar name
17:17 Changeset [33664] by davidb
Initial version code for running VirusTotal API against files, CLI scripts
17:12 Changeset [33663] by davidb
Changes after testing the scripts
17:04 Changeset [33662] by davidb
Scripts to compile and run java code
16:54 Changeset [33661] by davidb
Compiling needs to use Maven
16:53 Changeset [33660] by davidb
For Java source code
16:40 Changeset [33659] by davidb
Top-level folder for new extension based on VirusTotal API which scans …
16:40 Changeset [33658] by davidb
Top-level folder for new extension based on VirusTotal API which scans …

2019-11-12:

21:33 Changeset [33657] by ak19
Some fixes after brief testing against 1/3 of the crawl. Restarted …
21:11 Changeset [33656] by ak19
Final minor changes before I start processing the crawls of node2.
20:56 Changeset [33655] by ak19
Minor change to print statement
20:54 Changeset [33654] by ak19
Removing jar file that wasn't used after all.
20:51 Changeset [33653] by ak19
1. As suggested by Dr Bainbridge, made the code changes to use Morphia …
20:41 Changeset [33652] by ak19
Introducing morphia subpackage
18:11 Changeset [33651] by ak19
1. Bugfix: overlappingSentences works. 2. storing numSentencesInMaor
12:06 Changeset [33650] by kjdon
updated to match the new xsl file names; lots of variable renames to …
12:04 Changeset [33649] by kjdon
renamed config_format and text_fragment_format to better represent …
12:04 Changeset [33648] by kjdon
changed the debuginfo xsl and strings to match the new o=xxx debug options
09:30 Changeset [33647] by kjdon
added/changed a few of the output values for debugging the transform

2019-11-11:

18:46 Changeset [33646] by ak19
Saving the mongodb queries and learning links that Dr Bainbridge found …
18:45 Changeset [33645] by ak19
Fix to 2 bugs when sending data to MongoDB: 1. overlappingSentences …
11:50 Changeset [33644] by ak19
Just committing the growing mongodb.txt file with links and …
11:46 Changeset [33643] by ak19
Brought the template log4j.properties.in back up to speed. I forgot it …
11:06 Changeset [33642] by ak19
Forgot to commit the java driver for mongodb when I committed the Java …
10:53 Changeset [33641] by kjdon
commented out some debug statements
10:48 Changeset [33640] by kjdon
oops, I must have 'tidied' up the file and then not compiled it to …
10:23 Changeset [33639] by kjdon
need to select child nodes, otherwise the gsf:default node ends up in …
10:22 Changeset [33638] by kjdon
gslib doesn't use xml-to-string.xsl. it's only used by formatmanager, …
10:21 Changeset [33637] by kjdon
we can now use gsf and gslib in layout files.
10:04 Changeset [33636] by kjdon
include means the stylesheet gets added inline, import means it gets …
09:38 Changeset [33635] by ak19
Maori-language-detection doesn't use Greenstone 3 at present, it's not …

2019-11-08:

23:59 Changeset [33634] by ak19
Rewrote NutchTextDumpProcessor as NutchTextDumpToMongoDB.java, which …
19:43 Changeset [33633] by ak19
1. TextLanguageDetector now has methods for collecting all sentences …

2019-11-07:

14:53 Changeset [33632] by kjdon
overhaul of TransformingReceptionist. changed the order of inlining …
14:52 Changeset [33631] by kjdon
added a bit more error reporting
14:44 Changeset [33630] by kjdon
minor comment changes
14:20 Changeset [33629] by kjdon
added methods using Parameter2 - for params with text node values
13:52 Changeset [33628] by kjdon
not sure why documentNode was a gsf:template here. Can't be like that …
09:28 Changeset [33627] by kjdon
removed unnecessary comments

2019-11-05:

21:59 Changeset [33626] by ak19
TODOs
21:58 Changeset [33625] by ak19
A file listing domains with seedurls containing /mi(/) that are …
21:48 Changeset [33624] by ak19
Some cleanup surrounding the now renamed function createSeedURLsFile, …
21:04 Changeset [33623] by ak19
1. Incorporated Dr Nichols earlier suggestion of storing page modified …
15:42 Changeset [33622] by ak19
File rename

2019-11-04:

20:35 Changeset [33621] by ak19
Committing jotted down mongodb related instructions from what Dr …
14:24 Changeset [33620] by ak19
Final crawl, done on vagrant VM node6. Crawl site IDs 01407-01462.
11:36 Changeset [33619] by kjdon
need to handle the case where a collection file (eg image) gets …