source: other-projects

Revision Log Mode:


Legend:

Added
Modified
Copied or renamed
Diff Rev Age Author Log Message
(edit) @35884   5 months cstephen Upgrade log4j 2.16.0 -> 2.17.1 to resolve CVE-2021-44832 and CVE-2021-45105
(edit) @35829   5 months cstephen Upgrade log4j 2.15.0 -> 2.16.0 to resolve CVE-2021-45046
(edit) @35808   5 months cstephen Upgrade log4j from 2.12.1 -> 2.15.0. Resolves RCE vunerability and …
(edit) @35806   5 months cstephen Fix startup failure when temp dir already exists
(edit) @35791   6 months cstephen Add updated macroniser code. This is a significant change to the …
(edit) @35790   6 months cstephen Remove old macroniser code. Next commit will introduce an updated …
(edit) @35778   6 months cstephen Branch of original macroniser; before work was undertaken to …
(edit) @35777   6 months cstephen Prepare tags directory
(edit) @35750   6 months cstephen Cleanup DI
(edit) @35732   7 months cstephen Improve DI return to account for newlines in JSON. Fix DI ignoring …
(edit) @35725   7 months cstephen Add support for macronising PowerPoint files
(edit) @35722   7 months cstephen Remove unnecessary comment
(edit) @35721   7 months cstephen Implement json response for file macronisation
(edit) @35720   7 months cstephen Refactor FileUpload logic
(edit) @35719   7 months cstephen Add support for JSON response to direct input queries. Cleanup other …
(edit) @35529   8 months anupama Updating URL in README.txt
(edit) @35366   9 months davidb Version of war file compiled up from soruce
(edit) @35365   9 months davidb Initial round of files to provide a vncserver service that Guacamole …
(edit) @35231   10 months kjdon Another field (ex.File.FileCreateDate) that needs to be ignored when …
(edit) @34959   15 months davidb Top-level folder for a new project to support an individual researcher …
(edit) @34958   15 months davidb Top-level folder for a new project to support an individual researcher …
(edit) @34955   15 months anupama Some local changes on the 64 bit linux that I'd created long back to …
(edit) @34954   15 months anupama The newer EXIF that was committed introduced some additional …
(edit) @34941   15 months kjdon 1. Kathy's changes for working with the updated ID file to upload to …
(edit) @34940   15 months anupama Previous commit was from Win machine, this from 32 bit LSB linux. …
(edit) @34939   15 months kjdon http to https change for greenstone.org url (as was the case for …
(edit) @34934   15 months anupama AUTOCOMMIT by gen-model-colls.sh script. Message: Rebuilding …
(edit) @34933   15 months anupama AUTOCOMMIT by gen-model-colls.sh script. Message: Rebuilding …
(edit) @34637   17 months davidb Change of external reference to be SETUP.bash, not SETUP.sh
(edit) @34636   17 months davidb Initial set of svn:externals to get 'ml-processing' off the ground
(edit) @34635   17 months davidb Directory that holds together a skeleton set of the Greenstone3 …
(edit) @34617   17 months anupama Before we forget, putting Kathy's new script for uploading to the …
(edit) @34524   19 months ak19 Correct Mac OS name in log file being uploaded
(edit) @34523   19 months ak19 Minor. After testing on new release-kit mac.
(edit) @34520   19 months Jeremy Symon need to use ed25519 key on www-internal
(edit) @34519   19 months Jeremy Symon adding in code to upload to www-internal. Needs a new ed25519 identity …
(edit) @34518   19 months Jeremy Symon use a different identity file for www-internal - needs to be ed25519, …
(edit) @34515   19 months ak19 AUTOCOMMIT by gen-model-colls.sh script. Message: Forgot to svn up …
(edit) @34514   19 months ak19 AUTOCOMMIT by gen-model-colls.sh script. Message: Forgot to svn up …
(edit) @34513   19 months ak19 AUTOCOMMIT by gen-model-colls.sh script. Message: Rebuilding after …
(edit) @34512   19 months ak19 AUTOCOMMIT by gen-model-colls.sh script. Message: Rebuilding after …
(edit) @34418   20 months ak19 Attempted to upload diffcol report to wwwinternal instead of wwwdev. …
(edit) @34417   20 months ak19 Updates to diffcol to handle change introduced in commit 34394, which …
(edit) @34416   20 months ak19 Committing rebuilt model collections after new doc.xml meta …
(edit) @34231   2 years ak19 Rebuilding diffcol model collection Multimedia after recent update to …
(edit) @34127   2 years ak19 Spelling correction in filename: screeMshot to screeNshot
(edit) @34120   2 years ak19 CSV version of .ods file, so openoffice isn't required
(edit) @34119   2 years ak19 Committing the auto-generated analysis results folder, …
(edit) @34097   2 years ak19 Open office version of similarly named spreadsheet, just with columns …
(edit) @34089   2 years ak19 So far accumulated URLs to docs on Google scholar about or somewhat …
(edit) @34011   2 years ak19 Piechart data for sites prepared for crawling and the piecharts for these
(edit) @34007   2 years ak19 Prepared more data for the piecharts. This time for empty web pages vs …
(edit) @34006   2 years ak19 Committing more data I've collected for generating pie charts and the …
(edit) @34005   2 years ak19 InfoOnEmptyPagesNotInMongoDB.txt is now written out to a file, instead …
(edit) @34004   2 years ak19 Renaming csv file to have csv extension
(edit) @34003   2 years ak19 Redid the file with info on empty URL web pages as a csv file with …
(edit) @34001   2 years ak19 Tentative total urls from common crawl 12 month cral data.
(edit) @34000   2 years ak19 Some debugging and other minor changes
(edit) @33999   2 years ak19 Common crawl 12 month urls and CC provided stats
(edit) @33988   2 years ak19 1. Print out which web pages of which web site's dump.txt were empty. …
(edit) @33987   2 years ak19 Output of re-running NutchTextDumpToMongoDB to print out which web …
(edit) @33986   2 years ak19 Dr Bainbridge investigated the original data set more
(edit) @33985   2 years ak19 Data to back the piechart I need to make that will illustrate how we …
(edit) @33984   2 years ak19 Simple class to summarise some basic counts of the input common crawl data
(edit) @33983   2 years ak19 More sensible name for method which had too long kept its old name …
(edit) @33982   2 years ak19 SummaryTool.java now processed the handcrafted UNIQUE domains counts …
(edit) @33981   2 years ak19 As Dr Bainbridge suggested, code now opens a new firefox tab with a …
(edit) @33980   2 years ak19 Additional comments
(edit) @33979   2 years ak19 Clearly stating that counts are of unique domains
(edit) @33978   2 years ak19 Opens all geoJSON maps in new tabs instead of waiting for user to have …
(edit) @33977   2 years ak19 Added something on precision vs recall being applicable to our …
(edit) @33976   2 years ak19 Adding in what I could remember of Dr Bainbridge's statement about the …
(edit) @33966   2 years ak19 Added the origSequence and basicDomain columns to the random 260 web …
(edit) @33965   2 years ak19 1. Adding a basicDomain column (stripped of http/https and www prefix) …
(edit) @33964   2 years ak19 2 records were missing a value for the qualityLevel column.
(edit) @33963   2 years ak19 Added a new helper method to MongoDBQueryer.java to add numPagesInMRI …
(edit) @33962   2 years ak19 2 fields changed, as one was missed out and the other incorrectly …
(edit) @33961   2 years ak19 New category, LINK_TEXT, introduced for the random web page URL samples.
(edit) @33960   2 years ak19 Reviewed all the random sample web page URLs marked …
(edit) @33959   2 years ak19 URIEncoding the mapData makes it unparseable by geojson.io
(edit) @33952   2 years ak19 Minor changes for processing
(edit) @33951   2 years ak19 Reviewed the qualityLevel column where LITTLE_TEXT was assigned.
(edit) @33950   2 years ak19 Reviewed the qualityLevel column where MIXED_TEXT was assigned.
(edit) @33949   2 years ak19 Reviewed the qualityLevel column where NAV was assigned.
(edit) @33948   2 years ak19 Reviewed the random sampled web page URLs marked as …
(edit) @33947   2 years ak19 Some more questionmarked field values assigned.
(edit) @33946   2 years ak19 1. New function to handle user input assigning the newly introduced …
(edit) @33945   2 years ak19 Added a 4th column for all 260 sample web page URLs and have used the …
(edit) @33944   2 years ak19 Added the isReallyInMRI column after manually inspecting the remaining …
(edit) @33941   2 years ak19 1. Uppercase 3rd field (Y/N/? field) read back in from file before …
(edit) @33940   2 years ak19 1. In order to make it easier to do the manual work of inspecting 260 …
(edit) @33939   2 years ak19 1. Old random samples file doesn't apply as we're not sampling by …
(edit) @33938   2 years ak19 1. Don't regenerate random sample of web page urls and full web page …
(edit) @33937   2 years ak19 New counts of manual sites after reingesting into MongoDB. Forgot to …
(edit) @33936   2 years ak19 Renaming old file to place with new counts after reingesting into MongoDB.
(edit) @33926   2 years ak19 Investigated some other options for screen capturing and Google chrome …
(edit) @33925   2 years ak19 1. Bugfix: oversight, should return uri encoded URL for mapData, …
(edit) @33924   2 years ak19 Adding in Dr Bainbridge's command to check the JSON generated is …
(edit) @33919   2 years ak19 SummaryTool now uses the CountryCodeCountsMapData.java class to …
(edit) @33918   2 years ak19 Country codes added to each domain's URL of the manual site/domain …
Note: See TracRevisionLog for help on using the revision log.