Diff Rev Age Author Log Message
(edit) @28018   11 years jmt12 Try really hard to capture the output from 'time' function as Medusa …
(edit) @28017   11 years jmt12 Forgot to add processing comment before call to hadoop_import.pl
(edit) @28016   11 years jmt12 Allow the hadoop report generator to parse start and end times …
(edit) @28015   11 years jmt12 Add an extra option that allows me to pass in the directory to write …
(edit) @28014   11 years jmt12 Remove tasks that have had data locality established from the array of …
(edit) @28013   11 years jmt12 A new script to run a battery of Hadoop ingests at varying replication …
(edit) @28012   11 years jmt12 Express start time as a double as well
(edit) @28011   11 years jmt12 Turn off debugging in the copy in SVN
(edit) @28010   11 years jmt12 Correctly set up the environment for calls to txt2tdb and also replace …
(edit) @28009   11 years ak19 Adding in the two Paged Img (Scanned Img) tutorial collections
(edit) @28008   11 years ak19 Updates to diffcol's doc.xml processing necessary for diffcol to …
(edit) @28007   11 years ak19 Adding the 2 collections for the MARC tutorial.
(edit) @28006   11 years sjm84 Fixing a bug caused when the context and the interface have the same name
(edit) @28005   11 years ak19 1. task.pl summarise cmd now prints out whether the diffcol result is …
(edit) @28004   11 years ak19 The task file (bash script) has been replaced with the perl version …
(edit) @28003   11 years ak19 The task file (bash script) has been replaced with the perl version …
(edit) @28002   11 years ak19 Adding the Demo-Section-Tagging tutorial collection
(edit) @28001   11 years jmt12 Write datestamp using dbutil if applicable
(edit) @28000   11 years jmt12 Functions for determining if the plugout supports writing Datestamp …
(edit) @27999   11 years sjm84 Fixes for metadata with base interfaces
(edit) @27998   11 years sjm84 Reformatting this file
(edit) @27997   11 years kjdon need to check that perfect hash function was generated otherwise we …
(edit) @27996   11 years jmt12 A new version of the archive with minor changes to log4j configuration
(edit) @27995   11 years jmt12 Just adding some code comments
(edit) @27994   11 years ak19 Renaming report from OS-diffcol- to diffcol-OS- for sorting reports on …
(edit) @27993   11 years ak19 Adding collections for Tudor tutorials that Jenny had gone through, …
(edit) @27992   11 years sjm84 Changing the way the file is turned into bytes
(edit) @27991   11 years sjm84 Get gslib.xsl the way we get all other xsl files
(edit) @27990   11 years ak19 2 fixes: 1. The Tudor collections' html source documents have stray …
(edit) @27989   11 years sjm84 Return was in the wrong place
(edit) @27988   11 years sjm84 Removing a print statement
(edit) @27987   11 years sjm84 If a file cannot be found in the given interface then check if it is …
(edit) @27986   11 years ak19 Fixing syntax error: forgot semi-colon
(edit) @27985   11 years ak19 Now prints out exit status and weights and passes commands that get …
(edit) @27984   11 years davidb Next round of changes after further testing and extra example development
(edit) @27983   11 years davidb EchoprintClassifier needs to be in the getMBID function as well
(edit) @27982   11 years sjm84 Fixed an error that was occurring on Windows due to backslashes
(edit) @27981   11 years ak19 Updating Word-PDF-Formatting tutorial model collection now that …
(edit) @27980   11 years ak19 Updating Word-PDF-Basic tutorial model collection now that extra_meta …
(edit) @27979   11 years ak19 Updating Small-HTML tutorial model collection now that extra_meta is …
(edit) @27978   11 years ak19 Updating PDFBox tutorial model collection now that extra_meta is sorted.
(edit) @27977   11 years ak19 Updating PDFBox tutorial model collection now that extra_meta is sorted.
(edit) @27976   11 years ak19 Updating Enhanced-PDF collection now that extra_meta is sorted and the …
(edit) @27975   11 years ak19 Cached folder is superfluous and not used by diffcol
(edit) @27974   11 years ak19 Rebuilt with extra_meta sorted
(edit) @27973   11 years ak19 Reinstating Dr Bainbridge's fix to getting the extra meta in sorted …
(edit) @27972   11 years ak19 1. Skips any modelcol (template) folder in model-collect, 2. Flag for …
(edit) @27971   11 years ak19 doc.xml should also ignore the FileSize metadata as there can be …
(edit) @27970   11 years ak19 Fix to regex. The regex sorting the ordering of the generated …
(edit) @27969   11 years ak19 When test-running diffcol on the Enhanced-PDF tutorial, it would choke …
(edit) @27968   11 years davidb 4store server needs to be run with -X to 'enable public cross-origin …
(edit) @27967   11 years ak19 Skip the 'cache' folder, which is the location where the paged_imgs …
(edit) @27966   11 years ak19 GS2 apache server had issues launching because it thought setup.bat …
(edit) @27965   11 years ak19 No need to make the ghostscript and imagemagick binaries executable …
(edit) @27964   11 years ak19 Setting the mac ghostscript and imagemagick binaries to executable in SVN
(edit) @27963   11 years ak19 Need Max's ghostscript binary with his imagemagick for darwin to …
(edit) @27962   11 years ak19 For darwin, need to get the imagemagick binaries that Max had …
(edit) @27961   11 years sjm84 More cascade-makeifying
(edit) @27960   11 years sjm84 Upgrading to the latest version of jodconverter
(edit) @27959   11 years sjm84 Renaming build-srcpack to packages
(edit) @27958   11 years ak19 Adding in the Enhanced-PDF model collection
(edit) @27957   11 years ak19 For now, undoing the change made to BasePlugin for the diffcol nightly …
(edit) @27956   11 years ak19 Rebuilt with latest pdfbox extension to bring it up to speed with …
(edit) @27955   11 years kjdon supports_memberof needs to be called from self otherwise we don't get …
(edit) @27954   11 years ak19 Removed old PDFBox collection
(edit) @27953   11 years ak19 Redid PDFBox collection without 2nd pdf file.
(edit) @27952   11 years ak19 Phasing out old PDFBox model collection
(edit) @27951   11 years ak19 Updating PDFBox collection with the extra metadata extracted (when …
(edit) @27950   11 years kjdon check that we actually have stem/case/accentfold before setting them - …
(edit) @27949   11 years ak19 Need to sort extra metadata (e.g. ex.PDF.* and ex.File.* meta …
(edit) @27948   11 years davidb First cut at a collection specifically designed to annotate the …
(edit) @27947   11 years davidb For the interface
(edit) @27946   11 years davidb For the collections
(edit) @27945   11 years davidb Top-level folder to contain the site/interface and collection …
(edit) @27944   11 years davidb Moving to one side to make way for more substantial set of collection files
(edit) @27943   11 years davidb Quick presentation fixes prior to giving demo
(edit) @27942   11 years sjm84 Adding PDF-box to the list of downloadable extensions
(edit) @27941   11 years sjm84 Using ArrayLists instead of arrays to fix an out of bounds exception …
(edit) @27940   11 years davidb Tidy up on error messages generated in this file
(edit) @27939   11 years davidb Symlink to help extension play nicely in a Greenstone3 context
(edit) @27938   11 years davidb Making PDF-Box something that Greenstone3 is always checked out with
(edit) @27937   11 years ak19 The tutorial doc changes for the last GS3 tutorials that needed going …
(edit) @27936   11 years ak19 General corrections made for the workshop version of the tutorial that …
(edit) @27935   11 years ak19 Basic changes and corrections to the Greenstone tutorials.
(edit) @27934   11 years davidb Some corrections after testing
(edit) @27933   11 years davidb Fixed small typo
(edit) @27932   11 years davidb Adding in ability to compile up Echonests fingerprint server
(edit) @27931   11 years davidb Adding in ability to compile up Echonests fingerprint server
(edit) @27930   11 years davidb Adding in ability to compile up Echonests fingerprint server
(edit) @27929   11 years davidb Further files used to compile up source code
(edit) @27928   11 years ak19 Fixing the omission in the advbeat_large collection config file that …
(edit) @27927   11 years ak19 Correcting error introduced in earlier commit.
(edit) @27926   11 years sjm84 Part of moving the userDB from localsite to web
(edit) @27925   11 years sjm84 Part of moving the userDB from localsite to web
(edit) @27924   11 years sjm84 Part of moving the userDB from localsite to web
(edit) @27923   11 years sjm84 Part of moving the userDB from localsite to web
(edit) @27922   11 years sjm84 Part of moving the userDB from localsite to web
(edit) @27921   11 years sjm84 Part of moving the userDB from localsite to web
(edit) @27920   11 years xiao changed catalina memory size from 400m to 1000m
(edit) @27919   11 years sjm84 Custom metadata fields are now saved properly