source: gsdl/trunk/perllib/plugins

Diff Rev Age Author Log Message
(edit) @17289   16 years kjdon moved the actual parsing from read into parse_file so other plugins …
(edit) @17250   16 years kjdon forgot to pass the arguments to ImageConverter::begin()
(edit) @17216   16 years kjdon trying to get OAI files exploding. Have copied in some code from one …
(edit) @17213   16 years ak19 Significant changes to subroutine get_language_encoding to better work …
(edit) @17212   16 years ak19 Removed some unnecessary commented-out code
(edit) @17197   16 years kjdon previous metadata changes meant that there was no longer URL metadata …
(edit) @17196   16 years kjdon set cover_image to false as it makes no sense for images
(edit) @17127   16 years kjdon want to block body background, so added it into tabbg_matches regex …
(edit) @17126   16 years kjdon inherit and use args from ReadTextFile cos we want the file encoding stuff
(edit) @17112   16 years kjdon CJK text segmentation now done at indexing level (in buildproc), not …
(edit) @17103   16 years ak19 OAI files should be explodable, so added that back in as an option
(edit) @17099   16 years kjdon in get_language_encoding, we extract head from html files. if it's not …
(edit) @17088   16 years davidb Plugin modified to only print out URL encoded filename if different to …
(edit) @17066   16 years ak19 OAIPlugin now works again: 1. needs to inherit from ReadTextFile as …
(edit) @17059   16 years ak19 The invalid MIMEtype image/jpg for generated images are now changed to …
(edit) @17054   16 years kjdon a previous change to ReadXMLFile meant that all namespaces were …
(edit) @17053   16 years kjdon made the default OIDtype auto, add message_id at end instead of start …
(edit) @17033   16 years kjdon fixed up some bugs in add_OID
(edit) @17028   16 years kjdon changed METS to GreenstoneMETS in the code
(edit) @17027   16 years kjdon METSPlugin only processes greenstone mets, so renamed it.
(edit) @17026   16 years kjdon OID generation modifications: OIDtype and OIDmetadata options now …
(edit) @16997   16 years ak19 Multilingual filehandling in subroutine extra_metadata
(edit) @16995   16 years kjdon fixed a misnamed method call
(edit) @16994   16 years kjdon add a test for invalid real media file
(edit) @16991   16 years kjdon fixed up a couple of things to do with assoc files
(edit) @16981   16 years kjdon forgot to commit the changes to the BaseMediaConverter file after …
(edit) @16976   16 years kjdon this is a base class for converters, not an actual converter so …
(edit) @16970   16 years kjdon changed an output statement
(edit) @16961   16 years ak19 Fixed a bug introduced in last commit
(edit) @16960   16 years ak19 Fixed a bug introduced in last commit
(edit) @16958   16 years ak19 Expanded a comment
(edit) @16957   16 years ak19 Associated file names are url-encoded
(edit) @16956   16 years ak19 Associated file names are url-encoded
(edit) @16955   16 years ak19 Associated file names are url-encoded
(edit) @16954   16 years ak19 Minor edit since a subroutine has been renamed
(edit) @16953   16 years ak19 Associated file names are url-encoded
(edit) @16952   16 years ak19 Associated file names are url-encoded
(edit) @16937   16 years ak19 Correction to setting new SourceFile metadata
(edit) @16935   16 years ak19 Removed the commented out debug statements now, since I think …
(edit) @16932   16 years ak19 Fixed error I introduced in the changes committed previously
(edit) @16923   16 years ak19 1. Sets both the Source and new SourceFile metadata; 2. Calls …
(edit) @16922   16 years ak19 1. The doclink uses the new SourceFile metadata, which is the …
(edit) @16921   16 years ak19 src_link uses the new SourceFile metadata, which is the url-encoded …
(edit) @16920   16 years ak19 1. When checking whether the original_file is missing, it ensures that …
(edit) @16919   16 years ak19 set_Source_metadata modified to also set SourceFile metadata which …
(edit) @16904   16 years ak19 1. The percents in URL-encoded associated images' filenames are only …
(edit) @16901   16 years ak19 The percents in URL-encoded image filenames are only escaped with …
(edit) @16888   16 years ak19 URL encodes the name of the file(s) generated upon conversion to html, …
(edit) @16856   16 years kjdon in gli mode, print a warning about can't process images
(edit) @16854   16 years kjdon removed some forgotten debug statements
(edit) @16852   16 years kjdon no_block no longer affects blocking cover images. setting block_exp …
(edit) @16851   16 years kjdon removed a print statement I left in last commit
(edit) @16850   16 years kjdon changed a print statement to print to outhandle instead of STDERR
(edit) @16849   16 years kjdon fixed a bad error in previous commit where I had put super::init …
(edit) @16848   16 years kjdon added Thumbs.db to block list
(edit) @16847   16 years kjdon moved setting of self->basedir from BasePlugin to MediaConverter as …
(edit) @16846   16 years kjdon unhid the cache images option for gli
(edit) @16837   16 years ak19 Fixing an error introduced in the previous commit: after eval_dir_dots …
(edit) @16836   16 years ak19 1. When working out the web_url, it needs to know of any containing …
(edit) @16827   16 years davidb Edits needed to make pageImagePlugin work again after plugin restructuring
(edit) @16826   16 years davidb Base class use in modules such as ImageConverter.pm
(edit) @16825   16 years davidb Code for supporting cache merged back in with Katherine's restructured …
(edit) @16824   16 years davidb Filenames are now quoted in system calls, so they can have spaces in them.
(edit) @16822   16 years davidb White space adjustment
(edit) @16821   16 years davidb Introduced new merge_inheritance function, useful in a plugin …
(edit) @16820   16 years davidb Fixed cut and paste error in comment that talked about text plugin, …
(edit) @16812   16 years ak19 Undid the changes introduced in the last commit which url encode the …
(edit) @16799   16 years ak19 URL encoding href links for internal links since the database lookup …
(edit) @16771   16 years ak19 Changes to make it compatible with multilingual filenames. Uses URL …
(edit) @16769   16 years ak19 Intermediate version (with commented out debug statements). 1. Works …
(edit) @16768   16 years ak19 URL encodes filenames in order to handle cases of multilingual images …
(edit) @16767   16 years ak19 In progress: Filename encoding after working with it on Windows. Still …
(edit) @16765   16 years ak19 Only removes comments in head tag now when working out the encoding
(edit) @16753   16 years ak19 get_language_encoding for HTMLFiles strips out the comments before …
(edit) @16735   16 years ak19 When a directory of interlinking html files is dropped into GLI, …
(edit) @16724   16 years ak19 1. Dr Bainbridge added some language-encoding related methods that …
(edit) @16700   16 years kjdon changed a comment
(edit) @16699   16 years kjdon added auxiliary parameter to new - needed if you want to do new …
(edit) @16698   16 years kjdon added auxiliary parameter to new - needed if you want to do new …
(edit) @16697   16 years kjdon if marc mapping file cannot be located, print a warning about can't …
(edit) @16696   16 years kjdon added an option to XML parser to strip out namespaces. did this so …
(edit) @16695   16 years kjdon the last commit was by mistake - this one removes the print statements …
(edit) @16694   16 years kjdon MARCXMLPlugin uses textcat_language_and_encoding method from …
(edit) @16693   16 years kjdon MARCXMLPlugin uses textcat_language_and_encoding method from …
(edit) @16692   16 years kjdon code to read in marc mapping files moved from MARCXMLPlugin to …
(edit) @16677   16 years davidb Minor tweak to EmailPlugin to avoid directories that match \d+ being …
(edit) @16667   16 years kjdon get_language_encoding was setting ->input_encoding, which means its …
(edit) @16646   16 years kjdon now segments all metadata as well as text
(edit) @16644   16 years kjdon now uses CJKTextSegmenter to add segmentation functionality to text …
(edit) @16643   16 years kjdon removed a couple of 'use xxx' lines that are not needed
(edit) @16642   16 years kjdon separate_cjk option and code moved to CJKTextSegmenter, and used by …
(edit) @16640   16 years kjdon helper plugin to separate cjk text into individual characters
(edit) @16639   16 years kjdon moved the require diagnostics line to here from ReadTextFile
(edit) @16638   16 years kjdon modified store_block_files: includes script (js) files, don't add a …
(edit) @16632   16 years ak19 Work on supporting non-utf8 characters in filenames
(edit) @16580   16 years ak19 Shared subroutine tmp_area_convert_file now ensures that the tailname …
(edit) @16557   16 years ak19 Auto filename encoding has several additional settings now, these are …
(edit) @16555   16 years ak19 Instead of sub get_language_encoding applying function ensure_utf8 on …
(edit) @16521   16 years kjdon pass in the file extension to get_tmp_filename otherwise it doesn't …
(edit) @16520   16 years kjdon made smart_block option description say deprecated, and added a …