source: gsdl/trunk/perllib

Revision Log Mode:


Legend:

Added
Modified
Copied or renamed
Diff Rev Age Author Log Message
(edit) @16888   16 years ak19 URL encodes the name of the file(s) generated upon conversion to html, …
(edit) @16856   16 years kjdon in gli mode, print a warning about can't process images
(edit) @16854   16 years kjdon removed some forgotten debug statements
(edit) @16852   16 years kjdon no_block no longer affects blocking cover images. setting block_exp …
(edit) @16851   16 years kjdon removed a print statement I left in last commit
(edit) @16850   16 years kjdon changed a print statement to print to outhandle instead of STDERR
(edit) @16849   16 years kjdon fixed a bad error in previous commit where I had put super::init …
(edit) @16848   16 years kjdon added Thumbs.db to block list
(edit) @16847   16 years kjdon moved setting of self->basedir from BasePlugin to MediaConverter as …
(edit) @16846   16 years kjdon unhid the cache images option for gli
(edit) @16844   16 years davidb File::RandomAccess added to support Image::ExifTool. Both are pure …
(edit) @16843   16 years davidb Additions to strings dictionary
(edit) @16842   16 years davidb ExifTool added to cpan area to support metadata extraction from files …
(edit) @16841   16 years davidb Supporting classes for conversion
(edit) @16837   16 years ak19 Fixing an error introduced in the previous commit: after eval_dir_dots …
(edit) @16836   16 years ak19 1. When working out the web_url, it needs to know of any containing …
(edit) @16827   16 years davidb Edited needed to make pageImagePlugin work again after plugin restructuring
(edit) @16826   16 years davidb Base class use in modules such as ImageConverter.pm
(edit) @16825   16 years davidb Code for supporting cache merged back in with Katherine's restructured …
(edit) @16824   16 years davidb Filenames are now quoted in system calls, so they can have spaces in them.
(edit) @16822   16 years davidb White space adjustment
(edit) @16821   16 years davidb Introduced new merge_inheritance function, useful in a plugin …
(edit) @16820   16 years davidb Fixed cut and pase error in comment that talked about text plugin, …
(edit) @16812   16 years ak19 Undid the changes introduced in the last commit which url encode the …
(edit) @16799   16 years ak19 URL encoding href links for internal links since the database lookup …
(edit) @16794   16 years davidb Tweak to filename_cat that had an explicit '/' in it. Didn't seem to …
(edit) @16793   16 years davidb Double nested function call broken down in to two separate lines to …
(edit) @16792   16 years davidb Comment out some debugging code
(edit) @16791   16 years davidb Improvement to downloading capabilities for WGet and OAI
(edit) @16771   16 years ak19 Changes to make it compatible with multilingual filenames. Uses URL …
(edit) @16769   16 years ak19 Intermediate version (with commented out debug statements). 1. Works …
(edit) @16768   16 years ak19 URL encodes filenames in order to handle cases of multilingual images …
(edit) @16767   16 years ak19 In progress: Filename encoding after working with it on Windows. Still …
(edit) @16765   16 years ak19 Only removes comments in head tag now when working out the encoding
(edit) @16753   16 years ak19 get_language_encoding for HTMLFiles strips out the comments before …
(edit) @16735   16 years ak19 When a directory of interlinking html files is dropped into GLI, …
(edit) @16726   16 years mdewsnip Added quote marks around sqlite executable path so it works when …
(edit) @16725   16 years mdewsnip Undid one of my changes from yesterday -- turns out the oaiserver …
(edit) @16724   16 years ak19 1. Dr Bainbridge added some language-encoding related methods that …
(edit) @16719   16 years ak19 Dr Bainbridge has changed several methods so that they can now be …
(edit) @16704   16 years mdewsnip Fixed two bugs with resumption token support.
(edit) @16700   16 years kjdon changed a comment
(edit) @16699   16 years kjdon added auxiliary parameter to new - needed if you want to do new …
(edit) @16698   16 years kjdon added auxiliary parameter to new - needed if you want to do new …
(edit) @16697   16 years kjdon if marc mapping file cannot be located, print a warning about can't …
(edit) @16696   16 years kjdon added an option to XML parser to strip out namespaces. did this so …
(edit) @16695   16 years kjdon the last commit was by mistake - this one removes the print statements …
(edit) @16694   16 years kjdon MARCXMLPlugin uses textcat_language_and_encoding method from …
(edit) @16693   16 years kjdon MARCXMLPlugin uses textcat_language_and_encoding method from …
(edit) @16692   16 years kjdon code to read in marc mapping files moved from MARCXMLPlugin to …
(edit) @16677   16 years davidb Minor tweak to EmailPlugin to avoid directories that match \d+ being …
(edit) @16674   16 years ak19 Added caching for textcat results on filecontents as well: a second …
(edit) @16673   16 years ak19 Removed comment. URL encode and URL decode subroutines added.
(edit) @16672   16 years ak19 URL encode and URL decode subroutines added.
(edit) @16670   16 years ak19 Instead of base64 encoding the gsdl_source_filename, it now URL …
(edit) @16667   16 years kjdon get_language_encoding was setting ->input_encoding, which means its …
(edit) @16647   16 years kjdon removed the segmentation lines from store_saved_metadata - this only …
(edit) @16646   16 years kjdon now segments all metadata as well as text
(edit) @16644   16 years kjdon now uses CJKTextSegmenter to add segmentation functionality to text …
(edit) @16643   16 years kjdon removed a couple of 'use xxx' lines that are not needed
(edit) @16642   16 years kjdon separate_cjk option and code moved to CJKTextSegmenter, and used by …
(edit) @16641   16 years kjdon upgraded this (using unicode 4.0) to include more Chinese characters …
(edit) @16640   16 years kjdon helper plugin to separate cjk text into individual characters
(edit) @16639   16 years kjdon moved the require diagnostics line to here from ReadTextFile
(edit) @16638   16 years kjdon modified store_block_files: includes script (js) files, don't add a …
(edit) @16636   16 years kjdon seems to be no longer used - replaced by iso639.pm
(edit) @16635   16 years kjdon modified the line where we get rid of the #Updated 13-Mar-2007 bit …
(edit) @16634   16 years kjdon removed NULPlugin.add_metadata_as_text as the translation was …
(edit) @16632   16 years ak19 Work on supporting non-utf8 characters in filenames
(edit) @16580   16 years ak19 Shared subroutine tmp_area_convert_file now ensures that the tailname …
(edit) @16578   16 years ak19 1. Base64 encoded gsdlsourcefilename to preserve original filename. 2. …
(edit) @16557   16 years ak19 Auto filename encoding has several additional settings now, these are …
(edit) @16556   16 years ak19 Added strings for additional types of auto settings for …
(edit) @16555   16 years ak19 Instead of sub get_language_encoding applying function ensure_utf8 on …
(edit) @16554   16 years ak19 Added subroutines classify_cached and clear_cache. The first of these …
(edit) @16553   16 years ak19 Added method check_is_utf8 that will return 1/true if the given string …
(edit) @16521   16 years kjdon pass in the file extension to get_tmp_filename otherwise it doesn't …
(edit) @16520   16 years kjdon made smart_block option description say deprecated, and added a …
(edit) @16506   16 years mdewsnip Now adds gs2:docOID attributes into "<Sec>" tags as well, to prevent …
(edit) @16504   16 years mdewsnip Changed some variable names in preparation for fixing the Lucene …
(edit) @16462   16 years ak19 1. FEDORA_VERSION has become the secondary environment variable when …
(edit) @16442   16 years ak19 Fixed yesterday's adjustment to envvar_prepend and envvar_append to …
(edit) @16436   16 years ak19 Moved the utility subroutine is_dir_empty from gsConvert.pl into here …
(edit) @16431   16 years mdewsnip Now passes the Greenstone document OID into GS2LuceneIndexer, to help …
(edit) @16426   16 years ak19 Minor changes. Although it may not be necessary, using filename_cat to …
(edit) @16414   16 years ak19 Slightly better way of dealing with GSDL3HOME not being set in the …
(edit) @16411   16 years ak19 Correction to previous 'bugfix' which was actually a mistake. Use of …
(edit) @16407   16 years ak19 Corrected change made yesterday: dealing with the undefined case (as …
(edit) @16406   16 years ak19 fedora_client_bin is added to PATH using the util package, since it …
(edit) @16404   16 years ak19 Subroutines envvar_prepend and envvar_append now only append a new …
(edit) @16398   16 years ak19 Need to take into account that catalina_home is undefined for gs2, …
(edit) @16396   16 years ak19 Bugfix that caused ingest to fail in GS3 remote cases but (for some …
(edit) @16395   16 years ak19 1. For Fedora CATALINA_HOME ought to be its own tomcat, but When …
(edit) @16392   16 years kjdon global block pass: read_block is no more, use can_process_this_file to …
(edit) @16391   16 years kjdon global block pass: this plugin now does the blocking - when reading …
(edit) @16390   16 years kjdon global block pass: read_block is no more. blockign done in a first …
(edit) @16388   16 years kjdon global block pass: added in empty file_block_read method
(edit) @16386   16 years kjdon global block pass: now uses process_exp instead of block_exp. during …
(edit) @16384   16 years kjdon global block pass: new block_hash arg to read and metadata_read. Also …
(edit) @16383   16 years kjdon make sure filename is in utf8 before calling generate_images
Note: See TracRevisionLog for help on using the revision log.