|
|
@17127
|
16 years |
kjdon |
want to block body background, so added it into tabbg_matches regex …
|
|
|
@17126
|
16 years |
kjdon |
inherit and use args from ReadTextFile cos we want the file encoding stuff
|
|
|
@17112
|
16 years |
kjdon |
CJK text segmentation now done at indexing level (in buildproc), not …
|
|
|
@17103
|
16 years |
ak19 |
OAI files should be explodable, so added that back in as an option
|
|
|
@17099
|
16 years |
kjdon |
in get_language_encoding, we extract head from html files. if it's not …
|
|
|
@17088
|
16 years |
davidb |
Plugin modified to only print out URL encoded filename if different to …
|
|
|
@17066
|
16 years |
ak19 |
OAIPlugin now works again: 1. needs to inherit from ReadTextFile as …
|
|
|
@17059
|
16 years |
ak19 |
The invalid MIMEtype image/jpg for generated images is now changed to …
|
|
|
@17054
|
16 years |
kjdon |
a previous change to ReadXMLFile meant that all namespaces were …
|
|
|
@17053
|
16 years |
kjdon |
made the default OIDtype auto, add message_id at end instead of start …
|
|
|
@17033
|
16 years |
kjdon |
fixed up some bugs in add_OID
|
|
|
@17028
|
16 years |
kjdon |
changed METS to GreenstoneMETS in the code
|
|
|
@17027
|
16 years |
kjdon |
METSPlugin only processes greenstone mets, so renamed it.
|
|
|
@17026
|
16 years |
kjdon |
OID generation modifications: OIDtype and OIDmetadata options now …
|
|
|
@16997
|
16 years |
ak19 |
Multilingual filehandling in subroutine extra_metadata
|
|
|
@16995
|
16 years |
kjdon |
fixed a misnamed method call
|
|
|
@16994
|
16 years |
kjdon |
add a test for invalid real media file
|
|
|
@16991
|
16 years |
kjdon |
fixed up a couple of things to do with assoc files
|
|
|
@16981
|
16 years |
kjdon |
forgot to commit the changes to the BaseMediaConverter file after …
|
|
|
@16976
|
16 years |
kjdon |
this is a base class for converters, not an actual converter so …
|
|
|
@16970
|
16 years |
kjdon |
changed an output statement
|
|
|
@16961
|
16 years |
ak19 |
Fixed a bug introduced in last commit
|
|
|
@16960
|
16 years |
ak19 |
Fixed a bug introduced in last commit
|
|
|
@16958
|
16 years |
ak19 |
Expanded a comment
|
|
|
@16957
|
16 years |
ak19 |
Associated file names are url-encoded
|
|
|
@16956
|
16 years |
ak19 |
Associated file names are url-encoded
|
|
|
@16955
|
16 years |
ak19 |
Associated file names are url-encoded
|
|
|
@16954
|
16 years |
ak19 |
Minor edit since a subroutine has been renamed
|
|
|
@16953
|
16 years |
ak19 |
Associated file names are url-encoded
|
|
|
@16952
|
16 years |
ak19 |
Associated file names are url-encoded
|
|
|
@16937
|
16 years |
ak19 |
Correction to setting new SourceFile metadata
|
|
|
@16935
|
16 years |
ak19 |
Removed the commented out debug statements now, since I think …
|
|
|
@16932
|
16 years |
ak19 |
Fixed error I introduced in the changes committed previously
|
|
|
@16923
|
16 years |
ak19 |
1. Sets both the Source and new SourceFile metadata; 2. Calls …
|
|
|
@16922
|
16 years |
ak19 |
1. The doclink uses the new SourceFile metadata, which is the …
|
|
|
@16921
|
16 years |
ak19 |
src_link uses the new SourceFile metadata, which is the url-encoded …
|
|
|
@16920
|
16 years |
ak19 |
1. When checking whether the original_file is missing, it ensures that …
|
|
|
@16919
|
16 years |
ak19 |
set_Source_metadata modified to also set SourceFile metadata which …
|
|
|
@16904
|
16 years |
ak19 |
1. The percents in URL-encoded associated images' filenames are only …
|
|
|
@16901
|
16 years |
ak19 |
The percents in URL-encoded image filenames are only escaped with …
|
|
|
@16888
|
16 years |
ak19 |
URL encodes the name of the file(s) generated upon conversion to html, …
|
|
|
@16856
|
16 years |
kjdon |
in gli mode, print a warning about can't process images
|
|
|
@16854
|
16 years |
kjdon |
removed some forgotten debug statements
|
|
|
@16852
|
16 years |
kjdon |
no_block no longer affects blocking cover images. setting block_exp …
|
|
|
@16851
|
16 years |
kjdon |
removed a print statement I left in last commit
|
|
|
@16850
|
16 years |
kjdon |
changed a print statement to print to outhandle instead of STDERR
|
|
|
@16849
|
16 years |
kjdon |
fixed a bad error in previous commit where I had put super::init …
|
|
|
@16848
|
16 years |
kjdon |
added Thumbs.db to block list
|
|
|
@16847
|
16 years |
kjdon |
moved setting of self->basedir from BasePlugin to MediaConverter as …
|
|
|
@16846
|
16 years |
kjdon |
unhid the cache images option for gli
|
|
|
@16837
|
16 years |
ak19 |
Fixing an error introduced in the previous commit: after eval_dir_dots …
|
|
|
@16836
|
16 years |
ak19 |
1. When working out the web_url, it needs to know of any containing …
|
|
|
@16827
|
16 years |
davidb |
Edits needed to make pageImagePlugin work again after plugin restructuring
|
|
|
@16826
|
16 years |
davidb |
Base class use in modules such as ImageConverter.pm
|
|
|
@16825
|
16 years |
davidb |
Code for supporting cache merged back in with Katherine's restructured …
|
|
|
@16824
|
16 years |
davidb |
Filenames are now quoted in system calls, so they can have spaces in them.
|
|
|
@16822
|
16 years |
davidb |
White space adjustment
|
|
|
@16821
|
16 years |
davidb |
Introduced new merge_inheritance function, useful in a plugin …
|
|
|
@16820
|
16 years |
davidb |
Fixed cut and paste error in comment that talked about text plugin, …
|
|
|
@16812
|
16 years |
ak19 |
Undid the changes introduced in the last commit which url encode the …
|
|
|
@16799
|
16 years |
ak19 |
URL encoding href links for internal links since the database lookup …
|
|
|
@16771
|
16 years |
ak19 |
Changes to make it compatible with multilingual filenames. Uses URL …
|
|
|
@16769
|
16 years |
ak19 |
Intermediate version (with commented out debug statements). 1. Works …
|
|
|
@16768
|
16 years |
ak19 |
URL encodes filenames in order to handle cases of multilingual images …
|
|
|
@16767
|
16 years |
ak19 |
In progress: Filename encoding after working with it on Windows. Still …
|
|
|
@16765
|
16 years |
ak19 |
Only removes comments in head tag now when working out the encoding
|
|
|
@16753
|
16 years |
ak19 |
get_language_encoding for HTMLFiles strips out the comments before …
|
|
|
@16735
|
16 years |
ak19 |
When a directory of interlinking html files is dropped into GLI, …
|
|
|
@16724
|
16 years |
ak19 |
1. Dr Bainbridge added some language-encoding related methods that …
|
|
|
@16700
|
16 years |
kjdon |
changed a comment
|
|
|
@16699
|
16 years |
kjdon |
added auxiliary parameter to new - needed if you want to do new …
|
|
|
@16698
|
16 years |
kjdon |
added auxiliary parameter to new - needed if you want to do new …
|
|
|
@16697
|
16 years |
kjdon |
if marc mapping file cannot be located, print a warning about can't …
|
|
|
@16696
|
16 years |
kjdon |
added an option to XML parser to strip out namespaces. did this so …
|
|
|
@16695
|
16 years |
kjdon |
the last commit was by mistake - this one removes the print statements …
|
|
|
@16694
|
16 years |
kjdon |
MARCXMLPlugin uses textcat_language_and_encoding method from …
|
|
|
@16693
|
16 years |
kjdon |
MARCXMLPlugin uses textcat_language_and_encoding method from …
|
|
|
@16692
|
16 years |
kjdon |
code to read in marc mapping files moved from MARCXMLPlugin to …
|
|
|
@16677
|
16 years |
davidb |
Minor tweak to EmailPlugin to avoid directories that match \d+ being …
|
|
|
@16667
|
16 years |
kjdon |
get_language_encoding was setting ->input_encoding, which means its …
|
|
|
@16646
|
16 years |
kjdon |
now segments all metadata as well as text
|
|
|
@16644
|
16 years |
kjdon |
now uses CJKTextSegmenter to add segmentation functionality to text …
|
|
|
@16643
|
16 years |
kjdon |
removed a couple of 'use xxx' lines that are not needed
|
|
|
@16642
|
16 years |
kjdon |
separate_cjk option and code moved to CJKTextSegmenter, and used by …
|
|
|
@16640
|
16 years |
kjdon |
helper plugin to separate cjk text into individual characters
|
|
|
@16639
|
16 years |
kjdon |
moved the require diagnostics line to here from ReadTextFile
|
|
|
@16638
|
16 years |
kjdon |
modified store_block_files: includes script (js) files, don't add a …
|
|
|
@16632
|
16 years |
ak19 |
Work on supporting non-utf8 characters in filenames
|
|
|
@16580
|
16 years |
ak19 |
Shared subroutine tmp_area_convert_file now ensures that the tailname …
|
|
|
@16557
|
16 years |
ak19 |
Auto filename encoding has several additional settings now, these are …
|
|
|
@16555
|
16 years |
ak19 |
Instead of sub get_language_encoding applying function ensure_utf8 on …
|
|
|
@16521
|
16 years |
kjdon |
pass in the file extension to get_tmp_filename otherwise it doesn't …
|
|
|
@16520
|
16 years |
kjdon |
made smart_block option description say deprecated, and added a …
|
|
|
@16392
|
16 years |
kjdon |
global block pass: read_block is no more, use can_process_this_file to …
|
|
|
@16391
|
16 years |
kjdon |
global block pass: this plugin now does the blocking - when reading …
|
|
|
@16390
|
16 years |
kjdon |
global block pass: read_block is no more. blocking done in a first …
|
|
|
@16388
|
16 years |
kjdon |
global block pass: added in empty file_block_read method
|
|
|
@16386
|
16 years |
kjdon |
global block pass: now uses process_exp instead of block_exp. during …
|
|
|
@16384
|
16 years |
kjdon |
global block pass: new block_hash arg to read and metadata_read. Also …
|
|
|
@16383
|
16 years |
kjdon |
make sure filename is in utf8 before calling generate_images
|
|
|