|
|
@16771
|
16 years |
ak19 |
Changes to make it compatible with multilingual filenames. Uses URL …
|
|
|
@16769
|
16 years |
ak19 |
Intermediate version (with commented out debug statements). 1. Works …
|
|
|
@16768
|
16 years |
ak19 |
URL encodes filenames in order to handle cases of multilingual images …
|
|
|
@16767
|
16 years |
ak19 |
In progress: Filename encoding after working with it on Windows. Still …
|
|
|
@16765
|
16 years |
ak19 |
Only removes comments in head tag now when working out the encoding
|
|
|
@16753
|
16 years |
ak19 |
get_language_encoding for HTMLFiles strips out the comments before …
|
|
|
@16735
|
16 years |
ak19 |
When a directory of interlinking html files is dropped into GLI, …
|
|
|
@16724
|
16 years |
ak19 |
1. Dr Bainbridge added some language-encoding related methods that …
|
|
|
@16700
|
16 years |
kjdon |
changed a comment
|
|
|
@16699
|
16 years |
kjdon |
added auxiliary parameter to new - needed if you want to do new …
|
|
|
@16698
|
16 years |
kjdon |
added auxiliary parameter to new - needed if you want to do new …
|
|
|
@16697
|
16 years |
kjdon |
if marc mapping file cannot be located, print a warning about can't …
|
|
|
@16696
|
16 years |
kjdon |
added an option to XML parser to strip out namespaces. did this so …
|
|
|
@16695
|
16 years |
kjdon |
the last commit was by mistake - this one removes the print statements …
|
|
|
@16694
|
16 years |
kjdon |
MARCXMLPlugin uses textcat_language_and_encoding method from …
|
|
|
@16693
|
16 years |
kjdon |
MARCXMLPlugin uses textcat_language_and_encoding method from …
|
|
|
@16692
|
16 years |
kjdon |
code to read in marc mapping files moved from MARCXMLPlugin to …
|
|
|
@16677
|
16 years |
davidb |
Minor tweak to EmailPlugin to avoid directories that match \d+ being …
|
|
|
@16667
|
16 years |
kjdon |
get_language_encoding was setting ->input_encoding, which means its …
|
|
|
@16646
|
16 years |
kjdon |
now segments all metadata as well as text
|
|
|
@16644
|
16 years |
kjdon |
now uses CJKTextSegmenter to add segmentation functionality to text …
|
|
|
@16643
|
16 years |
kjdon |
removed a couple of 'use xxx' lines that are not needed
|
|
|
@16642
|
16 years |
kjdon |
separate_cjk option and code moved to CJKTextSegmenter, and used by …
|
|
|
@16640
|
16 years |
kjdon |
helper plugin to separate cjk text into individual characters
|
|
|
@16639
|
16 years |
kjdon |
moved the require diagnostics line to here from ReadTextFile
|
|
|
@16638
|
16 years |
kjdon |
modified store_block_files: includes script (js) files, don't add a …
|
|
|
@16632
|
16 years |
ak19 |
Work on supporting non-utf8 characters in filenames
|
|
|
@16580
|
16 years |
ak19 |
Shared subroutine tmp_area_convert_file now ensures that the tailname …
|
|
|
@16557
|
16 years |
ak19 |
Auto filename encoding has several additional settings now, these are …
|
|
|
@16555
|
16 years |
ak19 |
Instead of sub get_language_encoding applying function ensure_utf8 on …
|
|
|
@16521
|
16 years |
kjdon |
pass in the file extension to get_tmp_filename otherwise it doesn't …
|
|
|
@16520
|
16 years |
kjdon |
made smart_block option description say deprecated, and added a …
|
|
|
@16392
|
16 years |
kjdon |
global block pass: read_block is no more, use can_process_this_file to …
|
|
|
@16391
|
16 years |
kjdon |
global block pass: this plugin now does the blocking - when reading …
|
|
|
@16390
|
16 years |
kjdon |
global block pass: read_block is no more. blocking done in a first …
|
|
|
@16388
|
16 years |
kjdon |
global block pass: added in empty file_block_read method
|
|
|
@16386
|
16 years |
kjdon |
global block pass: now uses process_exp instead of block_exp. during …
|
|
|
@16384
|
16 years |
kjdon |
global block pass: new block_hash arg to read and metadata_read. Also …
|
|
|
@16383
|
16 years |
kjdon |
make sure filename is in utf8 before calling generate_images
|
|
|
@16382
|
16 years |
kjdon |
filename_no_path arg to generate_images must now be in utf8, and then …
|
|
|
@16341
|
16 years |
kjdon |
save attachments in binary mode so they work on windows. Use …
|
|
|
@16308
|
16 years |
kjdon |
unhide separate_cjk option in GLI - no longer a global option, just a …
|
|
|
@16301
|
16 years |
ak19 |
sub tmp_area_convert_file--called to replace a plain text source file …
|
|
|
@16257
|
16 years |
mdewsnip |
Tidied up the block of code that determines whether each doc.xml file …
|
|
|
@16247
|
16 years |
ak19 |
Regular expression that processes imagelinks is slightly modified by …
|
|
|
@16193
|
16 years |
kjdon |
forgot to define outhandle last commit
|
|
|
@16104
|
16 years |
kjdon |
tried to make the 'xxxplugin processing file' print statements more …
|
|
|
@16025
|
16 years |
kjdon |
added license info
|
|
|
@16024
|
16 years |
kjdon |
indented the file properly
|
|
|
@16022
|
16 years |
kjdon |
removed SourceUTF8 metadata, Source metadata is now utf8. Note, still …
|
|
|
@16021
|
16 years |
kjdon |
commented out input_encoding stuff cos we don't have that option …
|
|
|
@16019
|
16 years |
kjdon |
changed some more string keys
|
|
|
@16016
|
16 years |
kjdon |
changed some key names for strings.properties
|
|
|
@16014
|
16 years |
kjdon |
changed some strings.properties key names.
|
|
|
@16013
|
16 years |
kjdon |
updated some plugin names in some of the keys for strings.properties
|
|
|
@16012
|
16 years |
kjdon |
moved the -first option to AutoExtractMetadata
|
|
|
@16011
|
16 years |
kjdon |
moved the -first option to here from ReadTextFile
|
|
|
@16010
|
16 years |
kjdon |
changed the imagemagick check before calling generate images
|
|
|
@16009
|
16 years |
kjdon |
changed some string keys, added a check for imagemagick before calling …
|
|
|
@16008
|
16 years |
kjdon |
create_thumbnail and create_screenview are now enum with true and …
|
|
|
@15971
|
16 years |
mdewsnip |
Changed call to get_full_filename() to get_full_filenames().
|
|
|
@15970
|
16 years |
kjdon |
changed an HTMLPlug to HTMLPlugin
|
|
|
@15969
|
16 years |
kjdon |
added in desc for metadata_fields arg
|
|
|
@15963
|
16 years |
kjdon |
commented out the textcat stuff in post_process - need to think about …
|
|
|
@15962
|
16 years |
kjdon |
added a check for info_only before looking at the arguments
|
|
|
@15961
|
16 years |
kjdon |
added abstract field to options
|
|
|
@15925
|
16 years |
kjdon |
use the proper options for PagedImagePlugin
|
|
|
@15919
|
16 years |
kjdon |
moved the loadGISDatabase code into a new method …
|
|
|
@15918
|
16 years |
kjdon |
tidied up new method to match other plugins
|
|
|
@15914
|
16 years |
kjdon |
removed some old stuff and moved around some methods
|
|
|
@15911
|
16 years |
kjdon |
tidy up a couple of places using dummy text and NoText metadata
|
|
|
@15906
|
16 years |
kjdon |
inherits from AutoExtractMetadata now, not BasePlugin
|
|
|
@15905
|
16 years |
kjdon |
changed some comments, also, new ReadTextFile, need to pass in extra …
|
|
|
@15904
|
16 years |
kjdon |
input_encoding option no longer used
|
|
|
@15903
|
16 years |
kjdon |
input_encoding option no longer used
|
|
|
@15902
|
16 years |
kjdon |
no longer uses input_encoding
|
|
|
@15887
|
16 years |
mdewsnip |
Added "use strict" to the few files that were missing it, and fixing …
|
|
|
@15881
|
16 years |
kjdon |
auxiliary plugins now pass an extra argument to the PrintInfo …
|
|
|
@15880
|
16 years |
kjdon |
made this inherit from BasePlugin instead of AbstractPlugin - cos it …
|
|
|
@15877
|
16 years |
ak19 |
Minor edits to subroutine calls
|
|
|
@15872
|
16 years |
kjdon |
plugin overhaul: plugins renamed to xxPlugin, and in some cases the …
|
|
|
@15871
|
16 years |
kjdon |
plugin overhaul: Split plug renamed to SplitTextFile, XMLPlug renamed …
|
|
|
@15870
|
16 years |
kjdon |
plugin overhaul: ArchivesInf and Directory plugins are not true …
|
|
|
@15869
|
16 years |
kjdon |
plugin overhaul: BasPlug has been split into several base plugins: …
|
|
|
@15868
|
16 years |
kjdon |
plugin overhaul: BasPlug has been split into several base plugins: …
|
|
|
@15867
|
16 years |
kjdon |
plugin overhaul: automatic metadata extraction moved out of BasPlug …
|
|
|
@15866
|
16 years |
kjdon |
plugin overhaul: Image conversion stuff moved to this helper plugin, …
|
|
|
@15865
|
16 years |
kjdon |
renaming plugins in preparation for my plugin overhaul
|
|
|
@15864
|
16 years |
kjdon |
renaming plugins in preparation for my plugin overhaul
|
|
|
@15845
|
16 years |
ak19 |
Corrected error I made in regex expression used to call …
|
|
|
@15843
|
16 years |
ak19 |
The file URL added to doc.xml as Image and Source metadata is first …
|
|
|
@15841
|
16 years |
ak19 |
Filename metadata is turned into utf8 and then added to the document …
|
|
|
@15838
|
16 years |
ak19 |
Updated the regular expression on img src link to make sure that …
|
|
|
@15611
|
16 years |
ak19 |
Added a comment to a regex to explain what it does
|
|
|
@15607
|
16 years |
ak19 |
Dr Bainbridge corrected call to encodings::encodings (previously …
|
|
|
@15446
|
16 years |
davidb |
Fixed incorrect variable name. ::encoding needed an 's' at the end.
|
|
|
@15212
|
16 years |
kjdon |
for the greenstone archive collection, we are now using monthly …
|
|
|
@15182
|
16 years |
kjdon |
removed an unnecessary comment
|
|
|
@15179
|
16 years |
kjdon |
needed to add extra_metadata() call in close_document so that can get …
|
|
|
@15178
|
16 years |
kjdon |
needed to add extra_metadata() call in xml_end_tag so that can get …
|
|
|