|
|
@4894
|
21 years |
mdewsnip |
Added a missing ']' that was causing problems. Thanks to Ben Dwyer for …
|
|
|
@4873
|
21 years |
mdewsnip |
Further work on standardising option descriptions. Specifically, in …
|
|
|
@4845
|
21 years |
jrm21 |
use add_metadata instead of add_utf8_metadata for Source and URL …
|
|
|
@4844
|
21 years |
jrm21 |
database plugin doesn't take the "title_sub" option.
|
|
|
@4843
|
21 years |
mdewsnip |
Added check to ConvertToRogPlug creation so that 'pluginfo.pl …
|
|
|
@4842
|
21 years |
mdewsnip |
Added check when creating a ConvertToPlug object so that 'pluginfo.pl …
|
|
|
@4821
|
21 years |
jrm21 |
corrected extract_first_NNNN function so that it doesn't get confused …
|
|
|
@4792
|
21 years |
davidb |
Modified so BibTeX records with no key processed correctly.
|
|
|
@4791
|
21 years |
davidb |
Modified so -input_encoding flag used.
|
|
|
@4790
|
21 years |
davidb |
Addition of 'quotemeta' to protect directory separate under Windows …
|
|
|
@4785
|
21 years |
mdewsnip |
Commented out print_usage functions - plugins should now call …
|
|
|
@4778
|
21 years |
mdewsnip |
Modified the code for generating the usage texts to use the methods in …
|
|
|
@4764
|
21 years |
mdewsnip |
Replaced call to removed function print_generic_usage() with a call to …
|
|
|
@4750
|
21 years |
mdewsnip |
Improved formatting of usage texts automatically generated from John's …
|
|
|
@4748
|
21 years |
mdewsnip |
Changed "metadatum" type to "metadata".
|
|
|
@4747
|
21 years |
mdewsnip |
Added $options structure for storing plugin description.
|
|
|
@4746
|
21 years |
mdewsnip |
Initial attempt at a generic print usage function which works with the …
|
|
|
@4745
|
21 years |
mdewsnip |
Uncommented a line which shouldn't have been committed commented.
|
|
|
@4744
|
21 years |
mdewsnip |
Tidied up and structures (representing the options of the plugin) in …
|
|
|
@4726
|
21 years |
davidb |
Initial version of OAI plugin for parsing records downloaded from
an …
|
|
|
@4724
|
21 years |
davidb |
ImagePlug now stores metadata for srcicon, thumbicon and screenicon
to …
|
|
|
@4429
|
21 years |
jrm21 |
new plugin for importing data from perl's DBI database interface - eg …
|
|
|
@4224
|
21 years |
jrm21 |
fixed regexp for when we have a content type without a charset
|
|
|
@4103
|
21 years |
sjboddie |
Added a -nohidden PDFPlug option and made it pass the -hidden option …
|
|
|
@4089
|
21 years |
jrm21 |
added "\n" to headers as we weren't picking up messages that were only …
|
|
|
@3932
|
21 years |
jrm21 |
need to escape _ chars.
|
|
|
@3919
|
21 years |
jrm21 |
tidy and fix reg-exps when looking for #includes... it got stuck in a …
|
|
|
@3856
|
21 years |
davidb |
General improvement to the translator facility.
|
|
|
@3834
|
21 years |
sjboddie |
Prevent "use bytes" from causing errors for older perls
|
|
|
@3833
|
21 years |
jrm21 |
fixed up parsing the use_sections argument.
|
|
|
@3767
|
21 years |
sjboddie |
Scattered some "use bytes" pragmas around to try to prevent perl-5.8 …
|
|
|
@3737
|
21 years |
davidb |
Used to support music-centent based collections
|
|
|
@3732
|
21 years |
jrm21 |
need to escape ",", "<", and ">" in title metadata
|
|
|
@3731
|
21 years |
jrm21 |
If textcat returns too many possibilities, use the default language …
|
|
|
@3726
|
21 years |
jrm21 |
minor fix for "_" chars in urls... escape them after, not before.
…
|
|
|
@3724
|
21 years |
kde2 |
Submission of Interface Translation Agency
|
|
|
@3721
|
21 years |
jrm21 |
bug where some text/plain messages weren't having < > & properly …
|
|
|
@3720
|
21 years |
sjboddie |
Added options to PDFPlug to take advantage of the improvements in …
|
|
|
@3708
|
21 years |
sjboddie |
Fixed a bug where HTMLPlug failed to associate files whose filenames …
|
|
|
@3630
|
22 years |
jrm21 |
1) Correct typo in print_usage(): process_exp -> split_exp
2) Fixed …
|
|
|
@3629
|
22 years |
jrm21 |
need to look for associated files in the assocfilepath, if this …
|
|
|
@3627
|
22 years |
jrm21 |
added less-obfuscated quote-printable parsing in qp_decode()
|
|
|
@3614
|
22 years |
jrm21 |
modified section-handling stuff to work with output from v.0.34 of …
|
|
|
@3590
|
22 years |
jrm21 |
modified the split regular expression so it works with newer versions …
|
|
|
@3587
|
22 years |
jrm21 |
removed comments about storing "BibTex" metadata as we don't do that …
|
|
|
@3542
|
22 years |
jrm21 |
ghtml returns utf8, not iso-8859-1, so any html entities were being …
|
|
|
@3540
|
22 years |
kjdon |
added John T's changes into CVS - added info to enable retrieval of …
|
|
|
@3539
|
22 years |
kjdon |
added jpe to the process and block expressions
|
|
|
@3537
|
22 years |
jrm21 |
if process() returns undef, then the plugin couldn't process that …
|
|
|
@3524
|
22 years |
kjdon |
added the help message for the previous change
|
|
|
@3523
|
22 years |
kjdon |
now EMAILplug accepts the split_exp option - a regular expression that …
|
|
|
@3517
|
22 years |
davidb |
ImagePlug modified so 'Source' metadata set to be consistent with …
|
|
|
@3515
|
22 years |
jrm21 |
call a plugin's set_OID() method if one exists, otherwise use the …
|
|
|
@3508
|
22 years |
jrm21 |
modified copyright statement
|
|
|
@3430
|
22 years |
jrm21 |
Added MARCPlug, mostly done by David Bainbridge. It needs a …
|
|
|
@3427
|
22 years |
sjboddie |
The input encoding will now default to utf8 instead of iso-8859-1. …
|
|
|
@3426
|
22 years |
jrm21 |
Don't add \n to the end of each metadata value.
|
|
|
@3414
|
22 years |
jrm21 |
Need to escape "_" characters so that greenstone doesn't interprete them…
|
|
|
@3411
|
22 years |
jrm21 |
Now takes a "-use_sections" option to make a section per page.
|
|
|
@3400
|
22 years |
sjboddie |
WordPlug now handles .dot files as well as .doc files.
|
|
|
@3398
|
22 years |
jrm21 |
Oops... the last change to the regex was too permissive... fixed up to …
|
|
|
@3397
|
22 years |
jrm21 |
minor change to the regex for marking up urls (to allow #anchor at the end)
|
|
|
@3369
|
22 years |
sjboddie |
HTMLPlug will no longer prevent metadata extraction when the …
|
|
|
@3352
|
22 years |
jrm21 |
We can now properly handle messages with a content type of …
|
|
|
@3351
|
22 years |
jrm21 |
If a message is in an unsupported encoding, we assume iso8859-1. …
|
|
|
@3350
|
22 years |
sjboddie |
Added -use_strings option to ConvertToPlug. The default behaviour for …
|
|
|
@3349
|
22 years |
sjboddie |
Bug fix.
|
|
|
@3329
|
22 years |
jrm21 |
Oops, removed debugging statement!
|
|
|
@3328
|
22 years |
jrm21 |
Make sure that sender's name is more than 0 chars long, otherwise use …
|
|
|
@3307
|
22 years |
davidb |
Some minor modifications to Image Plugin: filenames can now
include …
|
|
|
@3249
|
22 years |
jrm21 |
1) add a space when joining consecutive lines, just in case.
2) Don't …
|
|
|
@3248
|
22 years |
jrm21 |
If we convert to HTML, we post-process to change named entities (eg …
|
|
|
@3247
|
22 years |
jrm21 |
Modified automatic title extraction to also recognise utf-8 nbsp as …
|
|
|
@3215
|
22 years |
jrm21 |
Fixed up some regexs for mime header encodings - eg people with …
|
|
|
@3206
|
22 years |
jrm21 |
Oops! Bad things were happening when the headers said utf-8 encoding, …
|
|
|
@3196
|
22 years |
sjboddie |
Added to the list of entities that HTMLPlug doesn't convert to utf-8
|
|
|
@3181
|
22 years |
sjboddie |
Altered the getcharequiv() function so it now converts entities to raw …
|
|
|
@3156
|
22 years |
jrm21 |
Added a few extra accented characters, and recognise some …
|
|
|
@3148
|
22 years |
jrm21 |
If a document has associated files that are also given a subdirectory, …
|
|
|
@3143
|
22 years |
jrm21 |
Minor tweak for badly formatted dates. We now use a window, so …
|
|
|
@3142
|
22 years |
jrm21 |
1) We can't use "Date" for the year metadata, as greenstone assumes …
|
|
|
@3137
|
22 years |
paynter |
Changed the way Width, Height, Size and Type metadata is calculated. …
|
|
|
@3136
|
22 years |
paynter |
Reconciled John's version of my changes to EMAILPlug with my version …
|
|
|
@3135
|
22 years |
jrm21 |
modified process_exp to process php3 -named files too.
|
|
|
@3134
|
22 years |
jrm21 |
1) Convert headers to detected charset if possible.
2) Convert header …
|
|
|
@3132
|
22 years |
jrm21 |
Try to determine the encoding used in the headers in case it is not …
|
|
|
@3116
|
22 years |
sjboddie |
RecPlug will now die with an error if it finds a metadata.xml file …
|
|
|
@3112
|
22 years |
jrm21 |
minor changes to formatted values (eg if enclosed in { and } ) and …
|
|
|
@3111
|
22 years |
jrm21 |
Allow .eml extension (IE and mozilla default to this for individual …
|
|
|
@3108
|
22 years |
jrm21 |
Don't recursive into directories if they are symbolic links and point …
|
|
|
@3107
|
22 years |
jrm21 |
fixed problem where documents after a "bad" document would not be
read …
|
|
|
@3094
|
22 years |
jrm21 |
Needed to add failhandle to the init() function, to pass to BasPlug.
|
|
|
@3086
|
22 years |
nzdl |
* empty log message *
|
|
|
@3073
|
22 years |
jrm21 |
1) Default Title now correctly escapes [ and ] chars.
2) …
|
|
|
@3038
|
22 years |
jrm21 |
Put \" \" around href for srclink, in case the collection name has …
|
|
|
@3037
|
22 years |
jrm21 |
title_sub seems to always get defined by parsargv, so we test that it …
|
|
|
@3019
|
22 years |
jrm21 |
Fixes for when on windows - it was having a lot of trouble sorting out …
|
|
|
@2996
|
22 years |
sjboddie |
* empty log message *
|
|
|
@2995
|
22 years |
sjboddie |
Fixed a bug preventing HTML headers from being removed correctly when …
|
|
|
@2990
|
22 years |
jrm21 |
Do MS Excel using ConvertToPlug, which currently uses the xlhtml package.
|
|
|