|
|
@11090
|
18 years |
kjdon |
made all plugins that implement read() call read_block to check …
|
|
|
@11089
|
18 years |
kjdon |
removed a couple of unnecessary bits of code like repeated arguments, …
|
|
|
@11072
|
18 years |
mdewsnip |
Removed this from CVS because it is a bit too specific to be in the …
|
|
|
@11071
|
18 years |
mdewsnip |
Undid my previous change. This is going to be removed from CVS and put …
|
|
|
@11069
|
18 years |
mdewsnip |
Added an option to use Kea 4.0 -- this isn't included with Greenstone, …
|
|
|
@11044
|
18 years |
mdewsnip |
The "-extract_keyphrase" and "-extract_keyphrase_options" arguments …
|
|
|
@11043
|
18 years |
mdewsnip |
No idea what this plugin does or is for, but it shouldn't be blocking …
|
|
|
@11008
|
18 years |
mdewsnip |
Added an option to run the "fribidi" Unicode Bidirectional Algorithm …
|
|
|
@10997
|
18 years |
kjdon |
new OpenDocument plugin written by Reuben Evans as a 517 project
|
|
|
@10994
|
18 years |
kjdon |
commented out a line which was incrementing self->num_not_processed. …
|
|
|
@10985
|
18 years |
kjdon |
added a new option srcicon - can specify a different icon instead of …
|
|
|
@10978
|
18 years |
kjdon |
added assoc_field option to NULPlug
|
|
|
@10956
|
18 years |
jrm21 |
now catch and exit if we got an error while parsing/evaling any …
|
|
|
@10923
|
18 years |
jrm21 |
1) allow a 2nd sql query for 'priming' the db
2) add a space when …
|
|
|
@10890
|
18 years |
kjdon |
changed convert_to default to auto (was html), got rid of findType …
|
|
|
@10889
|
18 years |
kjdon |
added a description to metadata_fields arg, also retabbed the argument list
|
|
|
@10888
|
18 years |
kjdon |
PS can't convert to html, but the default for convert_to was html. so …
|
|
|
@10839
|
18 years |
jrm21 |
better match when looking at sub-part types so we don't match …
|
|
|
@10835
|
18 years |
kjdon |
made the -input_encoding=utf8 always be set for htmlplug secondary plugin
|
|
|
@10834
|
18 years |
jrm21 |
moved utf8 checking code into separate function. (maybe it should be …
|
|
|
@10833
|
18 years |
jrm21 |
store the names of files we've already checked when looking for a …
|
|
|
@10827
|
18 years |
jrm21 |
1) include %xx bits when making hrefs out of urls
2) test if text is …
|
|
|
@10769
|
19 years |
mdewsnip |
When processing Word documents in an 8-bit encoding wvWare would …
|
|
|
@10725
|
19 years |
chi |
For some reason, to change the date format to "yyyymmdd" used "date" …
|
|
|
@10724
|
19 years |
chi |
Add an option-metadata_fields to allow user-specified metadata fields …
|
|
|
@10723
|
19 years |
chi |
Change the option of extracted_word_metadata_fields to metadata_fields.
|
|
|
@10620
|
19 years |
kjdon |
now prints out some gli tags when bad args are encountered for plugins …
|
|
|
@10613
|
19 years |
kjdon |
modified the item file metadata regex so that space is allowed (and …
|
|
|
@10609
|
19 years |
kjdon |
if convert doesn't work, should return -1 (tried and failed) not 0 …
|
|
|
@10606
|
19 years |
kjdon |
I hadn't actually tested the last fix, so this is the correct fix
|
|
|
@10605
|
19 years |
kjdon |
make pagedimgplug simple format version use the OID_type option
|
|
|
@10600
|
19 years |
chi |
modifications to deal with document title (as the first H1 heading) …
|
|
|
@10595
|
19 years |
chi |
Modification of level header regular expression.
|
|
|
@10594
|
19 years |
kjdon |
mime type added as MimeType metadata
|
|
|
@10592
|
19 years |
kjdon |
in read, call title_fallback to make sure that we have a title - pdf …
|
|
|
@10582
|
19 years |
kjdon |
added in cover image handling into read()
|
|
|
@10580
|
19 years |
kjdon |
if created from pluginfo.pl (self->info_only == 1) then don't load up …
|
|
|
@10579
|
19 years |
kjdon |
copied classify.pm and BasClas.pm, added -gsdlinfo flag - if this is …
|
|
|
@10549
|
19 years |
chi |
Modifications to deal with the "dc value" without qualifier.
|
|
|
@10537
|
19 years |
chi |
Set up the auto conversion type of PSPlug to text.
|
|
|
@10536
|
19 years |
chi |
Modification of adding pagedimg types of conversion for PS documents. …
|
|
|
@10514
|
19 years |
kjdon |
added in description_tags option, as it wasn't valid cos no longer …
|
|
|
@10513
|
19 years |
mdewsnip |
Absolute image tags, like <img src="/image.gif"> were being …
|
|
|
@10504
|
19 years |
kjdon |
fixed a bug with -convert_to auto handling
|
|
|
@10503
|
19 years |
kjdon |
added some handling of auto convert to type when windows scripting is on
|
|
|
@10501
|
19 years |
kjdon |
had to add set_keepold to these cos they are loaded like plugins but …
|
|
|
@10496
|
19 years |
kjdon |
added some sanity checks, renamed the checkout_toc option to delete_toc
|
|
|
@10491
|
19 years |
kjdon |
fixed a typo
|
|
|
@10478
|
19 years |
kjdon |
arcPlug now knows about keepold, and if it's not set, it won't try to do …
|
|
|
@10466
|
19 years |
chi |
convert_to pagedimg_(gif|jpg|png) will only be shown in the PPTPlug …
|
|
|
@10465
|
19 years |
chi |
Add convert_post_process() to handle some encoding problems for now.
|
|
|
@10463
|
19 years |
mdewsnip |
Removing the collection "tmp" directory is now only done when …
|
|
|
@10453
|
19 years |
kjdon |
fixed up some mistakes from previous merging of davids new code and …
|
|
|
@10452
|
19 years |
kjdon |
added in allowimagesonly option for use with convert_to html (thanks …
|
|
|
@10450
|
19 years |
kjdon |
changed from a dos file to unix file (no hat Ms)
|
|
|
@10446
|
19 years |
chi |
Modifications for converting windows-1252 to windows_1252.
|
|
|
@10443
|
19 years |
chi |
Modifications to check different StructuredHTML formatting conditions.
|
|
|
@10442
|
19 years |
chi |
To retrieve encoding information for the HTML file generated from …
|
|
|
@10441
|
19 years |
chi |
Modifications for pushing required option and argument lists to …
|
|
|
@10434
|
19 years |
chi |
Tidy up the item file to convert the "&" sign in the metadata to "&amp;".
|
|
|
@10430
|
19 years |
chi |
Allow removal of the soft_link.
|
|
|
@10429
|
19 years |
chi |
Modification of the way passing argument and option lists for the …
|
|
|
@10428
|
19 years |
chi |
Modification of the way passing argument and option lists for the …
|
|
|
@10427
|
19 years |
chi |
Modification of the way passing argument and option list for the …
|
|
|
@10426
|
19 years |
chi |
Add an option -extracted_word_metadata to extract metadata based on …
|
|
|
@10425
|
19 years |
chi |
Modification of the way passing argument and option lists for the …
|
|
|
@10424
|
19 years |
chi |
Modification of the way passing argument and options list for the …
|
|
|
@10423
|
19 years |
chi |
Modify the structure of pushing argument and option lists to secondary …
|
|
|
@10419
|
19 years |
kjdon |
is_incremental renamed is_incremental_capable
|
|
|
@10406
|
19 years |
chi |
If the -windows_scripting is on in WordPlug, the secondary plugin will …
|
|
|
@10405
|
19 years |
chi |
Adding structured HTML formatting arguments here.
|
|
|
@10404
|
19 years |
chi |
remove the plugin arguments to WordPlug.
|
|
|
@10403
|
19 years |
chi |
Modifications for tidying up the item file generated through pdftoimg.pl.
|
|
|
@10395
|
19 years |
mdewsnip |
A plugin for RealMedia files. By Xin Gao for the 517 Digital Libraries …
|
|
|
@10356
|
19 years |
chi |
tidy up the code.
|
|
|
@10355
|
19 years |
chi |
Remove heading_title options to StructuredHTMLPlug.
|
|
|
@10354
|
19 years |
chi |
Add an argument "title_sub" here in PagedImgPlug
|
|
|
@10353
|
19 years |
chi |
Modification for allowing PDF document being converted to various …
|
|
|
@10352
|
19 years |
chi |
Change the pagedimg_png,jpg,gif (hyphen to underscore) setting in …
|
|
|
@10347
|
19 years |
kjdon |
removed the unneeded 'use parsargv'
|
|
|
@10344
|
19 years |
kjdon |
if there was no Title, add PageNum as a Title
|
|
|
@10329
|
19 years |
mdewsnip |
Changed the default_language string to be of type "string", since …
|
|
|
@10313
|
19 years |
mdewsnip |
Fixed undefined doc_oid variable problem.
|
|
|
@10305
|
19 years |
davidb |
newly introduced is_incremental() used to help determine if file needs …
|
|
|
@10280
|
19 years |
chi |
Some major changes to allow secondary plugin setting.
|
|
|
@10279
|
19 years |
chi |
A modification to allow a secondary plugin setting through ConvertToPlug
|
|
|
@10278
|
19 years |
chi |
A major modification to allow a secondary-plugin setting. With this …
|
|
|
@10277
|
19 years |
chi |
tidy up the filename in add_file().
|
|
|
@10276
|
19 years |
chi |
Add a read_into_doc_obj() for enabling secondary_plugin function. …
|
|
|
@10275
|
19 years |
chi |
A modification to allow a secondary plugin setting through ConvertToPlug
|
|
|
@10274
|
19 years |
chi |
A modification to allow a secondary plug setting through ConvertToPlug.
|
|
|
@10273
|
19 years |
chi |
A modification to allow a secondary-plugin setting through ConvertToPlug.
|
|
|
@10272
|
19 years |
chi |
A modification to allow a secondary-plugin setting.
|
|
|
@10271
|
19 years |
chi |
A new program to demonstrate HTML document (converted from other …
|
|
|
@10270
|
19 years |
chi |
The modification to allow the secondary-plugin setting.
|
|
|
@10254
|
19 years |
kjdon |
added 'use strict' to all plugins, and made modifications (mostly …
|
|
|
@10229
|
19 years |
kjdon |
fixed up some stuff for printing args (pluginfo.pl, classinfo.pl)
|
|
|
@10218
|
19 years |
kjdon |
Jeffrey's new parsing modifications, committed approx 6 July, 15.16
|
|
|
@10170
|
19 years |
kjdon |
made our, added two parse methods - if you want to do xml parsing …
|
|
|
@10168
|
19 years |
kjdon |
modified this to use a new xml format. it should work as before on the …
|
|
|