|
|
@23753
|
13 years |
davidb |
Tidied up on 'book-keeping' information used by this plugin, now that …
|
|
|
@23752
|
13 years |
davidb |
Missing @_ parameter needs to be passed on from AutoLoadConverter …
|
|
|
@23751
|
13 years |
davidb |
Missing @_ parameter needs to be passed in to AutoLoadConverter …
|
|
|
@23564
|
13 years |
kjdon |
added a bit more comment about store_block_files
|
|
|
@23561
|
13 years |
kjdon |
moved the block_filename from BasePlugin into util. then I don't need …
|
|
|
@23544
|
13 years |
kjdon |
on windows, if have a .JPG cover image, then a -e xxx.jpg test works, …
|
|
|
@23484
|
13 years |
ak19 |
Further improvements by Dr Bainbridge to pretty-printing.
|
|
|
@23472
|
13 years |
ak19 |
Erroneous forth argument (a filename) left over from an earlier time, …
|
|
|
@23465
|
13 years |
ak19 |
Dr Bainbridge fixed the change to gs.filenameEncoding (previously: …
|
|
|
@23463
|
13 years |
ak19 |
Previously, when reverting back to an earlier RE match for …
|
|
|
@23461
|
13 years |
kjdon |
set_Source_metadata now takes an optional section argument so that we …
|
|
|
@23460
|
13 years |
kjdon |
pass in the section to set_Source_metadata as we may be processing a …
|
|
|
@23458
|
13 years |
kjdon |
added a check that deduced_filename_encoding is defined before testing …
|
|
|
@23457
|
13 years |
kjdon |
reindented the file in emacs
|
|
|
@23452
|
13 years |
kjdon |
use filename_cat when generating the full path for blocking. If teh …
|
|
|
@23419
|
13 years |
max |
Setting the values to store as block files is now done through an API …
|
|
|
@23418
|
13 years |
davidb |
A few further additions to help windows keep track of c\... and C:\... …
|
|
|
@23415
|
13 years |
davidb |
More careful handling of filenames going into 'block' hash. On …
|
|
|
@23392
|
13 years |
sjm84 |
Reverted a regular expression designed to locate various tags inside …
|
|
|
@23387
|
13 years |
davidb |
Further changes to deal with documents that use different filename …
|
|
|
@23377
|
13 years |
ak19 |
Perl syntax error fixed: referring to uninitialised variable metadata …
|
|
|
@23371
|
13 years |
davidb |
Further refinement of code to support HTML linking between documents …
|
|
|
@23364
|
13 years |
sjm84 |
C and Posix added to the possible locales as well as removing the …
|
|
|
@23363
|
13 years |
davidb |
Plugin code upgrade to support Greenstone working with filenames under …
|
|
|
@23355
|
13 years |
kjdon |
use unicode for mp3 data. patch thanks to Dan Wright
|
|
|
@23353
|
13 years |
davidb |
Modifications to code to support filename encoding issues when tested …
|
|
|
@23352
|
13 years |
davidb |
Modifications to code to support filename encoding issues when tested …
|
|
|
@23349
|
13 years |
davidb |
More careful use of encoding parameter to $self->set_Source_metadata …
|
|
|
@23348
|
13 years |
davidb |
Added extra parameter to call to deduce_filename_encoding()
|
|
|
@23347
|
13 years |
davidb |
Tidy up of debugging statements for handline filename encodings, plus …
|
|
|
@23335
|
13 years |
davidb |
Work done on improving handing of filenames when the actualy filename …
|
|
|
@23280
|
14 years |
kjdon |
fixed this plugin up for incremental import. need to set …
|
|
|
@23279
|
14 years |
kjdon |
in extra_metadata, new special case for gsdlzipfilename metadata - if …
|
|
|
@23277
|
14 years |
kjdon |
removed a commented out line
|
|
|
@23261
|
14 years |
kjdon |
ZIPPlugin needs to do a block pass on the extracted folder so we don't …
|
|
|
@23248
|
14 years |
ak19 |
Bugfix: file called mimetype (among the files extracted from an Open …
|
|
|
@23212
|
14 years |
kjdon |
metadata_read no longer takes maxdocs args - metadata_read must …
|
|
|
@23171
|
14 years |
kjdon |
if infodbtype is gdbm-txtgz, we need to use gdbm for all archives dbs
|
|
|
@23167
|
14 years |
davidb |
GreenstoneXMLPlugin used to (or at least in theory used to) to be able …
|
|
|
@22953
|
14 years |
davidb |
Further code tweaks to correctly support Unicode aware strings in our …
|
|
|
@22951
|
14 years |
davidb |
Encode::decode cannot be applied to all characters returned by …
|
|
|
@22900
|
14 years |
kjdon |
getting this to work properly
|
|
|
@22896
|
14 years |
kjdon |
fixed an odd bug. If had a metadata file directly in import folder, …
|
|
|
@22894
|
14 years |
kjdon |
added wpd (word perfect) extension into the list that can be processed …
|
|
|
@22887
|
14 years |
kjdon |
use new util::get_timestamped_dir, and clean_up_after_doc_processing …
|
|
|
@22882
|
14 years |
kjdon |
set up convert_to list for the case when windows_scripting and …
|
|
|
@22880
|
14 years |
kjdon |
implemented the read method for when using open office to convert to …
|
|
|
@22879
|
14 years |
kjdon |
now have an html_multi option to convert_to (for PowerPointPlugin)
|
|
|
@22874
|
14 years |
kjdon |
no longer use filename_extension, as we should be using the original …
|
|
|
@22871
|
14 years |
kjdon |
added code to generate an item file if asked for pagedimg output with …
|
|
|
@22865
|
14 years |
kjdon |
forgot to set openoffice_available so that get_default_process_exp works
|
|
|
@22864
|
14 years |
kjdon |
needed use ConvertBinaryFile
|
|
|
@22862
|
14 years |
kjdon |
changed a comment
|
|
|
@22861
|
14 years |
kjdon |
now uses new AutoLoadConverters instead of AutoloadConverterScripting. …
|
|
|
@22860
|
14 years |
kjdon |
changed a line
|
|
|
@22859
|
14 years |
kjdon |
this plugin inherits from others
|
|
|
@22858
|
14 years |
kjdon |
I have written a new version of AutoloadConverterScripting, called …
|
|
|
@22857
|
14 years |
davidb |
Further adjustments to our reading in of text files/data to be Unicode …
|
|
|
@22853
|
14 years |
kjdon |
print parse errors to failhandle and GLI xml as well as to outhandle
|
|
|
@22852
|
14 years |
kjdon |
now prints errors to outhandle, failhandle and gli xml instead of just …
|
|
|
@22844
|
14 years |
davidb |
More explicit use of utf8 for input and output file handling. Relies …
|
|
|
@22842
|
14 years |
davidb |
Minor tidy up of code
|
|
|
@22841
|
14 years |
davidb |
More explicit use of utf8 for input and output file handling. Relies …
|
|
|
@22840
|
14 years |
davidb |
More explicit use of utf8 for input and output file handling. Relies …
|
|
|
@22814
|
14 years |
kjdon |
removes tidy_item_file from store_block_files as it makes the file new …
|
|
|
@22709
|
14 years |
davidb |
Fixed up -process_exp so it now dynamically configures itself …
|
|
|
@22705
|
14 years |
davidb |
User of AutoloadConverterScripting expanded to encompass PowerPoint …
|
|
|
@22702
|
14 years |
davidb |
Introduction of new plugin AutoloadConverterScripting to replace …
|
|
|
@22689
|
14 years |
mdewsnip |
Trac ticket #634: change so "ftp://" is used instead of "http://" in …
|
|
|
@22675
|
14 years |
sjm84 |
Modified PDFPlugin to use PDFBox if it is available
|
|
|
@22674
|
14 years |
sjm84 |
Added a version of ConvertBinaryFile for PDFBox
|
|
|
@22673
|
14 years |
sjm84 |
Dr. Bainbridge added a begin method to OOConvertBinaryFile
|
|
|
@22666
|
14 years |
davidb |
Commented out debugging statement
|
|
|
@22664
|
14 years |
mdewsnip |
Minor comment change.
|
|
|
@22663
|
14 years |
mdewsnip |
Changed "srclink_file" metadata to always contain the filename, …
|
|
|
@22658
|
14 years |
mdewsnip |
Changed "srcicon" values in ImageConverter.pm and ImagePlugin.pm to …
|
|
|
@22656
|
14 years |
mdewsnip |
Changed to add "srclink_file" metadata instead of the deprecated …
|
|
|
@22655
|
14 years |
mdewsnip |
Removed some old (commented out) "[srclink]" code, as part of tidying …
|
|
|
@22654
|
14 years |
mdewsnip |
Removed some old (commented out) "[srclink]" code, as part of tidying …
|
|
|
@22652
|
14 years |
mdewsnip |
Removed call to ghtml::dmsafe() from …
|
|
|
@22641
|
14 years |
kjdon |
now inherits from OOConvertBinaryFile. still a couple of things to iron out
|
|
|
@22640
|
14 years |
kjdon |
now uses new OOConvertBInaryFile super class
|
|
|
@22639
|
14 years |
kjdon |
now uses new OOConvertBinaryFile as super class
|
|
|
@22638
|
14 years |
kjdon |
new ConvertBinaryFile plugin that will include OpenOfficeConverter if …
|
|
|
@22636
|
14 years |
davidb |
Using -utf8 as options to html-tidy leads to wrong encoding for HTML …
|
|
|
@22632
|
14 years |
mdewsnip |
Changed "use textcat" to a "require textcat", so it is only loaded if …
|
|
|
@22612
|
14 years |
kjdon |
made the default process exp a bit nicer to read
|
|
|
@22611
|
14 years |
kjdon |
now uses OpenOfficeConverter that is not ConvertBinaryFile
|
|
|
@22597
|
14 years |
kjdon |
code tidy up. rearranged how convertbinaryfile plugins set up their …
|
|
|
@22594
|
14 years |
kjdon |
committing for davidb. pipe output of tidy to /dev/null if verbosity is low
|
|
|
@22565
|
14 years |
kjdon |
removed block exp. now it scans the item file to work out which files …
|
|
|
@22552
|
14 years |
kjdon |
by default we want this to process all files, so changed default …
|
|
|
@22515
|
14 years |
kjdon |
added open office support for PowerPoint and Excel plugins. followed …
|
|
|
@22514
|
14 years |
kjdon |
small tidyings
|
|
|
@22507
|
14 years |
kjdon |
some moving around and tidying up of code
|
|
|
@22505
|
14 years |
kjdon |
added the openoffice_scripting arg here instead of in OpenOfficeConverter
|
|
|
@22504
|
14 years |
kjdon |
for WordPLugin, if openoffice_scripting is set, need to use …
|
|
|
@22503
|
14 years |
kjdon |
StructuredHTMLPlugin needs -description tags if office_scripting is …
|
|
|
@22462
|
14 years |
ak19 |
Commented out useful (but no needed in svn commited version) print …
|
|
|
@22451
|
14 years |
kjdon |
added metadata_field_separator option. if set eg to ;, will split all …
|
|
|