|
|
@23277
|
14 years |
kjdon |
removed a commented out line
|
|
|
@23261
|
14 years |
kjdon |
ZIPPlugin needs to do a block pass on the extracted folder so we don't …
|
|
|
@23248
|
14 years |
ak19 |
Bugfix: file called mimetype (among the files extracted from an Open …
|
|
|
@23212
|
14 years |
kjdon |
metadata_read no longer takes maxdocs args - metadata_read must …
|
|
|
@23171
|
14 years |
kjdon |
if infodbtype is gdbm-txtgz, we need to use gdbm for all archives dbs
|
|
|
@23167
|
14 years |
davidb |
GreenstoneXMLPlugin used to (or at least in theory used to) to be able …
|
|
|
@22953
|
14 years |
davidb |
Further code tweaks to correctly support Unicode aware strings in our …
|
|
|
@22951
|
14 years |
davidb |
Encode::decode cannot be applied to all characters returned by …
|
|
|
@22900
|
14 years |
kjdon |
getting this to work properly
|
|
|
@22896
|
14 years |
kjdon |
fixed an odd bug. If had a metadata file directly in import folder, …
|
|
|
@22894
|
14 years |
kjdon |
added wpd (word perfect) extension into the list that can be processed …
|
|
|
@22887
|
14 years |
kjdon |
use new util::get_timestamped_dir, and clean_up_after_doc_processing …
|
|
|
@22882
|
14 years |
kjdon |
set up convert_to list for the case when windows_scripting and …
|
|
|
@22880
|
14 years |
kjdon |
implemented the read method for when using open office to convert to …
|
|
|
@22879
|
14 years |
kjdon |
now have an html_multi option to convert_to (for PowerPointPlugin)
|
|
|
@22874
|
14 years |
kjdon |
no longer use filename_extension, as we should be using the original …
|
|
|
@22871
|
14 years |
kjdon |
added code to generate an item file if asked for pagedimg output with …
|
|
|
@22865
|
14 years |
kjdon |
forgot to set openoffice_available so that get_default_process_exp works
|
|
|
@22864
|
14 years |
kjdon |
needed use ConvertBinaryFile
|
|
|
@22862
|
14 years |
kjdon |
changed a comment
|
|
|
@22861
|
14 years |
kjdon |
now uses new AutoLoadConverters instead of AutoloadConverterScripting. …
|
|
|
@22860
|
14 years |
kjdon |
changed a line
|
|
|
@22859
|
14 years |
kjdon |
this plugin inherits from others
|
|
|
@22858
|
14 years |
kjdon |
I have written a new version of AutoloadConverterScripting, called …
|
|
|
@22857
|
14 years |
davidb |
Further adjustments to our reading in of text files/data to be Unicode …
|
|
|
@22853
|
14 years |
kjdon |
print parse errors to failhandle and GLI xml as well as to outhandle
|
|
|
@22852
|
14 years |
kjdon |
now prints errors to outhandle, failhandle and gli xml instead of just …
|
|
|
@22844
|
14 years |
davidb |
More explicit use of utf8 for input and output file handling. Relies …
|
|
|
@22842
|
14 years |
davidb |
Minor tidy up of code
|
|
|
@22841
|
14 years |
davidb |
More explicit use of utf8 for input and output file handling. Relies …
|
|
|
@22840
|
14 years |
davidb |
More explicit use of utf8 for input and output file handling. Relies …
|
|
|
@22814
|
14 years |
kjdon |
removes tidy_item_file from store_block_files as it makes the file new …
|
|
|
@22709
|
14 years |
davidb |
Fixed up -process_exp so it now dynamically configures itself …
|
|
|
@22705
|
14 years |
davidb |
User of AutoloadConverterScripting expanded to encompass PowerPoint …
|
|
|
@22702
|
14 years |
davidb |
Introduction of new plugin AutoloadConverterScripting to replace …
|
|
|
@22689
|
14 years |
mdewsnip |
Trac ticket #634: change so "ftp://" is used instead of "http://" in …
|
|
|
@22675
|
14 years |
sjm84 |
Modified PDFPlugin to use PDFBox if it is available
|
|
|
@22674
|
14 years |
sjm84 |
Added a version of ConvertBinaryFile for PDFBox
|
|
|
@22673
|
14 years |
sjm84 |
Dr. Bainbridge added a begin method to OOConvertBinaryFile
|
|
|
@22666
|
14 years |
davidb |
Commented out debugging statement
|
|
|
@22664
|
14 years |
mdewsnip |
Minor comment change.
|
|
|
@22663
|
14 years |
mdewsnip |
Changed "srclink_file" metadata to always contain the filename, …
|
|
|
@22658
|
14 years |
mdewsnip |
Changed "srcicon" values in ImageConverter.pm and ImagePlugin.pm to …
|
|
|
@22656
|
14 years |
mdewsnip |
Changed to add "srclink_file" metadata instead of the deprecated …
|
|
|
@22655
|
14 years |
mdewsnip |
Removed some old (commented out) "[srclink]" code, as part of tidying …
|
|
|
@22654
|
14 years |
mdewsnip |
Removed some old (commented out) "[srclink]" code, as part of tidying …
|
|
|
@22652
|
14 years |
mdewsnip |
Removed call to ghtml::dmsafe() from …
|
|
|
@22641
|
14 years |
kjdon |
now inherits from OOConvertBinaryFile. still a couple of things to iron out
|
|
|
@22640
|
14 years |
kjdon |
now uses new OOConvertBInaryFile super class
|
|
|
@22639
|
14 years |
kjdon |
now uses new OOConvertBinaryFile as super class
|
|
|
@22638
|
14 years |
kjdon |
new ConvertBinaryFile plugin that will include OpenOfficeConverter if …
|
|
|
@22636
|
14 years |
davidb |
Using -utf8 as options to html-tidy leads to wrong encoding for HTML …
|
|
|
@22632
|
14 years |
mdewsnip |
Changed "use textcat" to a "require textcat", so it is only loaded if …
|
|
|
@22612
|
14 years |
kjdon |
made the default process exp a bit nicer to read
|
|
|
@22611
|
14 years |
kjdon |
now uses OpenOfficeConverter that is not ConvertBinaryFile
|
|
|
@22597
|
14 years |
kjdon |
code tidy up. rearranged how convertbinaryfile plugins set up their …
|
|
|
@22594
|
14 years |
kjdon |
committing for davidb. pipe output of tidy to /dev/null if verbosity is low
|
|
|
@22565
|
14 years |
kjdon |
removed block exp. now it scans the item file to work out which files …
|
|
|
@22552
|
14 years |
kjdon |
by default we want this to process all files, so changed default …
|
|
|
@22515
|
14 years |
kjdon |
added open office support for PowerPoint and Excel plugins. followed …
|
|
|
@22514
|
14 years |
kjdon |
small tidyings
|
|
|
@22507
|
14 years |
kjdon |
some moving around and tidying up of code
|
|
|
@22505
|
14 years |
kjdon |
added the openoffice_scripting arg here instead of in OpenOfficeConverter
|
|
|
@22504
|
14 years |
kjdon |
for WordPLugin, if openoffice_scripting is set, need to use …
|
|
|
@22503
|
14 years |
kjdon |
StructuredHTMLPlugin needs -description tags if office_scripting is …
|
|
|
@22462
|
14 years |
ak19 |
Commented out useful (but no needed in svn commited version) print …
|
|
|
@22451
|
14 years |
kjdon |
added metadata_field_separator option. if set eg to ;, will split all …
|
|
|
@22450
|
14 years |
kjdon |
missing argument to autorun_general_cmd. oh the trouble that caused, …
|
|
|
@22448
|
14 years |
kjdon |
metadata values mught be array type - add each individual item as a …
|
|
|
@22431
|
14 years |
davidb |
Correction to caching technique to work with input file rather than …
|
|
|
@22428
|
14 years |
davidb |
Restructuring of WordPlugin to dynamically inherit from …
|
|
|
@22427
|
14 years |
davidb |
Adjustment of whitespace
|
|
|
@22412
|
14 years |
davidb |
More accurate comment added
|
|
|
@22401
|
14 years |
kjdon |
tidied up a bit. Removed options that (I think) will always be set the …
|
|
|
@22364
|
14 years |
ak19 |
MediainfoOGVPlugin now includes the final changes Arnaud made to his …
|
|
|
@22363
|
14 years |
ak19 |
Adding in the adjustments to the mediainfoogv plugin that were mailed …
|
|
|
@22362
|
14 years |
ak19 |
Committing Arnaud Yvan's MediainfoOGVPlugin.pm for the next release of …
|
|
|
@22355
|
14 years |
kjdon |
previously, when use_realistic_book was set, all files listed in …
|
|
|
@22351
|
14 years |
davidb |
White-space tidy up
|
|
|
@22349
|
14 years |
kjdon |
if metadata extracted from item file has a namespace, then prefix it …
|
|
|
@22348
|
14 years |
kjdon |
store any extracted metadata that has a namespace as ex.ns.meta
|
|
|
@22330
|
14 years |
kjdon |
we want to store the original file name not the tidy filename as the …
|
|
|
@22329
|
14 years |
kjdon |
changed mp3:meta to ex.id3.meta. apparently id3 isa better name for …
|
|
|
@22316
|
14 years |
kjdon |
store extracted namespaced metadata as ex.metadata, eg ex.dc.Title, …
|
|
|
@22293
|
14 years |
kjdon |
extracted metadata is now going to be added as ex.meta, then GLI will …
|
|
|
@22267
|
14 years |
kjdon |
fixed a mistake in a method name
|
|
|
@22232
|
14 years |
mdewsnip |
New OAIMetadataXMLPlugin.pm for extracting information from OAI …
|
|
|
@22215
|
14 years |
kjdon |
added store_original_file option, used for eg Text, HTML plugins to …
|
|
|
@22074
|
14 years |
kjdon |
extrametadata needs the filename with no subfolder as that is added in …
|
|
|
@21981
|
14 years |
kjdon |
fix for ticket #676. Conversion of pdf to html where two pdfs had the …
|
|
|
@21958
|
14 years |
kjdon |
ppthtml and xslhtml don't seem to output utf8, so remove the …
|
|
|
@21916
|
14 years |
kjdon |
made this work with a user specified process_exp so that your metadata …
|
|
|
@21905
|
14 years |
mdewsnip |
Changes made by Jeffrey Ke at DL Consulting Ltd to remove the global …
|
|
|
@21866
|
14 years |
kjdon |
added some code for if identify returns size in mb.
|
|
|
@21803
|
14 years |
kjdon |
set file_id to null if ID doesn't match FILE.* (previously it was …
|
|
|
@21801
|
14 years |
kjdon |
extended HTMLPlugin's metadata_field_separator option to Word and …
|
|
|
@21800
|
14 years |
kjdon |
added a new option, metadata_field_separator, which specifies what to …
|
|
|
@21764
|
14 years |
kjdon |
fixed up all my copy and paste errors. doh.
|
|
|
@21763
|
14 years |
kjdon |
don't modify document_field is info_only - doesn't appear to be …
|
|
|
@21760
|
14 years |
kjdon |
srclink now generated dynamically at runtime. instead of storing …
|
|
|