|
|
@23285
|
13 years |
sjm84 |
Moving subroutine committed previously to util.pm to unicode.pm where …
|
|
|
@23284
|
13 years |
sjm84 |
Dr Bainbridge's modification of nice-string subroutine which will …
|
|
|
@23280
|
13 years |
kjdon |
fixed this plugin up for incremental import. need to set …
|
|
|
@23279
|
13 years |
kjdon |
in extra_metadata, new special case for gsdlzipfilename metadata - if …
|
|
|
@23278
|
13 years |
kjdon |
split out the encoding filename bit from set_source_metadata so that …
|
|
|
@23277
|
13 years |
kjdon |
removed a commented out line
|
|
|
@23261
|
13 years |
kjdon |
ZIPPlugin needs to do a block pass on the extracted folder so we don't …
|
|
|
@23250
|
13 years |
ak19 |
Accidentally committed a debug version of the file
|
|
|
@23249
|
13 years |
ak19 |
A useful debug version of the rm method which got added in when Dr …
|
|
|
@23248
|
13 years |
ak19 |
Bugfix: file called mimetype (among the files extracted from an Open …
|
|
|
@23212
|
13 years |
kjdon |
metadata_read no longer takes maxdocs args - metadata_read must …
|
|
|
@23198
|
13 years |
davidb |
Extra curly-brace had found its way into the code as the very last …
|
|
|
@23193
|
13 years |
davidb |
Whitespace addition to improve formatting
|
|
|
@23182
|
13 years |
kjdon |
fixed up bug with deleting assoc files. Was fine for a delete, but for …
|
|
|
@23181
|
13 years |
sjm84 |
The end brace of the delete_assoc_files sub in the lucenebuildproc.pm …
|
|
|
@23176
|
13 years |
kjdon |
added delete_assoc_files. If we are asked to process a deleted or …
|
|
|
@23172
|
13 years |
kjdon |
when using gdbm-txtgz infodb type, the runtime system will generate …
|
|
|
@23171
|
13 years |
kjdon |
if infodbtype is gdbm-txtgz, we need to use gdbm for all archives dbs
|
|
|
@23170
|
13 years |
kjdon |
if infodbtype is gdbm-txtgz, we need to use gdbm for all archives dbs
|
|
|
@23169
|
13 years |
kjdon |
need to set binmode for pipes to get utf8 input and output. also need …
|
|
|
@23167
|
13 years |
davidb |
GreenstoneXMLPlugin used to (or at least in theory used to) be able …
|
|
|
@23166
|
13 years |
kjdon |
copying code from gdbm.pm: Attached :utf8 encoding to db pipes to …
|
|
|
@23165
|
13 years |
kjdon |
removed a couple of print statements
|
|
|
@23160
|
13 years |
kjdon |
member hash renamed, some tidying up
|
|
|
@23159
|
13 years |
kjdon |
buildproc member hash renamed
|
|
|
@23158
|
13 years |
kjdon |
added a couple of missing plugin description strings
|
|
|
@23156
|
13 years |
kjdon |
removed an unused string
|
|
|
@23154
|
13 years |
kjdon |
store a hash of all doc oids, then check against this hash when asked …
|
|
|
@23141
|
13 years |
davidb |
|
|
|
@23138
|
13 years |
davidb |
Elimination of rand_string
|
|
|
@23133
|
13 years |
kjdon |
still working on incremental infodb updating. cleaning up code now …
|
|
|
@23132
|
13 years |
kjdon |
for manifest files, if the user has specified Index (not Reindex) and …
|
|
|
@23131
|
13 years |
kjdon |
added a method get_total_text_length. returns the total length for the …
|
|
|
@23121
|
13 years |
kjdon |
small changes based on the fact that we need to store ids for updated …
|
|
|
@23120
|
13 years |
kjdon |
process the reconstructed docs after reading through the archives …
|
|
|
@23119
|
13 years |
kjdon |
removed some print statements, and don't go into .svn folder when …
|
|
|
@23118
|
13 years |
kjdon |
removed edit_mode from classobj->classify call
|
|
|
@23116
|
13 years |
kjdon |
for incremental build, classifiers are not really done incrementally. …
|
|
|
@23102
|
13 years |
davidb |
Attached :utf8 encoding to GDBM pipes to prevent double encoding …
|
|
|
@23085
|
13 years |
davidb |
Minor tweak to talkback.pm
|
|
|
@23084
|
13 years |
davidb |
Routines to help CGI scripts that implement DL Talkback
|
|
|
@23081
|
13 years |
kjdon |
added a classifyOID param to get_OID_entry. Used to set the …
|
|
|
@23064
|
13 years |
davidb |
Supporting Perl classes (100% pure Perl) for DL talkback facility
|
|
|
@23053
|
13 years |
kjdon |
reworking of manifest stuff. Now, instead of calling plugin::read on …
|
|
|
@23042
|
13 years |
ak19 |
Kathy fixed a cut and paste error that prevented the depositor from …
|
|
|
@22986
|
14 years |
kjdon |
need to do a file_block_read before a read when adding new docs …
|
|
|
@22953
|
14 years |
davidb |
Further code tweaks to correctly support Unicode aware strings in our …
|
|
|
@22952
|
14 years |
davidb |
Encode::decode cannot be applied to all characters returned by …
|
|
|
@22951
|
14 years |
davidb |
Encode::decode cannot be applied to all characters returned by …
|
|
|
@22950
|
14 years |
davidb |
Old routine used to work on raw binary strings that just happened to …
|
|
|
@22921
|
14 years |
sjm84 |
Anu changed this to escape backslashes in file names before using them …
|
|
|
@22900
|
14 years |
kjdon |
getting this to work properly
|
|
|
@22896
|
14 years |
kjdon |
fixed an odd bug. If had a metadata file directly in import folder, …
|
|
|
@22894
|
14 years |
kjdon |
added wpd (word perfect) extension into the list that can be processed …
|
|
|
@22887
|
14 years |
kjdon |
use new util::get_timestamped_dir, and clean_up_after_doc_processing …
|
|
|
@22886
|
14 years |
kjdon |
new method get_timestamped_tmp_folder, used by ConvertBinaryFile and …
|
|
|
@22883
|
14 years |
kjdon |
corrected a typo
|
|
|
@22882
|
14 years |
kjdon |
set up convert_to list for the case when windows_scripting and …
|
|
|
@22881
|
14 years |
kjdon |
added some powerpoint strings
|
|
|
@22880
|
14 years |
kjdon |
implemented the read method for when using open office to convert to …
|
|
|
@22879
|
14 years |
kjdon |
now have an html_multi option to convert_to (for PowerPointPlugin)
|
|
|
@22874
|
14 years |
kjdon |
no longer use filename_extension, as we should be using the original …
|
|
|
@22873
|
14 years |
kjdon |
new subroutine get_timestamped_tmp_filename_in_collection, which does …
|
|
|
@22871
|
14 years |
kjdon |
added code to generate an item file if asked for pagedimg output with …
|
|
|
@22865
|
14 years |
kjdon |
forgot to set openoffice_available so that get_default_process_exp works
|
|
|
@22864
|
14 years |
kjdon |
needed use ConvertBinaryFile
|
|
|
@22862
|
14 years |
kjdon |
changed a comment
|
|
|
@22861
|
14 years |
kjdon |
now uses new AutoLoadConverters instead of AutoloadConverterScripting. …
|
|
|
@22860
|
14 years |
kjdon |
changed a line
|
|
|
@22859
|
14 years |
kjdon |
this plugin inherits from others
|
|
|
@22858
|
14 years |
kjdon |
I have written a new version of AutoloadConverterScripting, called …
|
|
|
@22857
|
14 years |
davidb |
Further adjustments to our reading in of text files/data to be Unicode …
|
|
|
@22856
|
14 years |
davidb |
Tidy up of caller() debug statements
|
|
|
@22855
|
14 years |
davidb |
added ':utf8' to call to open file handle
|
|
|
@22853
|
14 years |
kjdon |
print parse errors to failhandle and GLI xml as well as to outhandle
|
|
|
@22852
|
14 years |
kjdon |
now prints errors to outhandle, failhandle and gli xml instead of just …
|
|
|
@22849
|
14 years |
ak19 |
Further changes to ticket 152 (movable collectdir. Both export.pl and …
|
|
|
@22844
|
14 years |
davidb |
More explicit use of utf8 for input and output file handling. Relies …
|
|
|
@22843
|
14 years |
davidb |
More explicit use of utf8 for input and output file handling. Relies …
|
|
|
@22842
|
14 years |
davidb |
Minor tidy up of code
|
|
|
@22841
|
14 years |
davidb |
More explicit use of utf8 for input and output file handling. Relies …
|
|
|
@22840
|
14 years |
davidb |
More explicit use of utf8 for input and output file handling. Relies …
|
|
|
@22839
|
14 years |
davidb |
More explicit use of utf8 for input and output file handling. Relies …
|
|
|
@22823
|
14 years |
davidb |
Backed off utf-8 binmode statement as this now causes problems in …
|
|
|
@22820
|
14 years |
davidb |
Need to call 'set_output_handle()' in additional places, to ensure the …
|
|
|
@22819
|
14 years |
davidb |
We always assume the output stream is UTF-8. Stating this explicitly …
|
|
|
@22818
|
14 years |
davidb |
Tightening up (slightly) of DOCTYPE line. Previously, it was quite …
|
|
|
@22814
|
14 years |
kjdon |
removes tidy_item_file from store_block_files as it makes the file new …
|
|
|
@22804
|
14 years |
kjdon |
change to import.OIDtype.incremental
|
|
|
@22749
|
14 years |
mdewsnip |
Added copyright statements to perllib/ClassifyTree*.pm, for consistency.
|
|
|
@22735
|
14 years |
mdewsnip |
Added DL Consulting Ltd. to the copyright statement, since we wrote …
|
|
|
@22734
|
14 years |
mdewsnip |
Added copyright header to perllib/XMLParser.pm.
|
|
|
@22732
|
14 years |
mdewsnip |
Added copyright header to perllib/parse2.pm.
|
|
|
@22731
|
14 years |
mdewsnip |
Added copyright header to perllib/manifest.pm.
|
|
|
@22709
|
14 years |
davidb |
Fixed up -process_exp so it now dynamically configures itself …
|
|
|
@22705
|
14 years |
davidb |
Use of AutoloadConverterScripting expanded to encompass PowerPoint …
|
|
|
@22702
|
14 years |
davidb |
Introduction of new plugin AutoloadConverterScripting to replace …
|
|
|
@22689
|
14 years |
mdewsnip |
Trac ticket #634: change so "ftp://" is used instead of "http://" in …
|
|
|
@22675
|
14 years |
sjm84 |
Modified PDFPlugin to use PDFBox if it is available
|
|
|
@22674
|
14 years |
sjm84 |
Added a version of ConvertBinaryFile for PDFBox
|
|
|