Ignore:
Timestamp:
2021-02-26T19:39:51+13:00 (3 years ago)
Author:
anupama
Message:

Committing the improvements to EmbeddedMetaPlugin's processing of Keywords vs other metadata fields. Keywords were literally stored as arrays of words rather than phrases in PDFs (at least in Diego's sample PDF), whereas other meta fields like Subjects and Creators stored them as arrays of phrases. To get both to work, Kathy updated EXIF to a newer version, to retrieve the actual EXIF values stored in the PDF. And Kathy and Dr Bainbridge came up with a new option that I added called apply_join_before_split_to_metafields that's a regex which can list the metadata fields to apply the join_before_split to and whcih previously always got applied to all metadata fields. Now it's applied to any *Keywords metafields by default, as that's the metafield we have experience of that behaves differently to the others, as it stores by word instead of phrases. Tested on Diego's sample PDF. Diego has double-checked it to works on his sample PDF too, setting the split char to ; and turning on the join_before_split and leaving apply_join_before_split_to_metafields at its default of .*Keywords. File changes are strings.properties for the tooltip, the plugin introducing the option and working with it and Kathy's EXIF updates affecting cpan/File and cpan/Image.

File:
1 edited

Legend:

Unmodified
Added
Removed
  • main/trunk/greenstone2/perllib/cpan/File/RandomAccess.pod

    r24107 r34921  
    44# Description:  Buffer to support random access reading of sequential file
    55#
    6 # Legal:        Copyright (c) 2003-2010 Phil Harvey (phil at owl.phy.queensu.ca)
     6# Legal:        Copyright (c) 2003-2021 Phil Harvey (philharvey66 at gmail.com)
    77#               This library is free software; you can redistribute it and/or
    88#               modify it under the same terms as Perl itself.
     
    4848=item Inputs:
    4949
    50 0) Reference to RandomAccess object.
     500) Reference to RandomAccess object or RandomAccess class name.
    5151
    52521) File reference or scalar reference.
    5353
    54 2) flag set if file is already random access (disables automatic SeekTest).
     542) Flag set if file is already random access (disables automatic SeekTest).
    5555
    5656=item Returns:
     
    216216=back
    217217
     218=head1 OPTIONS
     219
     220=over 4
     221
     222=item B<NoBuffer>
     223
     224Avoid buffering sequential files.
     225
     226    $raf->{NoBuffer} = 1;
     227
     228When this option is set, old data is purged from the internal buffer before
     229a read operation on a sequential file.  In this mode, memory requirements
     230may be significantly reduced when reading sequential files, but seeking
     231backward is limited to within the size of the internal buffer (which will be
     232at least as large as the last returned data block), and seeking relative to
     233the end of file is not allowed.
     234
     235=back
     236
    218237=head1 AUTHOR
    219238
    220 Copyright 2003-2011 Phil Harvey (phil at owl.phy.queensu.ca)
     239Copyright 2003-2021 Phil Harvey (philharvey66 at gmail.com)
    221240
    222241This library is free software; you can redistribute it and/or modify it
Note: See TracChangeset for help on using the changeset viewer.