Ignore:
Timestamp:
2005-10-26T11:56:02+13:00 (18 years ago)
Author:
mdewsnip
Message:

When processing Word documents in an 8-bit encoding wvWare would correctly convert the documents to Unicode, but then HTMLPlug would incorrectly convert them again. Now sets the input_encoding option of the secondary HTMLPlug to "utf8" always.

File:
1 edited

Legend:

Unmodified
Added
Removed
  • trunk/gsdl/perllib/plugins/WordPlug.pm

    r10723 r10769  
    153153    my $text_options = $secondary_plugin_options->{'TextPlug'};
    154154   
    155     # wvWare will always produce html files encoded as utf-8
     155    # wvWare will always produce html files encoded as utf-8, so make sure the secondary HTMLPlug knows this
     156    push(@$html_options,"-input_encoding", "utf8");
    156157    if ($self->{'input_encoding'} eq "auto") {
    157158    $self->{'input_encoding'} = "utf8";
    158159    $self->{'extract_language'} = 1;
    159     push(@$html_options,"-input_encoding", "utf8");
    160160    push(@$html_options,"-extract_language");
    161161    }
Note: See TracChangeset for help on using the changeset viewer.