Changeset 10442


Ignore:
Timestamp:
2005-08-09T10:44:32+12:00 (19 years ago)
Author:
chi
Message:

To retrieve encoding information for the HTML file generated from word2html vb script.

File:
1 edited

Legend:

Unmodified
Added
Removed
  • trunk/gsdl/perllib/plugins/BasPlug.pm

    r10347 r10442  
    10061006    }
    10071007    }
    1008    
    1009 
     1008   
     1009    # VB scripting generated Word to HTML file
     1010    if ($text =~ /charset=(.*)[\"]/ig){
     1011    my $vbhtml_encoding = $1;
     1012    $vbhtml_encoding =~ s/-+/_/g;
     1013    $self->{'input_encoding'} = $vbhtml_encoding;
     1014    }
     1015   
    10101016    # remove <title>stuff</title> -- as titles tend often to be in English
    10111017    # for foreign language documents
Note: See TracChangeset for help on using the changeset viewer.