Changeset 16753
- Timestamp:
- 2008-08-13T13:10:23+12:00 (16 years ago)
- File:
-
- 1 edited
Legend:
- Unmodified
- Added
- Removed
-
gsdl/trunk/perllib/plugins/ReadTextFile.pm
r16724 r16753 340 340 (exists $self->{'converted_to'} && $self->{'converted_to'} eq 'HTML')){ 341 341 342 # remove comments, including multiline ones, so that we don't match on 343 # inactive tags (those that are nested inside comments) 344 $text =~ s/<!--.*?-->//sg; 345 342 346 # remove <title>stuff</title> -- as titles tend often to be in English 343 347 # for foreign language documents … … 349 353 } 350 354 # check the meta http-equiv charset tag unless it is commented out 351 elsif ( ($text !~ /<!--[^<>]?<meta http-equiv/i) && ($text =~ /<meta http-equiv.*content-type.*charset=(.+?)\"/i)) {355 elsif ($text =~ m/<meta http-equiv.*content-type.*charset=(.+?)\"/i) { 352 356 $best_encoding = $1; 353 357 # print STDERR "**** meta tag found, encoding is: $best_encoding\n";
Note:
See TracChangeset
for help on using the changeset viewer.