Changeset 23284


Ignore:
Timestamp:
2010-11-03T17:38:32+13:00 (11 years ago)
Author:
sjm84
Message:

Dr Bainbridge's modification of nice-string subroutine which will print the unicode strings for debugging (printable ASCII chars are shown as-is, other chars are shown with their hex values for their unicode code points.

File:
1 edited

Legend:

Unmodified
Added
Removed
  • main/trunk/greenstone2/perllib/util.pm

    r23249 r23284  
    13741374}
    13751375
     1376# To print UTF8 (unicode) for debugging:
     1377# Characters that are easily displayed (that is, printable ASCII)
     1378# are shown as-is, whereas hexcodes are shown for all other chars.
     1379# Useful method for debugging when doing encoding related work.
     1380sub debug_unicode_string
     1381{
     1382    join("",
     1383         map { $_ > 128 ?                      # if wide character...
     1384                   sprintf("\\x{%04X}", $_) :  # \x{...}
     1385                   chr($_)         
     1386               } unpack("U*", $_[0]));         # unpack Unicode characters
     1387}
    13761388
    137713891;
Note: See TracChangeset for help on using the changeset viewer.