[interchange-i18n] HTML::Entities and Unicode

Ethan Rowe ethan at endpoint.com
Mon Oct 11 15:08:10 UTC 2004


Hi.

Back in November of 2003, Chen Naor posted a patch for the 
HTML::Entities Perl module that made the HTML encoding routines 
multi-byte safe.  I'm wondering if anybody knows this to work or not 
work with Unicode UTF-8 encoding (specifically, Unicode encoding of 
traditional Chinese characters, or Farsi, or anything else well outside 
the Latin1 subset).  Forgive me if this question is silly for some 
reason but I'm quite new to the world of multi-language systems, 
character encoding, etc.

Is this patch still the way to go if you want to get outside of the 
latin1 character set using Interchange?

Thanks very much in advance.

-- 
Ethan Rowe
End Point Corporation
ethan at endpoint.com




More information about the interchange-i18n mailing list