[interchange-i18n] HTML::Entities and Unicode

Chen Naor chen at lilux.co.il
Mon Oct 11 21:17:05 UTC 2004


Stefan Hornburg wrote:
> On Mon, 11 Oct 2004 11:08:10 -0400
> Ethan Rowe <ethan at endpoint.com> wrote:
>
>> Hi.
>>
>> Back in November of 2003, Chen Naor posted a patch for the
>> HTML::Entities Perl module that made the HTML encoding routines
>> multi-byte safe.  I'm wondering if anybody knows this to work or not
>> work with Unicode UTF-8 encoding (specifically, Unicode encoding of
>> traditional Chinese characters, or Farsi, or anything else well
>> outside the Latin1 subset).  Forgive me if this question is silly
>> for some reason but I'm quite new to the world of multi-language
>> systems, character encoding, etc.
>>
>> Is this patch still the way to go if you want to get outside of the
>> latin1 character set using Interchange?
>
> IMHO this needs thorough investigation and testing. If you happen
> to have Perl 5.6 running, that makes things even more difficult.
> But it really should be done.
>
> Bye
> Racke

Hi,
If you use Perl 5.8.x, the HTML::Entities build asks whether you want
to enable UTF-8/Unicode support when you compile it.
I have never tried it, but logically it should work :).
In that case you should not use my patch.
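
A quick way to check would be something along these lines. This is only
a sketch: the sample characters and variable names are made up for
illustration, and it assumes Perl 5.8.x with an HTML::Entities that was
built with the Unicode option enabled.

```perl
use strict;
use warnings;
use HTML::Entities qw(encode_entities decode_entities);

# Sample text well outside Latin-1 (illustrative only):
# two Chinese characters, an ampersand, and one Arabic-script letter.
my $text = "\x{4E2D}\x{6587} & \x{0641}";

# Non-void context, so both calls return a copy instead of
# modifying their argument in place.
my $encoded = encode_entities($text);
my $decoded = decode_entities($encoded);

binmode(STDOUT, ':utf8');
print "encoded: $encoded\n";
print $decoded eq $text ? "round trip ok\n" : "round trip FAILED\n";
```

If the encoded string contains only ASCII and the round trip succeeds,
the stock module is handling the multi-byte characters on its own.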
Good luck,
Chen



