[ic] Filters with UTF-8 body

Peter peter at pajamian.dhs.org
Thu Mar 12 10:31:01 UTC 2009


On 03/12/2009 03:17 AM, Peter wrote:
> On 03/12/2009 03:04 AM, Stefan Hornburg wrote:
>> Peter Ajamian suggested that the following code in Interpolate.pm
>> causes the problem:
>>
>>  '_filter'               => qr($T{_filter}\s+($Some)\]($Some)),
>>  my $Some = '[\000-\377]*?';
> 
> More specifically $Some, $All, $XSome and $XAll will only parse 8 bit
> characters in the range \000-\377.  Not positive about this, but I think
> that changing them to the following will work:
> my $All = '(?:(?s).*)';
> my $Some = '(?:(?s).*?)';
> my $XAll = qr{(?:(?s).*)};
> my $XSome = qr{(?:(?s).*?)};


On further reflection this would probably work just as well and is less
complex looking:
my $All = '[.\n]*';
my $Some = '[.\n]*?';
my $XAll = qr{[.\n]*};
my $XSome = qr{[.\n]*?};

Peter



More information about the interchange-users mailing list