[ic] Strip HTML and IC tags

Ron Phipps interchange-users@interchange.redhat.com
Fri Feb 22 19:22:01 2002


> From: interchange-users-admin@interchange.redhat.com
[mailto:interchange-
> users-admin@interchange.redhat.com] On Behalf Of Joachim Leidinger
> 
> Ron Phipps wrote:
> >
> > How should I go about stripping all HTML and IC tags from a variable
or
> > field?  A search of the filter tag only turned up text2html which
will
> > convert line breaks to <BR> for display on an html page.  I'd like
to go
> > the other way, but remove all ITL and HTML tags.  I gather that it
would
> > take a complex set of regexes to do this from my search of google.
Is
> > there a way to do this that is included with IC or should I look at
> > writing a usertag of my own?
> 
> Maybe that or you can define your own filter, which can used like the
> other IC filter.

Thanks Joachim, that is what I ended up doing.  Both filters have been
submitted in bugzilla to be added to the filter list.

> 
> But, I've trouble to understand your wish. Maybe you want to get the
IC
> page (like the page, which is viewing in the browser) as a simple text
> file? How about creating a script, which use LWP or any kind of Agent
to
> access that page and store it as a file and call a script or program
to
> convert that HTML page into any kind of file like a text file, pdf
file
> and so on?
> 
> I'm in a muddle!
> 
> Joachim

We actually needed to strip out itl and html that was stored in a
database field so that when the plain text report is received there will
be no html or itl code, just text.

Take it easy :)
-Ron