[ic] {Spam?} Re: How would you search, store, and display documents
Rick Bragg
lists at gmnet.net
Tue Aug 30 18:04:41 UTC 2011
On Fri, 2011-08-12 at 00:23 -0700, Paul Jordan wrote:
> I'm tasked with building a pretty complex training/educational system for
> one of my clients. This would be bring their paper manual and assets, html
> newsletters, FAQ, videos, how-to's, etc, etc all into one intuitive
> "knowledgebase" if you will.
>
> I know how I want it to work and look, but what I don't know ATM is what is
> the best format to use. My main concern is the searchability and storage of
> the main body of each article. This text will arbitrarily contain html for
> formatting, images, div's for quotes, or tables for data, and the like
> (everything will be styled with css of course)
>
> It seems to me there are several paths...
>
> #1 Store the text page with any html needed for the article in table and
> assuming html doesn't play well with fulltext searches, work around that by
> saving text-only into a second field used for searching only.
>
> #2 Delve into xml/xsl.
>
> #3 Create some sort of wiki parser to use in conjunction with IC. I *really*
> would have liked to use Kevins system, and improved upon that, but that
> doesn't seem likely.
>
> #4 Have a parser like Kevin's made, extend it to handle images, and create a
> simple online "editor" for it.
>
> It should be noted that I don't use anything but IC and Mysql, I don't like
> the headaches, worries, or distractions of some other platform. IC can do
> it, so I'd rather just have IC do it.
>
> Thank you for any advice they can lend.
>
> Paul Jordan
>
>
> _______________________________________________
> interchange-users mailing list
> interchange-users at icdevgroup.org
> http://www.icdevgroup.org/mailman/listinfo/interchange-users
>
Hi,
I would not store 2 versions of the same content. (one html, and one
clean) I would only store the html version. I would use html2text
filter to clean it up for meta description for example.
Rick
More information about the interchange-users
mailing list