Re: [syndication] Re: News Aggregator Software



> > I am looking for news aggregator software that will allow me to
> view the actual article from syndicated web pages, but stripped of
> HTML tags and layout.  All the software I have seen so far simply
> pulls back the web page.  I only want the text of the article.  Does
> anyone know of any such software?
>
> I'm guessing that you want the actual article itself stripped, not
> the RSS feed.

Scraping from web pages is an adventure.  Your scraper will break whenever the
site in question changes anything it depends upon.  In general it's considered
better all around to encourage the source to create their own RSS feed.  Feeds
are not hard to make, and most content/portal tools either support them already
or can be made to do so easily.
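
To give a rough idea of how little a feed needs, here is a minimal sketch that
builds an RSS 2.0 document with Python's standard library.  The helper name
build_rss, the example.com URLs and the sample story are all made up for
illustration; a real content tool would fill these in from its own database.

```python
import xml.etree.ElementTree as ET


def build_rss(title, link, description, items):
    """Return a minimal RSS 2.0 document as a string.

    `items` is a list of (title, link, description) tuples.
    build_rss is a hypothetical helper, not part of any existing tool.
    """
    rss = ET.Element("rss", version="2.0")
    channel = ET.SubElement(rss, "channel")
    # A channel needs a title, a link and a description.
    ET.SubElement(channel, "title").text = title
    ET.SubElement(channel, "link").text = link
    ET.SubElement(channel, "description").text = description
    for item_title, item_link, item_desc in items:
        item = ET.SubElement(channel, "item")
        ET.SubElement(item, "title").text = item_title
        ET.SubElement(item, "link").text = item_link
        ET.SubElement(item, "description").text = item_desc
    return ET.tostring(rss, encoding="unicode")


# Placeholder site and story, used only to show the shape of the output.
feed = build_rss(
    "Example News",
    "http://example.com/",
    "Hypothetical site used for illustration",
    [("First story", "http://example.com/1", "Plain-text summary of the story.")],
)
print(feed)
```

Putting a plain-text summary in each item's description is what lets an
aggregator show the article text without any scraping at all.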

Have you checked to see if a feed exists already?  Look at
http://www.syndic8.com and search around.
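
Once a feed does exist, getting text without the HTML is mostly a matter of
reading each item's description and dropping any markup inside it.  The sketch
below assumes an RSS 2.0 feed already fetched into a string; TextOnly,
strip_tags, item_texts and the SAMPLE feed are illustrative names and data, not
taken from any particular aggregator.

```python
import xml.etree.ElementTree as ET
from html.parser import HTMLParser


class TextOnly(HTMLParser):
    """Collect character data and ignore the tags around it."""

    def __init__(self):
        super().__init__()
        self.parts = []

    def handle_data(self, data):
        self.parts.append(data)


def strip_tags(markup):
    """Return the text of `markup` with tags removed and whitespace collapsed."""
    parser = TextOnly()
    parser.feed(markup)
    parser.close()
    return " ".join("".join(parser.parts).split())


def item_texts(rss_xml):
    """Return (title, plain-text description) pairs for each item in a feed."""
    root = ET.fromstring(rss_xml)
    return [
        (item.findtext("title", ""), strip_tags(item.findtext("description", "")))
        for item in root.iter("item")
    ]


# Made-up feed whose description carries escaped HTML, as many feeds do.
SAMPLE = """<rss version="2.0"><channel><title>Example</title>
<item><title>First story</title>
<description>&lt;p&gt;Some &lt;b&gt;bold&lt;/b&gt; text.&lt;/p&gt;</description></item>
</channel></rss>"""

for title, text in item_texts(SAMPLE):
    print(title, "-", text)
```

This only removes tags; it keeps the contents of script and style elements and
does nothing about navigation or ads, so it suits feed descriptions much better
than whole web pages.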

-Bill Kearney