[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

ANN: xpath2rss 0.2

To: syndication@yahoogroups.com
Subject: ANN: xpath2rss 0.2
From: Mark Nottingham <mnot@mnot.net>
Date: Sat, 16 Jun 2001 12:47:51 -0700
User-agent: Mozilla/5.0 (X11; U; Linux 2.4.5 i686; en-US; rv:0.9.1) Gecko/20010607

Just for fun...

http://www.mnot.net/xpath2rss/

From the README:

xpath2rss is Yet Another HTML->RSS scraper. This one's different in that
instead of using regular expressions, as most do, it uses XPATH.

Why XPATH?

The first reason to use XPATH is to learn it; XPATH is an important tool
for XML. Currently, it's used in XSLT and not too many other places.

Secondly, XPATH is smarter than regex because XPATH is aware of XML's
(and therefore HTML's) syntax; theoretically, this makes it a more
natural fit for scraping HTML and XML, and should make it more reliable.


Cheers,

--
Mark Nottingham
http://www.mnot.net/

Follow-Ups:
- Re: [syndication] ANN: xpath2rss 0.2
  - From: Mark Nottingham <mnot@mnot.net>

Prev by Date: Re: What to call the RSS file
Next by Date: yellowbrix to acquire isyndicate's assets
Previous by thread: OT: Nice Quote from XML.com
Next by thread: Re: [syndication] ANN: xpath2rss 0.2
Index(es):
- Date
- Thread