ANN: xpath2rss 0.2
- To: syndication@yahoogroups.com
- Subject: ANN: xpath2rss 0.2
- From: Mark Nottingham <mnot@mnot.net>
- Date: Sat, 16 Jun 2001 12:47:51 -0700
- User-agent: Mozilla/5.0 (X11; U; Linux 2.4.5 i686; en-US; rv:0.9.1) Gecko/20010607
Just for fun...
http://www.mnot.net/xpath2rss/
From the README:
xpath2rss is Yet Another HTML->RSS scraper. This one's different in that
instead of using regular expressions, as most do, it uses XPath.
Why XPath?
The first reason to use XPath is to learn it; XPath is an important tool
for XML. Currently, it's used in XSLT and not too many other places.
Second, XPath is smarter than regular expressions because it is aware of
XML's (and therefore HTML's) syntax: it selects nodes by their place in
the document tree rather than by matching raw text. In theory, this makes
it a more natural fit for scraping HTML and XML, and should make it more
reliable.
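To make the idea concrete, here is a minimal sketch in Python of scraping headlines with a path expression instead of a regular expression. It is not taken from xpath2rss; the sample page, the "story" class name, and the scrape_items helper are all invented for illustration. It relies on the limited XPath subset in Python's standard xml.etree.ElementTree and assumes the page is already well-formed, so real-world HTML would likely need tidying first.

```python
import xml.etree.ElementTree as ET

# Hypothetical, already well-formed page; real HTML usually needs cleanup first.
PAGE = """<html><body>
<div class="story"><a href="/a">First headline</a></div>
<div class="ad"><a href="/x">Buy now</a></div>
<div class="story"><a href="/b">Second headline</a></div>
</body></html>"""


def scrape_items(markup):
    """Return (title, link) pairs for every link inside a 'story' div."""
    root = ET.fromstring(markup)
    # The path selects by document structure: <a> children of any <div>
    # whose class attribute equals "story", wherever that div appears.
    links = root.findall(".//div[@class='story']/a")
    return [(a.text, a.get("href")) for a in links]


items = scrape_items(PAGE)
for title, link in items:
    print(title, link)
```

The link in the "ad" div is skipped because the path names the enclosing element, something a pattern over raw text would have to approximate. Each (title, link) pair could then be written out as an RSS item.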
Cheers,
--
Mark Nottingham
http://www.mnot.net/