[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: [syndication] ANN: xpath2rss 0.5
Java isn't my cup of tea (yet, tho I'm learning...). However, I don't
know if you can really do SAX for plain HTML; HTML, after all, isn't
XML.
I'm able to use XPath because you can generate a DOM for HTML, and
there's a relatively easy mapping from XPath expressions to a DOM.
Also, using SAX pretty much sentences users to either writing code,
or you to coming up with a syntax for indexing into the document that
probably will end up looking like XPath.
Just my .02...
On Tue, Nov 06, 2001 at 01:50:06PM -0800, Mike Dierken wrote:
> Do you know of a Java HTML->SAX event generator?
> I'd like to do the same sort of 'screen scraping' via a normalized XML
> document created from HTML.
--
Mark Nottingham
http://www.mnot.net/