[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [syndication] XML Character encoding (again)



> Are there any out there that are not using XML parsers then? UTF-8 is
> one of the few-encodings that you can actually rely on when writing
> XML. If anyone is still parsing XML with regex then they knew what
> trouble they were getting into when they started...

The naive persist regardless.  You're right, of course, that depending on
anything other than a legitimate XML parser is a mistake.  But people are
happily consuming the RSS using all sorts of hacks and scripts.  As more
internationalized material becomes available they're in for some rough going.

-Bill Kearney