[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: [syndication] Entities and Char sets
2001-11-27 16:04:58, Julian Bond <julian_bond@voidstar.com>:
> But when you're trying to get the content
> down an XML-RPC pipe and the toolkits don't interoperate
> because they barf on whitespace between tags or tell you
> the name-value is missing when actually it's just empty,
Just curious, could you name a few toolkits that barf on
whitespace between tags? (I'm not sure what you mean by 'name-
value' missing.)
> So what is the *correct* way of dealing with reserved and
> high order characters in RSS content, such that the maximum
> number of clients can read it and get the content to the
> reader?
Use the 'UTF-8' encoding. Write all ASCII characters directly,
and write all other characters (i.e. all characters at
codepoint > 127) as decimal character references. Then I
believe you should be pretty safe. Simplified example:
<?xml version="1.0" encoding="UTF-8"?>
<title>Trademark symbol: ™</title>
--
Karl Ove Hufthammer