
Re: [syndication] XML Character encoding (again)

To: syndication@yahoogroups.com
Subject: Re: [syndication] XML Character encoding (again)
From: Julian Bond <julian_bond@voidstar.com>
Date: Wed, 16 Apr 2003 18:16:16 +0100
In-reply-to: <004501c30422$8f9ebd10$2000a8c0@wkearney.com>
References: <N0zMtdBSIIn+EALt@jblaptop.voidstar.com> <69188434.20030416143702@internetalchemy.org> <004501c30422$8f9ebd10$2000a8c0@wkearney.com>
User-agent: Turnpike/6.02-U (<NihPaT1eq7QPEjqHWIMIZRyCL3>)

Bill Kearney <ml_yahoo@ideaspace.net> wrote:

> Ian's right on all counts. The only way to do this reliably is to rigorously
> examine the content being input and preview it back to the users. Convert it
> all to UTF-8.


Umm. Err.

I'm getting seriously pissed off with this. The site where this is happening typically contains UK English text, so we're talking about a limited number of awkward characters. £, €, and smart quotes, and that's about it.

I'm really tempted to just say "tough". If you don't like the character, put in a "?". If your parser barfs on my feed, well, don't read it. Programming hours are too short to start figuring out client browser capability, UTF-8 conversion from arbitrary encodings and so on.

The point here is that it's RSS containing plain text read by human beenz. I'm not trying to get 100% perfect transfer of data, I'm trying to facilitate human communication.


Getting back to trying to solve this.

I'm genuinely puzzled that a CDATA block isn't enough to protect the text byte stream from aggressive parsers.
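
A minimal sketch of why CDATA does not help here, assuming Python's standard xml.etree.ElementTree (expat underneath) as a stand-in for a strict parser. The feed snippets and the helper name parses_ok are made up for illustration. CDATA only switches off markup recognition; the bytes inside it must still be valid in the declared encoding, so a Windows-1252 pound sign (byte 0xA3) in a feed declared as UTF-8 is still rejected.

```python
import xml.etree.ElementTree as ET


def parses_ok(feed_bytes):
    """Return True if the bytes are well-formed XML (hypothetical helper)."""
    try:
        ET.fromstring(feed_bytes)
        return True
    except ET.ParseError:
        return False


# 0xA3 is the pound sign in Windows-1252, but a lone 0xA3 is not valid UTF-8.
# Wrapping it in CDATA does not change that.
bad_feed = (b'<?xml version="1.0" encoding="UTF-8"?>'
            b'<description><![CDATA[Price: \xa3 5]]></description>')

# The same text genuinely encoded as UTF-8 (pound sign becomes 0xC2 0xA3).
good_feed = ('<?xml version="1.0" encoding="UTF-8"?>'
             '<description><![CDATA[Price: \u00a3 5]]></description>'
             ).encode("utf-8")

print("mislabelled bytes parse:", parses_ok(bad_feed))
print("real UTF-8 bytes parse: ", parses_ok(good_feed))
```

The only difference between the two feeds is the byte sequence used for the pound sign; the CDATA wrapper is identical in both.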

And I wonder if I'm confusing everyone by suggesting UTF-8. Perhaps if I used another encoding, the feed would be more likely to survive, given that the vast majority of users are generating this text with Wintel PCs running IE.
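
A rough sketch of the "another encoding" idea, assuming the submitted text really is Windows-1252 and that the feed declares exactly that encoding. The sample text and the helper name build_feed are illustrative only, and Python's xml.etree.ElementTree stands in for a reader; whether any particular aggregator copes with windows-1252 is a separate question this does not answer.

```python
import xml.etree.ElementTree as ET

# Sample text with the awkward characters mentioned above:
# pound sign, euro sign, and smart quotes.
text = "\u00a3 5, \u20ac 7, \u201cquoted\u201d"


def build_feed(body, encoding):
    """Encode body in `encoding` and declare that same encoding.

    Hypothetical helper; assumes body contains no '<' or '&'.
    """
    xml = ('<?xml version="1.0" encoding="%s"?>'
           '<description>%s</description>' % (encoding, body))
    return xml.encode(encoding)


feed = build_feed(text, "windows-1252")

# Parse the bytes back and compare with the original text.
round_tripped = ET.fromstring(feed).text
print("round trip intact:", round_tripped == text)
```

The point of the sketch is that the declaration and the actual bytes agree; the choice of encoding matters less than keeping the two consistent.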


--
Julian Bond Email&MSM: julian.bond@voidstar.com
Webmaster:              http://www.ecademy.com/
Personal WebLog:       http://www.voidstar.com/
CV/Resume:          http://www.voidstar.com/cv/
M: +44 (0)77 5907 2173   T: +44 (0)192 0412 433
