
RE: [sympa-users] Odd characters in archived messages

Subject: The mailing list for listmasters using Sympa

  • From: "Adam Bernstein" <address@concealed>
  • To: Olivier Salaün - CRU <address@concealed>, "Mark Valiukas" <address@concealed>
  • Cc: <address@concealed>
  • Subject: RE: [sympa-users] Odd characters in archived messages
  • Date: Fri, 22 Apr 2005 15:54:25 -0700

I'm very happy to see someone else is seeing this problem -- we've
seen it manifest in several slightly different ways, and often the
subscribers see the altered characters in their received messages,
not just in the Web archives (and sometimes not at all in the Web
archives). FWIW, I believe it usually, or possibly always, happens
when the sender of the message is on a Mac. Is that true for you, Mark?
We have not been able to do much troubleshooting because of that fact.

> Actually there's nothing MHonArc can do with mis-encoding problems.
> There's no way it can find out these characters are supposed to be utf-8
> encoded. This issue has to be fixed at the operating system level: it
> should properly handle charset mapping while doing copy/paste.

I accept that as a reasonable diagnosis, but this is still a problem
that seems to come up only when people are composing newsletters and
sending them to lists. As far as they're concerned it's a Sympa
problem, so we should do our best to fix it.

If Mark's diagnosis is correct, I can imagine a filter on all incoming
messages that would look at the character encoding header, then scan
for any 8-bit characters, and if it finds them, change the character
encoding header to UTF-8...? I'm speaking somewhat in ignorance here;
I don't know much about character sets and encoding. Or it could look
for specific 8-bit characters (end-of-line, apostrophe, hyphen) and
replace them with appropriate substitutes.
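
To make the two ideas above concrete, here is a rough sketch in Python. It is
not Sympa or MHonArc code; the function names and the substitution table are
made up for illustration. It assumes the body bytes have already had their
Content-Transfer-Encoding undone, and it only relabels a message when the
8-bit bytes happen to form valid UTF-8, which is a heuristic rather than a
guarantee (a short Latin-1 text can, rarely, also be valid UTF-8).

```python
# Hypothetical table for the second idea: typographic characters that
# often come from copy/paste, mapped to plain ASCII substitutes.
ASCII_SUBSTITUTES = {
    "\u2018": "'", "\u2019": "'",   # curly single quotes / apostrophe
    "\u201c": '"', "\u201d": '"',   # curly double quotes
    "\u2013": "-", "\u2014": "-",   # en dash, em dash
}


def guess_charset(raw_body: bytes, declared_charset: str) -> str:
    """Return the charset label to use for raw_body.

    Keeps the declared label unless the body contains 8-bit bytes
    that decode cleanly as UTF-8, in which case "utf-8" is returned.
    """
    # Pure 7-bit content reads the same under any ASCII-compatible label.
    if all(byte < 0x80 for byte in raw_body):
        return declared_charset
    try:
        raw_body.decode("utf-8")
    except UnicodeDecodeError:
        # Not valid UTF-8, so leave the sender's label alone.
        return declared_charset
    return "utf-8"


def replace_typographic(text: str) -> str:
    """Replace the characters listed in ASCII_SUBSTITUTES with ASCII."""
    return text.translate(str.maketrans(ASCII_SUBSTITUTES))
```

A filter built this way would call guess_charset() on each text part and
rewrite the charset parameter only when the answer differs from what the
sender declared; replace_typographic() is the more drastic alternative, since
it changes the message text itself.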

adam



