Re: Spaced-out Unicode Cyrillic text
- From: John Chambers <jcsd@xxxxxxxxxxxxx>
- Date: Thu, 23 Mar 2006 20:23:01 -0500
Alice Faber wrote:
T: п≈п?п?я?п?я?п? п?я?я?я?п?п?
T: ь?ь?ы?ь?ь? ь?ь?ь?ь?ь?
When I open the file in Firefox or Opera, the Russian title looks fine, but
when I open it in Safari, the Russian title has the extra spaces. And so
on for other apps.
Any clues?
Well, there are extra characters in there. I looked at various encodings and can't get back to how it originally posted, with the extra-wide spaces. This is Cyrillic. Note the ? between characters. I suspect that in some encodings this is a dead-key diacritic (in Central European, there are cedilla-like things *under* the "real" characters) and in some it's a padding space (some of my phonetics fonts used to have this as a way to fine-tune character spacing).
Nope; this is Russian. There ain't no diacritics. Well, ok; the fourth letter
is a "yo", which has what looks like an umlaut, but that's all. The two-byte
pattern is because it's UTF-8 Unicode, and the first byte gives the code
block that the next byte is in (to over-simplify a bit). What's really
there is two-byte characters.
The funny thing is that most of the Mac apps obviously get the UTF-8 part
right, because they in fact display the proper Russian characters (which
obviously didn't make it through to the above message). They couldn't
do this unless they are properly recognizing and decoding the UTF-8
charset. But they draw those funny spaces.
You could be partly right. The rendering routines could be decoding the
UTF-8 Russian letters, but using the byte count to position the character
glyphs. If so, they've got a serious rendering bug.
I wonder if we could somehow test this conjecture ...
.
- References:
- Spaced-out Unicode Cyrillic text
- From: John Chambers
- Spaced-out Unicode Cyrillic text
- Prev by Date: Re: Safari doesn't open pdf anymore..
- Next by Date: Turbo Internet problems
- Previous by thread: Re: Spaced-out Unicode Cyrillic text
- Next by thread: Re: Spaced-out Unicode Cyrillic text
- Index(es):
Relevant Pages
|
Loading