Re: Quote marks and Question marks



In article <1151497829.214312.131430@xxxxxxxxxxxxxxxxxxxxxxxxxxxx>,
"Conrad" <cweiler1@xxxxxxx> wrote:

When I send e-mail file messages written in AppleWorks or Word -- they
leave looking great. However, any quotation marks show up at the
receiving end as question marks. If I include myself in the mailing --
the messages comes back to me with question marks instead of quotation
marks. I'm using an iBook G4 Mac with mac.com address.

Yes, that is correct.

To understand why, it is first necessary to understand a little bit
about what text is. As far as a computer is concerned, there is no such
thing as text (or sounds or picutres or anything else). Everything a
computer works with, without exception, is simply numbers. We decide
what those numbers mean--whether they describe values of color in a
picture, or characters of text in a word processing program.

A long time ago, a large group of people got together and agreed on one
particular, completely arbitrary, scheme for representing text in a
computer, which they named "ASCII." The number 32, they said, would draw
on a screen as a space. The number 110 would be a lowercase letter "n".
The number 55 would be a "7". The number 46 would be a period, the
number 85 would be an uppercase letter U, and so on.

Now, when i say a long time ago, I mean a LONG time ago. As in, before
the world had settled on the notion that there are eight bits in a byte.
Many early computing devices used 7 bits, not 8 bits, to represent a
single character, so ASCII only uses 7 bits. That means that this scheme
can only have 127 characters in it.

127 characters is enough for all the upper and lowercase letters in the
English alphabet, numbers, standard punctuation, and a handful of
special control codes that early computing devices and printers needed
to tell them when to stop and start. It is not enough for accented or
foreign-language characters and it is not enough for special punctuation
like typographer's (curly) quotes.

The email system, even today, can send only 7-bit ASCII text. Nothing
else. A standard email server uses 7-bit ASCII, period. This is even
true of file attachments in email; When you attach a file like a JPEG or
a PDF to an email, it gets specially "encoded" into a system that
represents it using 7-bit ASCII.

If you type an email message that contains characters which are not part
of standard ASCII, such as curly quotes or accented characters, those
characters are lost, because email works only with ASCII.

--
Art, photography, shareware, polyamory, literature, kink:
all at http://www.xeromag.com/franklin.html
Nanohazard, Geek shirts, and more: http://www.villaintees.com
.



Relevant Pages

  • Re: convert unicode characters to visibly similar ascii characters
    ... I want to convert unicode character into ascii one. ... But there are still lots of characters beyond this range, which can be manually converted to some visibly similar ascii characters. ... For instance, there are several quotation marks in unicode, which can be converted into ascii quotation mark. ...
    (comp.lang.python)
  • Re: ASCII convention
    ... Angle brackets are typically larger and have an obtuse angle, ... > Angle brackets do not belong to ASCII, of course, and they are somewhat ... It also debatable which characters you should for angle ... > quotation marks (according to the rules of the language you are writing ...
    (sci.lang)
  • Re: File-Compare "fc" falsely reports mismatch between identical files
    ... first and last lines of each set of differences, whereas /L is said to compare files as ascii text. ... Show me a couple of "text files" that fc/a does not compare properly, and I would argue that they are so extreme in some way that I would not consider them "text files". ... One of the definitions found by google is this: "A file that contains characters organized into one or more lines. ... the tax department reacted to a customer's complaint and insisted that the faulty tax calculation be fixed. ...
    (microsoft.public.win2000.cmdprompt.admin)
  • Re: POS. Cash Register on AS400.- New and Updates
    ... Probably the easiest way would be to send them as ASCII. ... You need to change the printer file to not convert unprintable characters. ... "The INITPRT tag defines the ASCII control ... but still can not open cash drawer. ...
    (comp.sys.ibm.as400.misc)
  • Re: Unicode Support
    ... consider:)...but, you know, a file is still just a "stream of characters" ... "escape sequence" but accessing an ordinary ASCII character) are considered ... English, while all your identifiers are in "Romanji" Japanese or something ... NASM appears already to do so with strings and comments in ...
    (alt.lang.asm)