Re: Foreign language characters in forms



Erwin Moller wrote:
It is totally fine to use random charencodingsetX and store them in a database with charencodingY.
Works just fine, not very handy if you care for reading them back someday, but it works.

Why go through that trouble?

Charset conversions are pretty common in web applications.
For web publishing, lots of people choose UTF-8 when they must deal with content in different languages, and it's not a bad choice, as almost all browsers now offer good implementations of this encoding (although UTF-8 is not always a good choice for certain languages, for example CJK).
But often you also have to send emails, generate export files (CSV, ...), etc. And for those kinds of use you can't always use Unicode and one of its encoding schemes...
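
To illustrate the kind of conversion I mean, here is a minimal sketch in Python (not the language this thread is about; the same idea applies elsewhere). It assumes the form data arrives as UTF-8 and that the export consumer expects ISO-8859-1. The function name `to_legacy_charset` and the sample strings are made up for the example.

```python
def to_legacy_charset(text: str, charset: str = "iso-8859-1") -> bytes:
    """Encode Unicode text for a consumer that expects a legacy charset.

    Characters that the target charset cannot represent are replaced
    with '?' instead of raising an error (a lossy but predictable choice).
    """
    return text.encode(charset, errors="replace")


# Bytes as they might arrive from a UTF-8 web form (hypothetical sample).
utf8_from_form = "Caf\u00e9 Z\u00fcrich".encode("utf-8")

# Decode once at the boundary, then re-encode for the export target.
text = utf8_from_form.decode("utf-8")
latin1_bytes = to_legacy_charset(text)
```

Note the trade-off: `errors="replace"` silently loses anything outside the target charset, so for data you care about reading back you may prefer `errors="strict"` and handle the failure explicitly.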


