Re: case-sensitivity



Ray Dillinger <bear@xxxxxxxxx> writes:
I don't think I'm ready to conduct the straw poll myself, but
here's my (somewhat paradoxical) answer to it. For my
*personal* comfort, I'd prefer case-insensitivity for the
characters A-Z (and a-z) ONLY. But I'd never recommend that
as a standard, because it's culturecentric against those
people whose "first and most familiar" alphabet isn't the
roman alphabet.

That is, the English and Latin alphabets.

I don't know of any other language written in the roman alphabet that has no accented letters.

Partial mappings such as boîte -> BOîTE or BOÎTE -> boÎte are unfortunate.

So now you may want to extend it to ISO-8859-1, but then you'll hit ß, which has no uppercase counterpart in that character set.
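
To make the two problems concrete, here is a small Python sketch. The helpers ascii_upcase and ascii_downcase are hypothetical names made up for this illustration; they mimic a reader that folds case for A-Z/a-z only. The last lines use Python's built-in Unicode-aware str.upper and str.lower, just to show that ß does not round-trip.

```python
def ascii_upcase(s: str) -> str:
    """Upcase a-z only; every other character is left untouched."""
    return "".join(chr(ord(c) - 32) if "a" <= c <= "z" else c for c in s)


def ascii_downcase(s: str) -> str:
    """Downcase A-Z only; every other character is left untouched."""
    return "".join(chr(ord(c) + 32) if "A" <= c <= "Z" else c for c in s)


# ASCII-only folding leaves the accented letter in its original case.
print(ascii_upcase("boîte"))    # BOîTE
print(ascii_downcase("BOÎTE"))  # boÎte

# Full Unicode case mapping handles î, but ß upcases to two letters
# and does not come back when downcased again.
print("boîte".upper())          # BOÎTE
print("straße".upper())         # STRASSE
print("STRASSE".lower())        # strasse
```

So even with a full case mapping, "straße" and "strasse" end up as the same identifier after upcasing, which is probably not what anybody expects.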

For the standard, I think I'd recommend case-sensitivity,
just because I don't want to have to figure out whether
identifiers in a character set I'm unfamiliar with are
"the same" identifier under case mapping rules I don't
know.

Indeed, it's the best option.


Otherwise (if case-insensitivity is preserved) then if I
ever work with code that has non-ascii identifiers, I'm
going to have to write a "code sanitizer" that smashes
case deliberately in order to make all identifiers that
are *logically* the same *look* the same.

Some Pascal pretty-printers did that.
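
For illustration only, such a "code sanitizer" could be sketched in a few lines of Python. This is a toy under loud assumptions: smash_case is a made-up name, identifiers are taken to be runs of Unicode word characters starting with a letter or underscore, and str.casefold is assumed to be an acceptable notion of "the same". It makes no attempt to skip string literals or comments, which a real tool would have to do.

```python
import re

# Assumption: an identifier starts with a letter or underscore and
# continues with word characters (Unicode-aware in Python 3).
IDENTIFIER = re.compile(r"[^\W\d]\w*")


def smash_case(source: str) -> str:
    """Casefold every identifier-like token in the source text."""
    return IDENTIFIER.sub(lambda m: m.group(0).casefold(), source)


print(smash_case("Boîte := BOÎTE + 1;"))  # boîte := boîte + 1;
print(smash_case("Straße := STRASSE;"))   # strasse := strasse;
```

Note that the second line silently merges Straße and STRASSE, so the sanitizer has to pick some case mapping rule on the reader's behalf, which is exactly the problem described above.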

--
A: Because it messes up the order in which people normally read text.
Q: Why is top-posting such a bad thing?
A: Top-posting.
Q: What is the most annoying thing on usenet and in e-mail?
__Pascal Bourguignon__ http://www.informatimago.com/