Re: Yunnan 6FTM Geography map



You filled in the gaps heretofore. I hope NeoCathay will take pity on
me again and do the same when I ask the Chinese character questions.
Sasha has helped in the past. For Lew, you go to Unicode.Org and
download the 26 meg Unihan.txt file and extract the corresponding two
byte national language sets with Unicodes you want. Just beware the
GB2312 codes are in raw decimal row-column format so you add
160(32+168) to each row and column and convert to hex for the text
values. The BIG5 codes are used as is. Basically you take the two
bytes for a Chinese character from a language set, look it up in a
table you extracted from Unihan.txt, and use the corresponding Unicode
value to look up the character using the Unicode website which will
return glyph and Unicode browser representation which you can copy and
paste for Google searches. I'll let you know if there are any special
BUT-IFS for Japanese and Korean when I get to it.

Jim

samarkand wrote:
> No Jim,
>
> NeoCathay should be credited for the effort...I merely fill in the gaps.
>
> Danny
>
>
> "Space Cowboy" <netstuff@xxxxxxxxxxxxx> wrote in message
> news:1133295239.864999.63120@xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
> > Thanks for trying, Danny. Everytime I think I understand the Chinese
> > character sets I don't. My backdoor from GB2312 to UNICODE seems to
> > work in more than one case. Now if I can find a BIG5 equivalent. I
> > think it odd that the Chinese gov't bets that a 4k simplified character
> > set can handle popular media versus the 25k traditional for academics
> > and literati.
> >
> > Jim
> >
> > samarkand wrote:
> >> Sorry, it ain't working...
> >>
> >> Danny
> >>
> >> "samarkand" <samarkand@xxxxxxx> wrote in message
> >> news:438c8e57$1@xxxxxxxxxxxxxxxxxxxxxx
> >> > No. 2 is Mang Zhi, not Jie Zhi. (??, not ??).
> >> > No. 4 is Man Sa (??), not Yi Wu (??).
> >> > No. 6 is Yi Bang (??), phonetically correct, but the spelling for Bang
> >> > (?)
> >> > is incorrect.
> >> >
> >> > "samarkand" <samarkand@xxxxxxx> wrote in message
> >> > news:438c8d96$1@xxxxxxxxxxxxxxxxxxxxxx
> >> >> Let me try and see if I can work this tricky thing out:
> >> >>
> >> >> No. 2 is Mang Zhi, not Jie Zhi. (??, not ??).
> >> >> No. 4 is Man Sa (??), not Yi Wu (??).
> >> >> No. 6 is Yi Bang (??), phonetically correct, but the spelling for Bang
> >> >> (?)
> >> >> is incorrect.
> >> >>
> >> >> "Space Cowboy" <netstuff@xxxxxxxxxxxxx> wrote in message
> >> >> news:1133282343.959429.196330@xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
> >> >>> Looking back it's a minor point but your Chinese character
> >> >>> corrections
> >> >>> got lost in non Chinese charset. If you can give me the Pinyin Vowel
> >> >>> Numbers and brief English I can pick it up from there. I can also
> >> >>> handle anything in GB2312 or UNICODE.
> >> >>>
> >> >>> Thanks,
> >> >>> Jim
> >> >>>
> >> >>> No. 2 is Man Zhi, not Jie Zhi. (??).
> >> >>> No. 4 is Man Sa (??), not Yi Wu (??).
> >> >>> No. 6 is Yi Bang (??), phonetically correct, but the spelling for
> >> >>> Bang
> >> >>> is incorrect
> >> >>>
> >> >>> samarkand wrote:
> >> >>>> Most of what NeoCathay translated for you are correct, the ones that
> >> >>>> are
> >> >>>> incorrect I have inserted to his thread.
> >> >>>>
> >> >>>> :")
> >> >>>>
> >> >>>> Danny

.



Relevant Pages

  • Re: Yunnan 6FTM Geography map
    ... the name of the mountain which in this case is four GB2312 Chinese ... representing each chinese character for illustrative purposes. ... UNICODE lookup using the mapping table at: ...
    (rec.food.drink.tea)
  • Re: VB - Ascii to Unicode and then Unicode to UTF-8 conversion (Very desperate!!)
    ... Latin together) then you have to use a Unicode column type. ... AscW returns the real Unicode character ... for Chinese characters, ... then the next thing to worry about is your CSV file. ...
    (microsoft.public.vb.general.discussion)
  • Re: Unicode Support
    ... if two Unicode strings are the same? ... UTF-16 is basically telling everyone "ok we all got to start ... character, and will likely support *both* endians. ... UTF-8 encodings are also easy to learn to ...
    (alt.lang.asm)
  • Re: KANJD212
    ... >>Who decides the factors and what are their criteria, Unicode? ... But once a character is defined/get a codepoint in Unicode it ... standard modifies the codepoint of the kanji to a totally new ... I can use a code like JIS X0208 along with a font ...
    (sci.lang.japan)
  • Re: Determining if a string is Unicode
    ... there's nothing magic about Unicode. ... where each character occupies 2 bytes, as opposed to a Single-Byte Character ... You could load up a string with rubbish, ... > INF file like so: ...
    (microsoft.public.vb.general.discussion)