switching charset - mid-document



Hi!

I have a document which I am trying to parse using JavaCC. It is in a
known charset, say UTF-8. However, somewhere inside the document I might
have a fragment in some other charset, which is specified inline. For
example:

UTF8{ bla-bla-bla }
UTF-16{ bla16-bla16-bla16}
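
To make the layout concrete, here is a minimal sketch of decoding such a
document by hand, outside of the parser. It rests on assumptions that are
not part of the format as described: the charset label and both braces are
single ASCII bytes, a payload never contains a raw '}' byte (0x7D), and
every label is a name that java.nio.charset.Charset.forName accepts. The
class name MixedCharsetDecoder is made up for illustration.

```java
import java.nio.charset.Charset;
import java.nio.charset.StandardCharsets;

public class MixedCharsetDecoder {

    /** Decodes a sequence of LABEL{payload} fragments and concatenates the payloads. */
    public static String decode(byte[] input) {
        StringBuilder out = new StringBuilder();
        int pos = 0;
        while (pos < input.length) {
            byte b = input[pos];
            // Skip ASCII whitespace between fragments.
            if (b == ' ' || b == '\t' || b == '\r' || b == '\n') {
                pos++;
                continue;
            }
            int open = indexOf(input, (byte) '{', pos);
            if (open < 0) {
                throw new IllegalArgumentException("missing '{' after offset " + pos);
            }
            // Assumption: the first 0x7D byte after '{' closes the fragment.
            int close = indexOf(input, (byte) '}', open + 1);
            if (close < 0) {
                throw new IllegalArgumentException("missing '}' after offset " + open);
            }
            // The label itself is assumed to be plain ASCII, e.g. "UTF8" or "UTF-16".
            String label = new String(input, pos, open - pos, StandardCharsets.US_ASCII).trim();
            Charset charset = Charset.forName(label);
            // Decode only the bytes between the braces with the fragment's own charset.
            out.append(new String(input, open + 1, close - open - 1, charset));
            pos = close + 1;
        }
        return out.toString();
    }

    /** Returns the index of the first occurrence of target at or after from, or -1. */
    private static int indexOf(byte[] data, byte target, int from) {
        for (int i = from; i < data.length; i++) {
            if (data[i] == target) {
                return i;
            }
        }
        return -1;
    }
}
```

The resulting String could then be wrapped in a java.io.StringReader and
handed to the generated parser, so the grammar only ever sees characters.
This sidesteps the charset switch rather than performing it inside JavaCC,
and it drops the labels and braces, which may or may not be acceptable.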

Does anybody know if I can somehow instruct JavaCC to change the input
charset at some point? Thanks!

Vadim
