[HanoiLUG] vietnamese language and Unicode

le nhu xuan lnhxuan at yahoo.com
Fri Jul 6 03:48:57 ICT 2007


we need to thingking about 32 bits

Phan Thái Trung <phanthaitrung at gmail.com> wrote:  2007/7/5, Jean Christophe André <jean-christophe.andre at auf.org>:    
> which only supports ANSI characters. In those environments, apps need
> encode non-ANSI characters like TCVN3 into encoding character sets
> such as ISO-8859-1 or UTF-x.
I guess you are talking about ASCII = ISO-646, the 7 bits American 
Standard Code for Information Interchange (cf "man ascii") and not ANSI,
the American National Standards Institute (american equivalent for TCVN).

If you are really talking about ASCII-only compatible application, then 
the only reasonnable choice is to use UTF-7 which allow to store the
full Unicode charset in a 7-bits space.

I'm not sure to understand the problem here... I'm a developper myself
and, even if I never developped under recent Windows versions, I can 
tell that they *do* have a very good Unicode support from Windows 2000.
Windows Unicode support even started with Windows 98 but was very
limited at this time. The best proof of this is OpenOffice.org that does
a very good use of the Unicode support.

In my opinion, the biggest problem is about Unicode understanding. There
is not only one encoding for Unicode and, as always, Microsoft made some
incompatibility strategy choice in using UCS-2 instead of the most 
frequently used UTF-8 in the Internet world. So that Windows Unicode
encoded text are totaly different from Unix one and here begins the
needs for recoding and so starts the difficulties for young
unexperimented students. 



You'r right that Win32 supports Unicode string from WinNT (Win2k). But I told about "common Win developing IDE", mean Integrated Dev Environment such as common  text editor, compilers.... they usually do not allow Win programmer work well with Unicode. If you meet traditional Win programmer, you will find that their products have difficulty to support Unicode unless they use .NET compilers or other 3rd party tools. The best way for now still use "TCVN3-like" for apps even in a fully supported unicode OS like WinXP. 

Win API funtions that support Unicode are ending by W (wide) and functions for non-unicode string are ending by A (ANSI, not ASCII). E.g GetWindowTextW() vs. GetWindowTextA(). That's why I called "ANSI" like MS way. 




_______________________________________________
HanoiLUG mailing list
HanoiLUG at lists.hanoilug.org
http://lists.hanoilug.org/mailman/listinfo/hanoilug


       
---------------------------------
Get the free Yahoo! toolbar and rest assured with the added security of spyware protection. 
-------------- section suivante --------------
Une pi?ce jointe HTML a ?t? nettoy?e...
URL: http://lists.hanoilug.org/pipermail/hanoilug/attachments/20070705/764da535/attachment-0001.htm 


More information about the HanoiLUG mailing list