Home arrow static arrow Java Programming [Archive] - Conversion of non-unicode to unicode for mobile phones
Warning: Creating default object from empty value in /www/htdocs/w008deb8/wiki/components/com_staticxt/staticxt.php on line 51
Java Programming [Archive] - Conversion of non-unicode to unicode for mobile phones
This topic has 4 replies on 1 page.

Posts:3
Registered: 3/15/02
Conversion of non-unicode to unicode for mobile phones  
Aug 3, 2004 1:25 AM



 
I want to write an application to do conversion from maybe GB/ Big5 encoding to unicode? I had read on the tutorial provided by sun but my senior told me that it is not much of a help. He was telling me that in the industry I'm working in, the non-unicode character does not have an escape character before it (eg: CDF5 instead of \uCDF5).
I'm totally confused with what he is trying to tell me. One more problem that I am facing is that I read from some websites that for gb encoding, it consist of 1 byte char ( alphabetic) and 2 byte char (chinese characters). Is there anyway to determine when does it need to do the shift sequence to the 2 byte?
Can anyone provide me a guidance/hint on doing this? Please bear with me as I'm just a beginner to this.
 

Posts:5,965
Registered: 5/17/03
Re: Conversion of non-unicode to unicode for mobile phones  
Aug 3, 2004 1:43 AM (reply 1 of 4)



 
You'll have to study the source and the target codes carefully. Here's the unicode homepage,

http://www.unicode.org/

When your boss was talking about the U maybe it was just a prefix he used to mark a code as beeing Unicode. Unicodes doesn't start with a Ctrl-u. They're 16 bit numbers
 

Posts:5,965
Registered: 5/17/03
Re: Conversion of non-unicode to unicode for mobile phones  
Aug 3, 2004 1:57 AM (reply 2 of 4)



 
I get that \u stuff now.

The Java compiler recognizes the sequence \udddd (where d is a hexadecimal character 0-9, a-f) as a Unicode character.
 

Posts:10,967
Registered: 4/7/01
Re: Conversion of non-unicode to unicode for mobile phones  
Aug 3, 2004 1:46 PM (reply 3 of 4)



 
Maybe this helps [url http://java.sun.com/docs/books/tutorial/i18n/text/convertintro.html]Converting Non-Unicode Text [/url]
 

Posts:3,258
Registered: 00-08-28
Re: Conversion of non-unicode to unicode for mobile phones  
Aug 3, 2004 1:55 PM (reply 4 of 4)



 
I want to write an application to do conversion from
maybe GB/ Big5 encoding to unicode? I had read on the

first of all, All the Strings in java are always unicode utf-16 and nothing else. So when you say conversion from GB to Big5 or unicode what do you actually mean to say. And what in unicode utf-8 or utf-16


tutorial provided by sun but my senior told me that it
is not much of a help. He was telling me that in the
industry I'm working in, the non-unicode character
does not have an escape character before it (eg: CDF5
instead of \uCDF5).

The escape character would be \u or in xml it would be &#

I'm totally confused with what he is trying to tell
me. One more problem that I am facing is that I read
from some websites that for gb encoding, it consist of
1 byte char ( alphabetic) and 2 byte char (chinese
characters). Is there anyway to determine when does it
need to do the shift sequence to the 2 byte?

All the charactersets are single byte except for the CJK languages where they are always multibyte. In unicode utf-8 are one byte and utf-16 is double byte.

Can anyone provide me a guidance/hint on doing this?
Please bear with me as I'm just a beginner to this.

If you could clarify a little more as to what are you try to accomplish I am sure somebody would be glad to do so.
 
This topic has 4 replies on 1 page.