Oracle FAQ | Your Portal to the Oracle Knowledge Grid |
Home -> Community -> Usenet -> c.d.o.server -> Re: Oracle 8.i --> Oracle 9i + Unicode
Tanel Poder wrote:
>> > Everything out of ASCII is at least double byte in UTF-8 - most of the >> > additional latin letters, cyrillics, and many others take up two bytes
>> > UTF-8. >> >> Totally correct. >> >> O-umlaut and a-umlaut are double-byte in UTF-8, not triple-byte.
For what it's worth, here's the Oracle 9i New Features course documentation:
"utf-8 encoding is the 8-bit encoding of Unicode. It is a variable-width encoding and also a strict superset of ASCII. ... One Unicode character can be one, two, three or four bytes in this encoding. Characters from the European scripts are represented in either one or two bytes; characters from most Asian scripts are represented in three bytes, while supplementary characters are represented in four bytes."
I'd question what vsize is actually measuring.
Regards
HJR
Received on Tue Sep 23 2003 - 06:29:05 CDT