Oracle FAQ | Your Portal to the Oracle Knowledge Grid |
Home -> Community -> Usenet -> c.d.o.server -> Re: Oracle 8.i --> Oracle 9i + Unicode
> It depends. UTF8 (now called something like AL32UTF8) is a variable-width
> encoding, where letters like "t", "e" and "a" are single byte characters.
> But letters like "?" "Ö" and "?" are double bytes. And then Chinese,
> Korean, Arabic and so on 'characters' can be triple or even quadruple
Just for correctness, letter Ö is 3 bytes in UTF-8 for some reason. So is
non-capital "ä". Both letters are used in Estonian, Finnish, Swedish +
several other Northern-Europe languages. Strange that these letters have
been ranked that low...
(note that capital "Ä" is only 2 bytes, while non-capital "ä" is 3)
SQL> select value from nls_database_parameters where parameter = 'NLS_CHARACTERSET'; VALUE
A VSIZE(A)
---------- ----------
ä 3 Ä 2 ö 3 Ö 3
Tanel. Received on Fri Sep 12 2003 - 07:38:42 CDT