Re: Oracle 8.i --> Oracle 9i + Unicode

From: Tanel Poder <change_to_my_first_name_at_integrid.info>
Date: Fri, 12 Sep 2003 15:38:42 +0300
Message-ID: <3f61be54$1_1@news.estpak.ee>

> It depends. UTF8 (now called something like AL32UTF8) is a variable-width
> encoding, where letters like "t", "e" and "a" are single byte characters.
> But letters like "?" "Ö" and "?" are double bytes. And then Chinese,
> Korean, Arabic and so on 'characters' can be triple or even quadruple

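Just for correctness: in my UTF8 database, VSIZE reports 3 bytes for the letter Ö, and also for lowercase "ä" and "ö". These letters are used in Estonian, Finnish, Swedish and several other Northern European languages. That is strange, because standard UTF-8 encodes every code point from U+0080 to U+07FF, which includes all of these letters, in 2 bytes. So the extra byte probably comes from a character set conversion somewhere between the client and the database rather than from UTF-8 itself.
(Note that capital "Ä" is only 2 bytes here, while lowercase "ä" is 3.)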

SQL> select value from nls_database_parameters where parameter = 'NLS_CHARACTERSET';

VALUE
----------------
UTF8

SQL> desc t;
 Name     Null?    Type
 -------- -------- ------------
 A                 VARCHAR2(10)

SQL> select a, vsize(a) from t;

A VSIZE(A)
---------- ----------

ä                   3
Ä                   2
ö                   3
Ö                   3
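
As a cross-check outside the database, here is a minimal Python sketch of the byte counts. It first computes the standard UTF-8 length of each letter (2 bytes for all four). It then tries one possible explanation for the 3/2/3/3 pattern above: the letters were typed in a console using the DOS code page cp850 while the client declared a Windows code page such as cp1252, so the bytes were misread before being converted to UTF-8. The code pages and the function names are assumptions chosen for illustration; the post does not say which client settings were in use.

```python
# The four letters from the VSIZE listing, written as escapes so the
# source file itself does not depend on any particular encoding.
LETTERS = ["\u00e4", "\u00c4", "\u00f6", "\u00d6"]  # a-umlaut, A-umlaut, o-umlaut, O-umlaut


def utf8_len(ch: str) -> int:
    """Number of bytes the character occupies in standard UTF-8."""
    return len(ch.encode("utf-8"))


def misread_len(ch: str, console: str = "cp850", declared: str = "cp1252") -> int:
    """UTF-8 length after the console's bytes are decoded with the wrong code page.

    Hypothetical scenario (an assumption, not taken from the post): the
    terminal produces bytes in `console`, but the client tells the server
    they are in `declared`, and the server converts the result to UTF-8.
    """
    wrong = ch.encode(console).decode(declared)
    return len(wrong.encode("utf-8"))


if __name__ == "__main__":
    for ch in LETTERS:
        # Print code points rather than the letters to avoid terminal issues.
        print(f"U+{ord(ch):04X}", utf8_len(ch), misread_len(ch))
```

Under these assumed code pages the misread lengths come out as 3, 2, 3, 3, the same pattern as the VSIZE column, which would point to a client setting mismatch rather than to UTF-8. On the Oracle side, adding dump(a, 16) to the query would show the actual stored bytes.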

Tanel.

Received on Fri Sep 12 2003 - 07:38:42 CDT