Re: Oracle 8.i --> Oracle 9i + Unicode

From: jallan <jallan_at_smrtytrek.com>
Date: 23 Sep 2003 08:23:17 -0700
Message-ID: <299f1138.0309230723.706f46f6@posting.google.com>

"Tanel Poder" <change_to_my_first_name_at_integrid.info> wrote in message news:<3f700f9e$1_1_at_news.estpak.ee>...
> > > Everything out of ASCII is at least double byte in UTF-8 - most of the
> > > additional latin letters, cyrillics, and many others take up two bytes
> in
> > > UTF-8.
> >
> > Totally correct.
> >
> > O-umlaut and a-umlaut are double-byte in UTF-8, not triple-byte.
>
> Well, despite of any standards out there, I used vsize command in Oracle to
> show, how many bytes a char really takes, and it took 3 bytes in above
> mentioned examples.
>
> Tanel.

Perhaps you are using UTF-8 and decomposed Unicode. Then an a-umluat would be a single-byte ASCII _a_ followed by a two-byte combining dieresis.

I suggest you look at a sample in hex and see what is happening.

Jim Allan Received on Tue Sep 23 2003 - 10:23:17 CDT