Oracle FAQ | Your Portal to the Oracle Knowledge Grid |
Home -> Community -> Usenet -> c.d.o.server -> Re: Oracle 8.i --> Oracle 9i + Unicode
"Tanel Poder" <change_to_my_first_name_at_integrid.info> wrote in message news:<3f700f9e$1_1_at_news.estpak.ee>...
> > > Everything out of ASCII is at least double byte in UTF-8 - most of the
> > > additional latin letters, cyrillics, and many others take up two bytes
> in
> > > UTF-8.
> >
> > Totally correct.
> >
> > O-umlaut and a-umlaut are double-byte in UTF-8, not triple-byte.
>
> Well, despite of any standards out there, I used vsize command in Oracle to
> show, how many bytes a char really takes, and it took 3 bytes in above
> mentioned examples.
>
> Tanel.
Perhaps you are using UTF-8 and decomposed Unicode. Then an a-umluat would be a single-byte ASCII _a_ followed by a two-byte combining dieresis.
I suggest you look at a sample in hex and see what is happening.
Jim Allan Received on Tue Sep 23 2003 - 10:23:17 CDT