Re: Any chance to surpass 4000 character limit?
From: Frank van Bortel <frank.van.bortel_at_gmail.com>
Date: Wed, 28 Jan 2009 10:35:21 +0100
Message-ID: <574a0$498026da$524b5c40$30126_at_cache5.tilbu1.nb.home.nl>
Vladimir M. Zakharychev wrote:
> On Jan 27, 11:14 am, stegem..._at_naviga.de wrote:
>
> Still, your database might eventually switch from ISO-8859-1 character
> encoding to, say, UTF-8 and there you have it: umlauts will occupy two
> bytes in UTF-8 instead of one in ISO-8859-1, and your maximum string
> length in characters will fall below 4000. Since maximum code point
> length in UTF-8 is 4 bytes, you can reliably fit at least 1000
> characters into 4000 bytes, but a string of any length beyond 1000
> characters is not guaranteed to fit. With UTF-32 every character
> occupies 4 bytes regardless, so maximum length of any string in
> characters is 1000.
>
> Regards,
> Vladimir M. Zakharychev
> N-Networks, makers of Dynamic PSP(tm)
> http://www.dynamicpsp.com
Date: Wed, 28 Jan 2009 10:35:21 +0100
Message-ID: <574a0$498026da$524b5c40$30126_at_cache5.tilbu1.nb.home.nl>
Vladimir M. Zakharychev wrote:
> On Jan 27, 11:14 am, stegem..._at_naviga.de wrote:
>> On 26 Jan., 18:48, joel garry <joel-ga..._at_home.com> wrote: >> >>> Amazing how many times I've heard this, and it has turned out to be >>> wrong. >> Hi Joel, >> >> our applications are created for German government and administration. >> The chance that German government and administration procedures ever >> might be adopted by any other administration in the world are zero, >> believe me. >> >> Cheers >> K-V
>
> Still, your database might eventually switch from ISO-8859-1 character
> encoding to, say, UTF-8 and there you have it: umlauts will occupy two
> bytes in UTF-8 instead of one in ISO-8859-1, and your maximum string
> length in characters will fall below 4000. Since maximum code point
> length in UTF-8 is 4 bytes, you can reliably fit at least 1000
> characters into 4000 bytes, but a string of any length beyond 1000
> characters is not guaranteed to fit. With UTF-32 every character
> occupies 4 bytes regardless, so maximum length of any string in
> characters is 1000.
>
> Regards,
> Vladimir M. Zakharychev
> N-Networks, makers of Dynamic PSP(tm)
> http://www.dynamicpsp.com
Ow...
since this is governmental, probably all clients will be running MicroSoft Windows of some version. Advice MSWIN1252 to be used on the database, and all client settings. No character conversions, no hassle. (although some German counties strongly advocate Linux)
Same story for NATO, btw, Joel, until countries with Cyrillic charactersets join (in which case, you are right, and no single byte set will do.)
-- Regards, Frank van BortelReceived on Wed Jan 28 2009 - 03:35:21 CST