[babel-devel] Unicode issues, esp security

Luis Oliveira luismbo at gmail.com
Mon Apr 13 22:02:52 UTC 2009


Stelian Ionescu <stelian.ionescu-zeus at poste.it> writes:

> On Mon, 2009-04-13 at 22:24 +0200, james anderson wrote:
>> [ironic in this discussion, is that utf-8b is non-conformant - by  
>> definition.]
>
> I don't think so. See http://www.unicode.org/versions/Unicode5.1.0/
> paragraph E: "in processing the UTF-8 code unit sequence <F0 80 80 41>,
> the only requirement on a converter is that the <41> be processed and
> correctly interpreted as <U+0041>."

I think James' point is that UTF-8B is not specified by any standard so
it has nothing to conform to.

You are right, though, that the UTF-8B decoding process is
compatible/conformant with UTF-8. Not so for the encoding process: a
UTF-8B encoder might generate invalid UTF-8.

-- 
Luís Oliveira
http://student.dei.uc.pt/~lmoliv/




More information about the babel-devel mailing list