1 Apr 18:08
Error in IDN eamples for testing
Mark Davis <mark.davis <at> jtcsv.com>
2003-04-01 16:08:50 GMT
2003-04-01 16:08:50 GMT
One of our engineers took a look at the IDN samples for testing, and found a
problem in the data. Apparently some of the UTF-8 is illegal (using 2
three-byte sequences for a supplementary character).
Mark
========
Hi Mark,
The following are the offending members of structure in Appendix A of
draft-josefsson-idn-test-vectors. The data claiming to be UTF-8 is actually
CESU-8.
struct stringprep
{
char *comment;
char *in;
char *out;
char *profile;
int flags;
int rc;
}
strprep[] =
{
..........
{
"Surrogate code U+DF42",
"\xED\xBD\x82", NULL, "Nameprep", 0,
STRINGPREP_CONTAINS_PROHIBITED
},
(Continue reading)
RSS Feed