[sc34wg3] Unicode normalization

Patrick Durusau sc34wg3@isotopicmaps.org
Thu, 14 Jul 2005 10:45:41 -0400


Lars,

The first note under 4.2 The fundamental types reads:

"This document does not require Unicode normalization to be applied to 
strings order to detect that syntactically different but logically 
equivalent strings are in fact equivalent. The application of such logic 
is encouraged, however. As it cannot be guaranteed that normalization 
will be performed reliance on normalization is strongly discouraged."

(The same issue occurs in the note in 4.3, Locators.)

I read the paragraph as saying:

"This document does not require Unicode normalization to be applied to 
strings order to detect that syntactically different but logically 
equivalent strings are in fact equivalent."

Unicode normalization is not required.

"The application of such logic is encouraged, however."

But use of logic to determine "equivalent strings" is encouraged.

"As it cannot be guaranteed that normalization will be performed 
reliance on normalization is strongly discouraged."

But reliance on normalization is discouraged.

So, what is being encouraged is use of logic to determine equivalent 
strings that does not rely on normalization?

OK, I'll buy that but the mixing of encourage and discourage seems 
confusing.

Suggest:

"Application of logic to determine the equivalency of strings is 
encouraged. However, since Unicode normalization of strings is not 
required and therefore cannot be guaranteed, reliance on normalization 
in the application of such logic is strongly discouraged."

Hope you are having a great day!

Patrick

-- 
Patrick Durusau
Patrick@Durusau.net
Chair, V1 - Text Processing: Office and Publishing Systems Interface
Co-Editor, ISO 13250, Topic Maps -- Reference Model
Member, Text Encoding Initiative Board of Directors, 2003-2005

Topic Maps: Human, not artificial, intelligence at work!