Bug #1588: The fn:normalize-unicode() function produces unexpected results for non-ascii characters - Saxon-CE - Saxonica Developer Community

Bug #1588

closed

The fn:normalize-unicode() function produces unexpected results for non-ascii characters

Added by Philip Fearon over 12 years ago. Updated almost 12 years ago.

Status: Closed
Priority: Normal
Assignee: Philip Fearon
Sprint/Milestone: Release 1.1
Start date: 2012-07-17
Due date:
% Done: 100%
Estimated time:
Platforms:

Description

The expression

string-to-codepoints(normalize-unicode('Eisbär', 'NFKD'))

is returning

69 105 115 98 4192 33536 114

whereas the correct answer is

69 105 115 98 97 776 114
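
The expected value can be cross-checked outside Saxon-CE. NFKD decomposes "ä" (U+00E4, 228) into "a" (97) followed by the combining diaeresis U+0308 (776). The following is a minimal sketch using Python's standard `unicodedata` module; it is not part of the original report, and the helper name `nfkd_codepoints` is made up for illustration.

```python
import unicodedata


def nfkd_codepoints(text):
    """Return the code points of `text` after NFKD normalization."""
    return [ord(ch) for ch in unicodedata.normalize("NFKD", text)]


# The input is spelled with an escape (\u00e4 = "ä") so the check does not
# depend on the encoding of the source file.
print(nfkd_codepoints("Eisb\u00e4r"))
# prints [69, 105, 115, 98, 97, 776, 114]
```

This agrees with the expected sequence above, so the fault lies in the Saxon-CE implementation of the function rather than in the expected result.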

The following XSLT was used in the test:

<xsl:template name="main" match="/">
   <xsl:result-document href="#main" method="append-content">
      <xsl:variable name="input" select="'Eisbär'"/>
      <xsl:variable name="normal" select="string-to-codepoints(normalize-unicode($input, 'NFKD'))"/>
      <p>String: <xsl:value-of select="$input"/></p>
      <p>To CodePoints result: <xsl:value-of select="$normal"/></p>
      <p>Expected: 69 105 115 98 97 776 114</p>
      <p>Round-trip codepoints-to-string: <xsl:value-of select="codepoints-to-string($normal)"/></p>
   </xsl:result-document>
</xsl:template>