Numbering errors using "Kanji Digits"

Added by Anonymous almost 16 years ago

Legacy ID: #5197982 Legacy Poster: Scott A. Colcord (sacolcor)

<http://www.saxonica.com/documentation/conformance/xslt20.html>, item #13 (xsl:number support), says that Saxon provides support for Kanji digits through codepoint (x4e00). The digits themselves are correct. However, Saxon (as of v9.0) appears to use them as though they were ordinary decimal digits, producing the following sequence:

1: 一
2: 二
3: 三
4: 四
5: 五
6: 六
7: 七
8: 八
9: 九
10: 一〇
11: 一一
12: 一二
13: 一三
14: 一四
15: 一五
16: 一六
17: 一七
18: 一八
19: 一九
20: 二〇
21: 二一
22: 二二
23: 二三

But in CJK languages, counting is done using a marking and grouping algorithm, and should, I believe, generate the following sequence:

1: 一
2: 二
3: 三
4: 四
5: 五
6: 六
7: 七
8: 八
9: 九
10: 十
11: 十一 (10 + 1)
12: 十二 (10 + 2)
13: 十三
14: 十四
15: 十五
16: 十六
17: 十七
18: 十八
19: 十九
20: 二十 (2 × 10)
21: 二十一 (2 × 10 + 1)
22: 二十二 (2 × 10 + 2)
23: 二十三 (2 × 10 + 3)

A description of this algorithm is at <http://www.w3.org/TR/css3-lists/#cjk-ideographic>. Note, however, that while the algorithm there appears to be correct, a message at <http://lists.w3.org/Archives/Public/www-style/2003Apr/0063.html> indicates that several of the CJK character sets on that page are not.

Would it be possible to use the CJK algorithm when numbering using codepoint U+4E00? Even just handling numbers 1-99 correctly would cover the majority of cases.

Thanks,
Scott Colcord (PTC/Arbortext)
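For illustration, the difference between the two behaviours for 1-99 can be sketched in Python. This is only a minimal sketch and not Saxon's implementation or the full CSS3 algorithm: the function names `positional_kanji` and `grouped_kanji` are hypothetical, and the grouped form simply follows the expected sequence listed in this post (no leading 一 before 十, no 〇 for a zero units digit).

```python
# Kanji digits 0-9, indexed by their numeric value.
KANJI_DIGITS = "〇一二三四五六七八九"
TEN_MARKER = "十"


def positional_kanji(n: int) -> str:
    """Digit-for-digit substitution, i.e. the behaviour reported above."""
    if n < 0:
        raise ValueError("n must be non-negative")
    return "".join(KANJI_DIGITS[int(d)] for d in str(n))


def grouped_kanji(n: int) -> str:
    """Marker-based numbering for 1-99 (assumed range for this sketch)."""
    if not 1 <= n <= 99:
        raise ValueError("this sketch only handles 1-99")
    tens, ones = divmod(n, 10)
    out = ""
    if tens > 1:
        out += KANJI_DIGITS[tens]  # multiplier, omitted when it would be 一
    if tens >= 1:
        out += TEN_MARKER          # the "ten" marker
    if ones:
        out += KANJI_DIGITS[ones]  # units, omitted when zero
    return out


if __name__ == "__main__":
    for n in (9, 10, 11, 20, 23):
        print(n, positional_kanji(n), grouped_kanji(n))
```

Extending this beyond 99 would need further markers (百, 千, 万) and language-specific rules, which are deliberately left out here.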


Replies (6)

RE: Numbering errors using "Kanji Digits" - Added by Anonymous almost 16 years ago

Legacy ID: #5198145 Legacy Poster: Michael Kay (mhkay)

Thanks for this information. I'll look into it. Michael Kay

RE: Numbering errors using "Kanji Digits" - Added by Anonymous almost 14 years ago

Legacy ID: #8691714 Legacy Poster: Scott A. Colcord (sacolcor)

Is there any further status on this? Should a tracking item be filed for it?

RE: Numbering errors using "Kanji Digits" - Added by Anonymous almost 14 years ago

Legacy ID: #8691907 Legacy Poster: Michael Kay (mhkay)

Thanks for the reminder. I appear to have dropped the ball on this one. It's back on the TODO list.

RE: Numbering errors using "Kanji Digits" - Added by Anonymous almost 14 years ago

Legacy ID: #8692425 Legacy Poster: Michael Kay (mhkay)

I've fixed this and I'm about to publish the fix as a patch in Subversion.

RE: Numbering errors using "Kanji Digits" - Added by Anonymous almost 14 years ago

Legacy ID: #8695165 Legacy Poster: Scott A. Colcord (sacolcor)

Thanks much; do you know when the next maintenance release of Saxon will be?

RE: Numbering errors using "Kanji Digits" - Added by Anonymous almost 14 years ago

Legacy ID: #8695229 Legacy Poster: Michael Kay (mhkay)

Re "next maintenance release": The 9.2 branch is very stable now, so unless any surprises crop up, I don't think another maintenance release will be needed for a couple of months. Probably 9.3 will come out first; we're working hard on finishing it.
