ArrayIndexOutOfBoundsException at net.sf.saxon.serialize.HTMLIndenter.comment
The attached files lead to this exception:
C:\test>java -jar saxon9ee.jar -s:test.xml -t -xsl:test.xsl Saxon-EE 126.96.36.199J from Saxonica ... java.lang.ArrayIndexOutOfBoundsException: 20 at net.sf.saxon.serialize.HTMLIndenter.comment(HTMLIndenter.java:242) at net.sf.saxon.event.ProxyReceiver.comment(ProxyReceiver.java:210) at net.sf.saxon.event.ProxyReceiver.comment(ProxyReceiver.java:210) at net.sf.saxon.event.ProxyReceiver.comment(ProxyReceiver.java:210) at net.sf.saxon.event.SequenceNormalizer.comment(SequenceNormalizer.java:214) at net.sf.saxon.event.ProxyReceiver.comment(ProxyReceiver.java:210) at net.sf.saxon.event.ComplexContentOutputter.comment(ComplexContentOutputter.java:576) at net.sf.saxon.tree.tiny.TinyElementImpl.copy(TinyElementImpl.java:474) at net.sf.saxon.tree.tiny.TinyDocumentImpl.lambda$copy$0(TinyDocumentImpl.java:297) at net.sf.saxon.om.SequenceIterator.forEachOrFail(SequenceIterator.java:128) at net.sf.saxon.tree.tiny.TinyDocumentImpl.copy(TinyDocumentImpl.java:297) at net.sf.saxon.trans.rules.DeepCopyRuleSet.process(DeepCopyRuleSet.java:63) at net.sf.saxon.trans.Mode.applyTemplates(Mode.java:495) at net.sf.saxon.trans.XsltController.applyTemplates(XsltController.java:746) at net.sf.saxon.s9api.AbstractXsltTransformer.applyTemplatesToSource(AbstractXsltTransformer.java:347) at net.sf.saxon.s9api.Xslt30Transformer.applyTemplates(Xslt30Transformer.java:311) at net.sf.saxon.Transform.processFile(Transform.java:1283) at net.sf.saxon.Transform.doTransform(Transform.java:815) at net.sf.saxon.Transform.main(Transform.java:80) Fatal error during transformation: java.lang.ArrayIndexOutOfBoundsException: 20
#1 Updated by Michael Kay almost 2 years ago
Thank you, problem reproduced. The problem occurs when doing serialization with method="html", indent="yes", and a comment appears as the content of an element at depth 20 (or 20*2^n for some n).
Saxon is using the classic technique of using an int array with manual doubling of the size when it fills, rather than using a Java Stack (and it's failing to check for the array being full on this particular path). I suspect that in prehistoric times there was a performance justification for this (it's a stack of integers, so we avoid the cost of boxing and unboxing), but I rather doubt whether that is true in modern Java releases, and it's probably not a performance-critical path anyway. So the right solution is probably to use a Java
But that's not the actual issue: the code for outputting comments and PIs is trying to establish whether it's within an inline element by looking at the element properties at level L, when the enclosing element is actually at level L-1. Since properties aren't removed from the stack on endElement(), this is actually looking at whether the last element actually at level L (perhaps a preceding sibling, perhaps a preceding cousin) was an inline element. So perhaps a simpler fix is just to add the "-1".
#3 Updated by Michael Kay almost 2 years ago
- Category set to Serialization
- Status changed from New to Resolved
- Assignee set to Michael Kay
- Priority changed from Low to Normal
- Applies to branch 9.9, trunk added
- Fix Committed on Branch 9.9, trunk added
Fixed by changing the code for comment() and processingInstruction() to access
propertyStack[level-1] with a check for
level==0. Can leave changing the stack implementation to another day.
Please register to edit this issue