Bug #5649
Closed
xmlresolver does not handle repeated whitespace in public identifiers when using XML catalog
Description
Please see the attached zip file.
It contains an XML file and a catalog.
The public identifiers in both the XML file and the catalog contain a doubled whitespace character.
Saxon 11.2 is unable to parse the XML:
[latternm@somwhere saxon11catalog]$ java -jar /path/to/saxon-ee-11.2.jar -s:xml.xml -xsl:xsl.xsl -catalog:catalog.xml
I/O error reported by XML parser processing file:/home/latternm/saxon11catalog/xml.xml: /home/latternm/saxon11catalog/main.dtd (No such file or directory)
When I add an entry to the catalog with the whitespace collapsed, it works.
I think the resolver normalizes (collapses) whitespace in the input, but not in the catalog.
Of course, we can work around this problem easily; I am reporting it primarily because others may encounter the same issue when upgrading to Saxon 11.
https://www.oasis-open.org/committees/entity/spec.html
6.2. Public Identifier Normalization
In order to accurately and interoperably compare public identifiers, catalog processors must perform normalization on public identifiers in both the catalog and the input passed to them. All strings of white space in public identifiers must be normalized to single space characters (#x20), and leading and trailing white space must be removed.
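For illustration, the normalization described in section 6.2 could be sketched as below. This is only a minimal sketch, not the resolver's actual code: the class name `PublicIdNormalizer`, its methods, and the sample public identifier are all hypothetical, and it assumes "white space" means the XML characters space, tab, CR and LF. The point is that both the catalog entry and the identifier from the document go through the same function before they are compared.

```java
public final class PublicIdNormalizer {
    private PublicIdNormalizer() {}

    /**
     * Collapses every run of white space to a single #x20 and drops
     * leading and trailing white space (hypothetical helper).
     */
    public static String normalize(String publicId) {
        StringBuilder out = new StringBuilder(publicId.length());
        boolean pendingSpace = false;
        for (int i = 0; i < publicId.length(); i++) {
            char c = publicId.charAt(i);
            boolean isWhite = c == ' ' || c == '\t' || c == '\n' || c == '\r';
            if (isWhite) {
                // Remember a separator only if something was already emitted,
                // so leading white space never produces output.
                pendingSpace = out.length() > 0;
            } else {
                if (pendingSpace) {
                    out.append(' ');
                }
                pendingSpace = false;
                out.append(c);
            }
        }
        // A pending space at the end is never appended, which trims the tail.
        return out.toString();
    }

    /** Compares a catalog entry and an input identifier after normalizing both. */
    public static boolean matches(String catalogId, String inputId) {
        return normalize(catalogId).equals(normalize(inputId));
    }

    public static void main(String[] args) {
        // Hypothetical identifiers; the real ones are in the attached zip file.
        String fromCatalog = "-//EXAMPLE//DTD  Main//EN";   // doubled space
        String fromDocument = " -//EXAMPLE//DTD Main//EN "; // padded
        System.out.println(matches(fromCatalog, fromDocument)); // prints true
    }
}
```

If only the input side were normalized, the doubled space in the catalog entry would keep the two strings from matching, which is consistent with the behaviour I am seeing.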