Project

Profile

Help

Bug #5297

closed

String handling: performance

Added by Michael Kay over 2 years ago. Updated about 2 years ago.

Status:
Closed
Priority:
Normal
Assignee:
Category:
Performance
Sprint/Milestone:
-
Start date:
2022-02-09
Due date:
% Done:

100%

Estimated time:
Legacy ID:
Applies to branch:
11, trunk
Fix Committed on Branch:
11, trunk
Fixed in Maintenance Release:
Platforms:

Description

The forum post at https://saxonica.plan.io/boards/3/topics/8512 reveals some specific performance issues with string handling.

The escaping of HTML URIs is identified as a specific issue. Looking at the detail:

(a) The UnicodeBuilder class isn't designed to optimize character-by-character construction of a string, but it's often being used that way

(b) the specific case of HTML URI escaping would probably be better done using standard Java strings / stringBuilders anyway

(c) ZenoString.consolidate0() contains a call to verifySegmentLengths(), which is a diagnostic method to check the integrity of the data structure and should have been disabled in production code.

Please register to edit this issue

Also available in: Atom PDF