Subjects
Home
VOTE Move XML Commons to Xerces
Commented: (XERCESJ 589) Bug with pattern restriction on long strings
: Xerces J 2 8 1 Release on Wednesday, September 13th
: Xerces J 2 9 0 Release on Wednesday, November 22nd
Commented: (XERCESJ 1066) Restriction+choice+substitutionGroup error
Commented: (XERCESJ 1178) Error getting prefix for an attribute with no n
Updated: (XERCESJ 1244) XMLSchemaValidator does not contribute element 's
Some consideration about the xerces DOM implementation
Updated: (XERCESJ 1066) Restriction+choice+substitutionGroup error
Commented: (XERCESJ 1227) Poor performance / OutOfMemoryError for sequenc
retain exception stack traces
Updated: (XERCESJ 1193) NPE or hang when parsing using the "continue afte
Future of NekoHTML
Commented: (XERCESJ 1203) NPE in XMLDTDProcessor
DOM Level 3 APIs for Xalan J and a new Xalan release (2 7 1)
: xml commons external 1 3 04 Release on Wednesday, November 22nd
Commented: (XERCESJ 1247) Incorrect location information on SAX when usin
XInclude exceptions how to mirror Xerces J functionality into Xerces C++?
First proposal on SoC project "Add support for the StAX (JSR 173) cursor API
: xml commons resolver 1 2 Release on Wednesday, November 22nd
Typo in RangeToken java Please check
Validator features
java lang ClassCastException when adopting Node
using the org apache xerces impl xs identity package
Updated: (XERCESJ 1257) buffer overflow in UTF8Reader for characters out
Problem with ref attributes and schema validation
Updated: (XERCESJ 122) XMLSchemaValidator does not contribute element 's d
Performance problem under load Xerces with Weblogic 9 x
remove ignored memory allocation
Commented: (XERCESJ 1177) SAXXMLStreamReader doesn 't always report namesp
Commented: (XERCESJ 977) Null pointer exception during DOM parsing
Commented: (XERCESJ 1197) Code cleanup for org apache xml serialize
Commented: (XERCESJ 1201) Initial contribution for StAX Event API
Updated: (XERCESJ 1061) Regex "$ " and "^ " characters treated as special c
Commented: (XERCESJ 1199) SAXXMLStreamReader should attempt to register a
Commented: (XERCESJ 1061) Regex "$ " and "^ " characters treated as special
Updated: (XERCESJ 589) Bug with pattern restriction on long strings
StackOverflow
xerces Range unnecessarily not garbage collectable if not detached
Updated: (XERCESJ 1178) Error getting prefix for an attribute with no nam
Bug in xs:redefine
Commented: (XERCESJ 1204) Can not set XMLEntityResolver for LSParser
Updated: (XERCESJ 1253) Prototype for SoC2007 project "Add support for th
Updated: (XERCESJ 1259) Add SteamFilter Function to SoC2007 project "Add
Assigned: (XERCESJ 444) SAXException thrown by EntityResolver is reported
Google Summer of Code 2007
Xerces J and XInclude relative path issue
Assigned: (XERCESJ 206) Stack overflow when using a schema validation
Commented: (XERCESJ 1215) Restrictions involving two levels of substituti
Closed: (XERCESJ 1203) NPE in XMLDTDProcessor
non overriding equals methoda
Resolved: (XERCESJ 1079) invalid value returned for TOTALDIGITS facet in
Xerces AS3 port
Updated: (XERCESJ 325) Regular Expression; Pattern "| " clause order de
Updated: (XERCESJ 1196) Javadoc generation fails on Java SE 5 0
Closed: (XERCESJ 1202) DTD validation on XIncluded documents when the sch
Created: (XERCESJ 1124) Nonspecific schema error message
a bug in xerces
Updated: (XERCESJ 1201) Initial contribution for StAX Event API
Closed: (XERCESJ 1254) Empty uris in targetNamespace attribute not report
Links
Home
Oracle database error code
 
Search:  
Power your search with and, or, +, -, or "some phrase" operators.
Some consideration about the xerces DOM implementation

Some consideration about the xerces DOM implementation

2007-04-18       - By Dick Deneer
Reply:     1     2     3     4     5  

I am using the xerces parser and Dom implementation as the base of a  
simple XML editor.

The editor has a  source and a tree (DOM)  view.


Using the DOM in an editor requires other functionalitiy then just  
using a DOM  for showing or validating purposes.

In fact there are two issues  with the current implementation, making  
it very hard to use xerces for this purpose.

1. It may be clear that it is important that the user gets the same  
(or nearly the same) xml after switching from source to DOM and back.

2. The user don't want  unexpected additions or mutations in the DOM  
while editing or validating.


I will give some examples:

·      It is not possible store entity references in attribute values  
when building a DOM, making is impossible to get the original XML  
back after serializing. There is a jira issue for this problem, see  
https://issues.apache.org/jira/browse/XERCESJ-1225

·      The DOMNormalizer adds default required attributes to the DOM  
tree. Altough perfectly right according to the official  
specifications, this is not desired for the use in an editor.

·      First it will produce a error message "cvc-complex-type.4:  
Attribute 'xxxxx' must appear on element 'yyyyyy', which is in fact  
not valid anymore, because the problem is already solved by the  
DOMNormalizer, and secondly the XML is not the same anymore. These  
two things are confusing for the user.  At this moment it is not  
happening for elements , but It looks like the same behavior is going  
to be implemented.

·      Adding a textNode which is adjacent to another, will  
immediately be delete by the DOMNormalizer. Again, this may be  
perfectly right, but confusing for a user when editing in a DOM.  And  
it is not possible to validate the DOM without this "side-effect".



Feedback of what happend is given by all kind of document events,  
which is great to still give some explanation to the user and keep  
trees in sync.  But  I would  prefer to let some things not happen  
and be in control in the front.

I really hope that in the future xerces will bring more "features or  
properties" to control the behavior of the DOMParser and DOMNormalizer.


D. Deneer
<HTML><BODY style="word-wrap: break-word; -khtml-nbsp-mode: space; -khtml-line
-break: after-white-space; "><P style="margin: 0.0px 0.0px 16.0px 0.0px">I am
using the xerces parser and Dom implementation as the base of a simple XML
editor.</P><P style="margin: 0.0px 0.0px 16.0px 0.0px">The editor has a  source
and a tree (DOM)  view.</P><DIV style="margin-top: 0px; margin-right: 0px;
margin-bottom: 16px; margin-left: 0px; "> <BR class="khtml-block-placeholder"><
/DIV><P style="margin: 0.0px 0.0px 16.0px 0.0px">Using the DOM in an editor
requires other functionalitiy then just using a DOM  for showing or validating
purposes.</P><P style="margin: 0.0px 0.0px 16.0px 0.0px">In fact there are two
issues  with the current implementation, making it very hard to use xerces for
this purpose.</P><P style="margin: 0.0px 0.0px 16.0px 0.0px">1. It may be clear
that it is important that the user gets the same (or nearly the same) xml after
switching from source to DOM and back.</P><P style="margin: 0.0px 0.0px 16.0px
0.0px">2. The user don't want  unexpected additions or mutations in the DOM
while editing or validating.</P><DIV style="margin-top: 0px; margin-right: 0px;
margin-bottom: 16px; margin-left: 0px; "> <BR class="khtml-block-placeholder"><
/DIV><P style="margin: 0.0px 0.0px 16.0px 0.0px">I will give some examples:</P>
<P style="text-indent: -24px;margin-top: 0px; margin-right: 0px; margin-bottom:
16px; margin-left: 24px; "><FONT class="Apple-style-span" face="Symbol">·</FONT
>      It is not possible store entity references in attribute values when
building a DOM, making is impossible to get the original XML back after
serializing. There is a jira issue for this problem, see <A href="https:/
/issues.apache.org/jira/browse/XERCESJ-1225"><FONT class="Apple-style-span"
color="#000CEE">https://issues.apache.org/jira/browse/XERCESJ-1225</FONT></A><
/P><P style="text-indent: -24px;margin-top: 0px; margin-right: 0px; margin
-bottom: 16px; margin-left: 24px; "><FONT class="Apple-style-span" face="Symbol"
>·</FONT>      The DOMNormalizer adds default required attributes to the DOM
tree. Altough perfectly right according to the official specifications, this is
not desired for the use in an editor. </P><P style="text-indent: -24px;margin
-top: 0px; margin-right: 0px; margin-bottom: 16px; margin-left: 24px; "><FONT
class="Apple-style-span" face="Symbol">·</FONT>      First it will produce a
error message "cvc-complex-type.4: Attribute 'xxxxx' must appear on element
'yyyyyy', which is in fact not valid anymore, because the problem is already
solved by the DOMNormalizer, and secondly the XML is not the same anymore.
These two things are confusing for the user.  At this moment it is not
happening for elements , but It looks like the same behavior is going to be
implemented.</P><P style="text-indent: -24px;margin-top: 0px; margin-right: 0px
; margin-bottom: 16px; margin-left: 24px; "><FONT class="Apple-style-span" face=
"Symbol">·</FONT>      Adding a textNode which is adjacent to another, will
immediately be delete by the DOMNormalizer. Again, this may be perfectly right,
but confusing for a user when editing in a DOM.  And it is not possible to
validate the DOM without this "side-effect".</P><P style="margin: 0.0px 0.0px
16.0px 0.0px; font: 16.0px Helvetica; min-height: 19.0px"><BR></P><P style=
"margin: 0.0px 0.0px 16.0px 0.0px">Feedback of what happend is given by all kind
of document events, which is great to still give some explanation to the user
and keep trees in sync.  But  I would  prefer to let some things not happen and
be in control in the front.</P><P style="margin: 0.0px 0.0px 16.0px 0.0px">I
 really hope that in the future xerces will bring more "features or properties"
to control the behavior of the DOMParser and DOMNormalizer. </P><DIV style=
"margin-top: 0px; margin-right: 0px; margin-bottom: 0px; margin-left: 0px; font:
normal normal normal 16px/normal Helvetica; min-height: 19px; "><BR></DIV><DIV
style="margin-top: 0px; margin-right: 0px; margin-bottom: 0px; margin-left: 0px
; ">D. Deneer</DIV></BODY></HTML>