Valid XML characters


2003-01-03       - By Dima Gutzeit

Thanks for your answer.

Could you please provide me with the "legal" Unicode range for XML, so I
would know what to filter out?

Joseph Kesselman wrote:
> Subject: Re: Valid XML characters
> From: Joseph Kesselman <KESHLAM@(protected)>
> To: xerces-j-user@(protected).apache.org
> Date: Thu, 2 Jan 2003 23:27:28 -0500
>
> On Thursday, 12/26/2002 at 07:23 ZE2, "Dima Gutzeit" <DIMA@(protected)>
> wrote:
>> Sometimes when parsing XML files I get an error message (exception) about
>> "invalid Unicode characters", is there any way to filter those before
>> parsing?
>
> There's no way to do that within the parser. "If it contains illegal
> characters, it isn't XML" and the error messages are entirely correct.
>
> You could, of course, write your own stream filter and pass the data
> through that, then use its output as the input to the parser. That's
> fairly straightforward Java coding. The problem would be deciding what
> you're going to do with those characters when you see them -- if you just
> discard them you may be changing the meaning of the document, and if you
> turn them into some sort of private escape sequence only applications
> which understand that convention will be able to do anything with them.
> Fixing the source documents really is the cleanest answer.
>
> For what it's worth: It has been proposed that future versions of XML
> *may* relax the forbidden-character restrictions, but there's still no
> firm consensus on whether that change would be desirable or what version
> of XML it might find its way into.
>
> ______________________________________
> Joe Kesselman / IBM Research
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: xerces-j-user-unsubscribe@(protected)
> For additional commands, e-mail: xerces-j-user-help@(protected)

Regards,
Dima Gutzeit.
---------------------------------
MailVision LTD.
R&D Team.
Phone: 972 - 4 - 8508020
Fax: 972 - 3 - 9285149
http://www.mailvision.com
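
For reference, XML 1.0 defines a legal character (the Char production) as
#x9, #xA, #xD, #x20-#xD7FF, #xE000-#xFFFD or #x10000-#x10FFFF. Below is a
minimal sketch of the kind of stream filter suggested in the quoted reply,
assuming that silently dropping the offending characters is acceptable (as
noted there, doing so may change the meaning of the document). The names
XmlCharFilterReader, isLegalXmlChar and clean are hypothetical and are not
part of Xerces.

```java
import java.io.FilterReader;
import java.io.IOException;
import java.io.Reader;
import java.io.StringReader;
import java.io.UncheckedIOException;

/** Hypothetical helper: drops characters that XML 1.0 does not allow. */
class XmlCharFilterReader extends FilterReader {
    private static final int EMPTY = -2;
    private int pendingLow = -1;    // low surrogate of a valid pair, returned on the next read()
    private int lookahead = EMPTY;  // char read ahead after an unpaired high surrogate

    XmlCharFilterReader(Reader in) {
        super(in);
    }

    /** True if the code point matches the XML 1.0 Char production. */
    static boolean isLegalXmlChar(int cp) {
        return cp == 0x9 || cp == 0xA || cp == 0xD
            || (cp >= 0x20 && cp <= 0xD7FF)
            || (cp >= 0xE000 && cp <= 0xFFFD)
            || (cp >= 0x10000 && cp <= 0x10FFFF);
    }

    private int nextRaw() throws IOException {
        if (lookahead != EMPTY) {
            int c = lookahead;
            lookahead = EMPTY;
            return c;
        }
        return in.read();
    }

    @Override
    public int read() throws IOException {
        if (pendingLow != -1) {
            int c = pendingLow;
            pendingLow = -1;
            return c;
        }
        while (true) {
            int c = nextRaw();
            if (c == -1) {
                return -1;
            }
            if (Character.isHighSurrogate((char) c)) {
                int next = nextRaw();
                if (next != -1 && Character.isLowSurrogate((char) next)) {
                    pendingLow = next;  // valid pair: every supplementary code point is legal
                    return c;
                }
                lookahead = next;       // unpaired high surrogate is dropped; re-examine next
                continue;
            }
            if (isLegalXmlChar(c)) {    // unpaired low surrogates fail this test too
                return c;
            }
            // illegal character: skip it and keep reading
        }
    }

    @Override
    public int read(char[] buf, int off, int len) throws IOException {
        if (len == 0) {
            return 0;
        }
        int n = 0;
        while (n < len) {
            int c = read();
            if (c == -1) {
                break;
            }
            buf[off + n++] = (char) c;
        }
        return n == 0 ? -1 : n;
    }

    /** Convenience wrapper for in-memory strings. */
    static String clean(String s) {
        StringBuilder out = new StringBuilder(s.length());
        try (Reader r = new XmlCharFilterReader(new StringReader(s))) {
            int c;
            while ((c = r.read()) != -1) {
                out.append((char) c);
            }
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
        return out.toString();
    }
}
```

The filtered reader could then be handed to the parser, for example by
wrapping it in an org.xml.sax.InputSource. The sketch only overrides the two
read methods; skip() and mark() still go straight to the underlying reader.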
