UTF-8 Encoding 2003-03-28 - By Shekhar Karani
Hi
I am using the xerces 2.2.1 to parse XML documents. One of the XML documents has a hex character B6. This character is being treated as an invalid UTF-8 character by the parser. The parser gives the error "Invalid byte 1 of UTF-8 byte stream". However, the editor XML SPY version 5, accepts this character.
Please let me know what I need to do in my code to accept this character.
The archives on the mailing list are not accessible hence I am not sure if this question is present there.
Thanks Shekhar
<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.0 Transitional//EN"> <HTML><HEAD> <META content="text/html; charset=iso-8859-1" http-equiv=Content-Type> <META content="MSHTML 5.00.2314.1000" name=GENERATOR> <STYLE></STYLE> </HEAD> <BODY bgColor=#ffffff> <DIV><FONT face=Arial size=2>Hi<BR><BR>I am using the xerces 2.2.1 to parse XML documents. One of the XML <BR>documents has a hex character B6. This character is being treated as an <BR>invalid UTF-8 character by the parser. The parser gives the error <BR>"Invalid byte 1 of UTF-8 byte stream". However, the editor XML SPY <BR>version 5, accepts this character.<BR><BR>Please let me know what I need to do in my code to accept this <BR>character.<BR><BR>The archives on the mailing list are not accessible hence I am not sure <BR>if this question is present there.</FONT></DIV> <DIV><FONT face=Arial size=2> </DIV></FONT> <DIV><FONT face=Arial size=2>Thanks<BR>Shekhar<BR></FONT></DIV></BODY></HTML>
|
|