1 May 15:23
[jira] Created: (UIMA-387) XMI Serializer can write invalid control characters
XMI Serializer can write invalid control characters
---------------------------------------------------
Key: UIMA-387
URL: https://issues.apache.org/jira/browse/UIMA-387
Project: UIMA
Issue Type: Bug
Components: Core Java Framework
Affects Versions: 2.1
Reporter: Adam Lally
Fix For: 2.2
On 5/1/07, Leo Ferres <lferres@...> wrote:
> Hello,
>
> While trying to open an XMI file in XML view after processing, an
> error pops up telling me that there is an invalid XML character.
> The error comes from the SAX parser. Below is the stack trace. Thanks
> very much for your help,
>
Most control characters are not allowed in XML 1.0, even if they are
escaped as character references (&#xxx;). If your input document contains
such characters, the XMI CAS serializer writes them to the output XMI
document unchanged, so an XML 1.0 parser rejects the file.
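
To make the restriction concrete, here is a minimal sketch of a check against
the XML 1.0 "Char" production (tab, LF, CR, and the ranges U+0020-U+D7FF,
U+E000-U+FFFD, U+10000-U+10FFFF). The class and method names (XmlCharFilter,
isValidXml10, replaceInvalid) are made up for illustration; this is not the
UIMA serializer code, and it is not necessarily how the issue will be fixed.

```java
/**
 * Illustrative helper only: names are hypothetical and not part of UIMA.
 * Detects and replaces characters that XML 1.0 does not allow.
 */
public final class XmlCharFilter {
    private XmlCharFilter() {}

    /** Returns true if the code point matches the XML 1.0 "Char" production. */
    public static boolean isValidXml10(int cp) {
        return cp == 0x9 || cp == 0xA || cp == 0xD
            || (cp >= 0x20 && cp <= 0xD7FF)
            || (cp >= 0xE000 && cp <= 0xFFFD)
            || (cp >= 0x10000 && cp <= 0x10FFFF);
    }

    /** Returns a copy of text with every invalid code point replaced. */
    public static String replaceInvalid(String text, char replacement) {
        StringBuilder out = new StringBuilder(text.length());
        int i = 0;
        while (i < text.length()) {
            // codePointAt handles surrogate pairs; a lone surrogate comes
            // back as its own value and is rejected by isValidXml10.
            int cp = text.codePointAt(i);
            if (isValidXml10(cp)) {
                out.appendCodePoint(cp);
            } else {
                out.append(replacement);
            }
            i += Character.charCount(cp);
        }
        return out.toString();
    }

    public static void main(String[] args) {
        // A form feed (U+000C) and a bell (U+0007) are both invalid in XML 1.0.
        String dirty = "page\fbreak and bell" + (char) 0x07;
        System.out.println(replaceInvalid(dirty, ' '));
    }
}
```

Replacing with a space is just one possible policy; dropping the character or
failing with an error are equally reasonable, depending on whether the offsets
in the document text need to be preserved.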
I checked that if you edit the XMI document and change the first line to:
<?xml version="1.1" encoding="UTF-8"?>
The problem goes away, because XML version 1.1 does allow escaped