Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.
Comment: CRLF and CR behavior corrected.

...

Character #xD (Carriage Return or CR) is also mapped to the PUA as #xE00D#xA (Line Feed, or LF). The CR character is allowed in the textual representation of XML documents, but not is always converted to LF in the XML Infoset. That is, it is read by XML processors, but CRLF is converted to just LF, and CR alone is converted to LF. Daffodil is in a sense a different 'reader' of data into the XML infoset, so to be consistent with XML we map CR to LF. 

The pair CRLF when it appears within data (i.e., is not a delimiter) is treated as regular text characters, so the CR is converted to LF, and so CRLF will become LFLF. 

For illegal characters #xD800 to #xDFFF, these values are mapped to the PUA by adding #x1000 to their character code. So #xD800 maps to #xE800, and #xDFFF maps to #xEFFF.

...