Uploaded image for project: 'Daffodil'
  1. Daffodil
  2. DFDL-1541

xs:hexBinary with dfdl:lengthKind="delimited" should be restricted to SBCS (single byte character set)

XMLWordPrintableJSON

    • Icon: Bug Bug
    • Resolution: Unresolved
    • Icon: Normal Normal
    • never
    • None
    • Back End, Front End
    • None

      Say we have something like:

      <xs:element name="foo" type="xs:hexBinary" dfdl:occursCountKind="parsed" dfdl:separator="multi-byte utf-8 character" dfdl:encoding="UTF-8" />

      We currently don't allow this. But perhaps this should be allowed? Delimiters would be scanned using the specified encoding, and then the data up to that encoding would be converted to hexBinary data. Does it make sense to allow someone to specified a non byte size encoding. For example, a multi-byte UTF-8 character, or perhaps even non-byte-size encoding. If we allow non-byte-size encodings, is it then an error if the data consumed does not have a bitlength divisible by 8?

      The specification is not clear on how this should be handled. Right now, we just require that the encoding by ISO-8859-1 for delimited hex binary.

              Unassigned Unassigned
              slawrence Steve Lawrence
              Votes:
              0 Vote for this issue
              Watchers:
              2 Start watching this issue

                Created:
                Updated: