  1. Daffodil
  2. DFDL-1541

xs:hexBinary with dfdl:lengthKind="delimited" should be restricted to SBCS (single byte character set)

    Details

    • Type: Bug
    • Status: Open
    • Priority: Normal
    • Resolution: Unresolved
    • Affects Version/s: None
    • Fix Version/s: never
    • Component/s: Back End, Front End
    • Labels: None

      Description

      Say we have something like:

      <xs:element name="foo" type="xs:hexBinary" dfdl:occursCountKind="parsed" dfdl:separator="multi-byte utf-8 character" dfdl:encoding="UTF-8" />

      We currently don't allow this, but perhaps we should. Delimiters would be scanned using the specified encoding, and the data up to the delimiter would then be converted to hexBinary. Does it make sense to allow someone to specify an encoding that is not single-byte, for example one with multi-byte characters such as UTF-8, or perhaps even a non-byte-size encoding? If we allow non-byte-size encodings, is it then an error if the data consumed does not have a bit length divisible by 8?
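
To make the proposed behaviour concrete, here is a minimal Python sketch. It is not Daffodil code: the function names `split_delimited_hex` and `check_byte_aligned` are hypothetical, and searching for the encoded separator at the byte level (rather than decoding the payload as characters) is an assumption made for illustration only.

```python
from typing import List


def split_delimited_hex(data: bytes, separator: str, encoding: str = "utf-8") -> List[str]:
    """Split raw bytes on an encoded separator and return each field as hex text.

    Hypothetical helper: the separator is encoded once and matched as a byte
    sequence, so the binary payload itself is never decoded as characters.
    """
    sep_bytes = separator.encode(encoding)
    if not sep_bytes:
        raise ValueError("separator must not be empty")
    return [field.hex().upper() for field in data.split(sep_bytes)]


def check_byte_aligned(bit_length: int) -> None:
    """Raise if the consumed data cannot be expressed as whole bytes.

    This models one possible answer to the open question about
    non-byte-size encodings: treat a ragged bit length as an error.
    """
    if bit_length % 8 != 0:
        raise ValueError(
            "hexBinary needs a bit length divisible by 8, got %d" % bit_length
        )


# U+2016 (DOUBLE VERTICAL LINE) is three bytes in UTF-8: E2 80 96.
payload = bytes.fromhex("DEADBEEF") + "\u2016".encode("utf-8") + bytes.fromhex("00FF")
fields = split_delimited_hex(payload, "\u2016")
```

Note that a byte-level match can also fire on payload bytes that happen to equal the encoded separator, which is inherent to delimited binary data regardless of the encoding chosen.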

      The specification is not clear on how this should be handled. Right now, we just require that the encoding be ISO-8859-1 for delimited hexBinary.
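
One likely reason ISO-8859-1 is a safe choice here is that it maps every byte value to exactly one character, so arbitrary binary data survives a decode/encode round trip, whereas arbitrary bytes are often not valid UTF-8. The Python sketch below illustrates that property only; the helper names are hypothetical and this is not a description of Daffodil's scanner.

```python
def latin1_round_trips(raw: bytes) -> bool:
    """True if decoding and re-encoding as ISO-8859-1 preserves every byte."""
    text = raw.decode("iso-8859-1")
    # One character per byte, and the original bytes come back unchanged.
    return len(text) == len(raw) and text.encode("iso-8859-1") == raw


def decodes_as_utf8(raw: bytes) -> bool:
    """True if the bytes form a valid UTF-8 sequence."""
    try:
        raw.decode("utf-8")
        return True
    except UnicodeDecodeError:
        return False


all_bytes = bytes(range(256))
every_byte_survives = latin1_round_trips(all_bytes)
binary_is_valid_utf8 = decodes_as_utf8(bytes.fromhex("DEADBEEF"))
```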

              People

              Assignee: Unassigned
              Reporter: Steve Lawrence (slawrence)