Struct namespaces with Serde #218

Richterrettich · 2020-06-22T12:52:39Z

Hi,
it would be really cool to have a feature to define common namespaces for structs using serde. Something like this:

#[derive(Serialize, Deserialize)]
#[namespace(F,foourn)]
struct Foo {
  id: String,
  #[serde(flatten)]
  #[namespace(B,barurn)]
  bar: Bar
}

#[derive(Serialize, Deserialize)]
struct Bar {
  name: String,
  desc: String
}

resulting in:

<F:foo xmlns:B="foourn" xmlns:F="barurn">
      <F:id>123</F:id>
      <B:name>asdf</B:name>
      <B:desc>foobar </B:desc>
</F:foo>

Background

I sometimes face (old) API's that requrie XML to be structured in a way similar to the given example. They use namespaces to distinguish between entities to form some sort of inheritance tree.
A feature like this would make interacting with these kind of API's very easy.

mlevkov · 2020-06-28T08:01:05Z

I do not know how to handle this here, but I've used yaserde crate for such @Richterrettich

tafia · 2020-07-17T03:28:47Z

This would be a nice feature indeed. Unfortunately I don't have much time but I'd be happy to integrate it if someone finds the time to do it.

WhyNotHugo · 2023-02-18T18:04:03Z

I'd love to see this happen, but I've no idea where to even start. What kind of macro would namespace be in order for its data to somehow be accessible in the serde (de)serialization stage?

meghfossa · 2023-05-26T03:21:37Z

Is there any workaround for this, or : is not supported at all. I don't mind if the workaround approach is verbose or unintuitive. Right now, I can't parse any field with : in it's name.

WhyNotHugo · 2023-05-26T06:47:20Z

No workaround if you want serde. I wrote my own parser with https://docs.rs/quick-xml/latest/quick_xml/reader/struct.NsReader.html

Mingun · 2023-05-26T07:34:54Z

If you don't want to write deserialization code manually, you could look at xmlserde. Support for namespaces for serde is not an easy task, unfortunately.

dralley · 2023-07-26T16:57:00Z

Whoever picks this up, consider starting from #466

jespersm · 2025-01-02T23:36:58Z

I want to give this one a try.

I'm aiming for the following feature set:

Easy specification of namespace information into serde derive attributes
Ability to ensure space-efficient serialization (i.e. by explicitly pushing the namespace definitions to the start of the serialized XML)
Ser/de for xml:lang and xml:space
Support for custom ser/de of QNames, as used in e.g. XML Schema files, WSDL, and XSLT.

The main problem is that there aren't a lot of options in the serde's container, variant and field attributes that we can encode the namespace information into -- really only 'rename', like in the following suggested format (some have called it James Clark notation ):

    /// Type where one field represented by an attribute and one by an element
    #[derive(Debug, Deserialize, PartialEq)]
    #[serde(rename = "{urn:example:a}mixed-ns")]
    struct Mixed {
        #[serde(rename = "@{urn:example:b}float")]
        float: f64, // Note: It's an XML attribute
        #[serde(rename = "{urn:example:c}string")]
        string: String,
    }

Which could be used to deserialize XML like this:

<elements xmlns="urn:example:a" xmlns:bbb="urn:example:b" xmlns:ccc="urn:example:c" bbb:float="42.0">
  <ccc:string>answer</ccc:string>
</elements>

Note that to deserialize, you don't need to know the prefixes in advance. For seralizing, you don't either, but you may have preferences you want to express in the generated XML.

Repeating the name namespace over and over again is not pretty, I know.
However, alternatively, we'd need a separate procmacro and/or some gruesome linker-tricks to lookup the namespace per-type information (by type-id or similar), if I understand serde's architecture correctly -- I'd rather not go that way.

WhyNotHugo · 2025-01-03T00:28:57Z

This would imply that namespaces need to be known at compile time, and can't be read when deserialising either, right? I don't think this is a problem, but I still want to ensure that limitations are clear.

jespersm · 2025-01-03T11:28:47Z

This would imply that namespaces need to be known at compile time, and can't be read when deserialising either, right? I don't think this is a problem, but I still want to ensure that limitations are clear.

In the general case (deserialize instances of a known "ordinary" schema into Rust datastructures), you know the relevant namespaces in advance, and the prefixes do not matter.

In the case more creative uses of XML Namespaces, such as XML schema, WSDL and the like, you know the namespaces of the structures ahead of time, but to deserialize it properly, you need access to the namespaces of the content which the structures are describing.

Example:

<xsd:schema targetNamespace="http://www.example.com/items"
    xmlns:xsd="http://www.w3.org/2001/XMLSchema"
    xmlns:items="http://www.example.com/items"
    elementFormDefault="qualified">

    <xsd:element name="order">
        <xsd:complexType>
            <xsd:sequence>
                <xsd:element name="item" type="items:itemType" maxOccurs="unbounded" />
            </xsd:sequence>
        </xsd:complexType>
    </xsd:element>

    <xsd:complexType name="itemType">
        <xsd:simpleContent>
            <xsd:extension base="xsd:string">
                <xsd:attribute name="itemId" type="xsd:ID" />
            </xsd:extension>
        </xsd:simpleContent>
    </xsd:complexType>

</xsd:schema>

Note here how the value of /xsd:schema/xsd:element/xsd:complexType/xsd:sequence/xsd:element@type mentions a namespace prefix ("items") which is given in the schema instance (i.e. the schema document for 'order' and 'items'), but not part of the value space for XML Schema itself. This is crux of the fourth bullet point above, about deserializing (and serializing) QNames.

For XML files which are entirely mixed-form, like XSLT, I really can't tell if deserialization is a productive strategy to pursue. It would require doing everything in mixed mode, deserializing the "content" subtrees into an in-memory XML tree, while deserializing the XSLT elements themselves. All XPath expressions would need access to namespace mappings, since elements like <xsl:value-of select="//library:book/@isbn:isbn-number"/> can several different QNames which need to be parsed out carefully. Writing a custom parser using NsReader would likely be more productive in that case -- you can't be all things to all people.

Ideally, the namespaces could be made known to any custom Serialize or Deserialize you use, perhaps by somehow extending the contract or hooking into the NsReader state -- thread locals?

WhyNotHugo · 2025-01-03T15:10:24Z

WebDAV uses custom namespaces for properties defined by extensions.

A client might receive elements with unknown namespaces (which some other client created). But clients need dedicated support to do something useful with this data, so it's usually okay to ignore unknown elements with unknown namespaces.

I know of one tricky situation where different clients use different namespaces for the same property (a not-fully-standard one; calendar colour). But I guess that with the proposed implementation, a client could just serialise both variations into separate fields.

jespersm · 2025-01-03T16:46:50Z

WebDAV uses custom namespaces for properties defined by extensions.

Ouch, WebDAV is a dumpster fire of incompatibility, I just had a major run-in with it. I'll keep the WebDAV namespace in mind. Examples welcome.

Caellian · 2025-01-03T21:27:54Z

SVGs are similar with <metadata>.

Support for namespaces for serde is not an easy task, unfortunately.

It's not possible. Namely:

specification of namespace information into serde derive attributes

requires addition of namespaces to serde. This isn't some additional data that can be encoded in (de)serialized data, it's a parser/generator metadata, so it really needs to be supported by the serde itself (or another library).

Every element has to be aware of namespaces defined for it or any defined above it in the tree. Deserializer has to be able to provide the top-level/default namespace. And so on...

I saw that no serde issues mention namespaces so I created one: serde-rs#2877. I'm not 100% sure this issue will be accepted though because afaict, the problem requires a context-aware parser and serde wasn't designed for this.

jespersm · 2025-01-03T23:10:51Z

It's not possible. Namely:

specification of namespace information into serde derive attributes

requires addition of namespaces to serde. This isn't some additional data that can be encoded in (de)serialized data, it's a parser/generator metadata, so it really needs to be supported by the serde itself (or another library).

I'm sure it's doable, either with extra derives as already attempted, or by encoding the required namespaces into the renames as prototyped by me already.

But yes, it's a hard problem, with a diverse set of trade-offs.

Caellian · 2025-01-04T00:12:18Z

Not sure why, but adding it to name and working around that feels a bit hackish to me.

While creating the issue on serde I got an idea of storing additional metadata in (de)serializer as HashMap<TypeId, XMLSpecificTypeInfo>, maybe a trait that builds on top of serde infra could be added to specify attribute/inner and namespace.

jespersm · 2025-01-19T11:56:29Z

I've pushed my work in progress to https://github.com/jespersm/quick-xml/tree/serde_namespace_support - starting with the deserialization side of things, but it's not reviewable yet.

Mingun self-assigned this May 21, 2022

Mingun added enhancement serde Issues related to mapping from Rust types to XML namespaces Issues related to namespaces support labels May 21, 2022

This was referenced May 25, 2022

Specify enum variant on colo with serde #338

Closed

Design for namespaces #316

Closed

Serialisation: support for namespaces and declarations #282

Closed

JOSEPHGILBY mentioned this issue Aug 26, 2022

Proposal for serializing namespaces in serde addressing #218 #466

Closed

Mingun pinned this issue Oct 29, 2022

Mingun mentioned this issue Jan 6, 2023

Restore ability to deserialize attributes that represents XML namespace mappings (xmlns:xxx) #539

Merged

Mingun mentioned this issue Jan 27, 2023

Serde deserialization: Namespaces can lead to erroneous "duplicate field" errors #547

Closed

Mingun removed their assignment Feb 18, 2023

Mingun mentioned this issue Apr 12, 2023

problems with colon separators #591

Closed

Mingun mentioned this issue Jun 16, 2024

duplicate field @type when both default and xsi namespace attribute are present #757

Closed

Mingun mentioned this issue Sep 24, 2024

Document how to deserialize and serialize XML with namespaces using serde #803

Open

Mingun mentioned this issue Oct 15, 2024

(De)serializing xsi attributes (xsi:type, xsi:nil etc.) #822

Closed

zeenix mentioned this issue Nov 8, 2024

xmlgen should ignore non-supported XML w/ a warning dbus2/zbus#256

Open

jespersm mentioned this issue Jan 18, 2025

Impossible to serialize and/or deserialize reserved xml attributes. #841

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Struct namespaces with Serde #218

Struct namespaces with Serde #218

Richterrettich commented Jun 22, 2020

mlevkov commented Jun 28, 2020

tafia commented Jul 17, 2020

WhyNotHugo commented Feb 18, 2023

meghfossa commented May 26, 2023 •

edited

Loading

WhyNotHugo commented May 26, 2023 via email

Mingun commented May 26, 2023

dralley commented Jul 26, 2023

jespersm commented Jan 2, 2025 •

edited

Loading

WhyNotHugo commented Jan 3, 2025 via email

jespersm commented Jan 3, 2025 •

edited

Loading

WhyNotHugo commented Jan 3, 2025

jespersm commented Jan 3, 2025

Caellian commented Jan 3, 2025

jespersm commented Jan 3, 2025

Caellian commented Jan 4, 2025

jespersm commented Jan 19, 2025

Struct namespaces with Serde #218

Struct namespaces with Serde #218

Comments

Richterrettich commented Jun 22, 2020

Background

mlevkov commented Jun 28, 2020

tafia commented Jul 17, 2020

WhyNotHugo commented Feb 18, 2023

meghfossa commented May 26, 2023 • edited Loading

WhyNotHugo commented May 26, 2023 via email

Mingun commented May 26, 2023

dralley commented Jul 26, 2023

jespersm commented Jan 2, 2025 • edited Loading

WhyNotHugo commented Jan 3, 2025 via email

jespersm commented Jan 3, 2025 • edited Loading

WhyNotHugo commented Jan 3, 2025

jespersm commented Jan 3, 2025

Caellian commented Jan 3, 2025

jespersm commented Jan 3, 2025

Caellian commented Jan 4, 2025

jespersm commented Jan 19, 2025

meghfossa commented May 26, 2023 •

edited

Loading

jespersm commented Jan 2, 2025 •

edited

Loading

jespersm commented Jan 3, 2025 •

edited

Loading