[Pre-RFC] sum/union types

hamza1311 · February 10, 2023, 8:04pm

Summary

Introduce "sum" or "union" types. These types are defined as combination of multiple structs.

Motivation

At the moment, if there 2 types that share some properties, they have to duplicated.

struct First {
    one: String,
    two: u32,
    custom: bool,
}

struct Second {
    one: String,
    two: u32,
    foo: f64,
}

The fields one and two are duplicated between structs First and Second. This is hard to maintain and scales terribly.

One real-world example of this is typed HTML. There are over 140 HTML elements, with over 30 global attributes. Many other attributes are shared across elements. Defining such types can result in huge amount of duplicated code.

This is also a big problem in Rust UI frameworks: Ergonomic pattern for passing along on* callbacks to children · Issue #1533 · yewstack/yew · GitHub

Proposed solution

Introducing union types. The above snippet can also be written as:

struct Base {
    one: String,
    two: u32,
}

// typescript syntax
type First = Base & {
    custom: bool,
}

// could be Rust
#[extends(Base)]
struct Second {
    foo: f64,
}

// alternatively, however this requires a new keyword
struct Third extends Base {
    bar: HashMap<String, String>,
}

Going back to the HTML example from before, there could be a shared struct that multiple elements can branch off of.

Prior art

Typescript union types
Inheritance in OOP, but only for fields, not methods

scottmcm · February 10, 2023, 8:11pm

nit, but I'd suggest finding a different name for this. "Sum type" already has a programming meaning https://en.wikipedia.org/wiki/Sum_type: Rust enums are sum types. And of course Rust has unions, so these shouldn't be called that either.

kornel · February 10, 2023, 8:20pm

What's the goal here? Is this only meant to reduce boilerplate when defining structs, or is this inheritance with subclassing?

If there's subclassing and it supports &mut Base, then it will have the "object slicing" problem that C++ has.

This also needs consideration of private fields and invariants. It's easy if it required all fields to be public, but it would severely limit usefulness of the feature. For example, I don't think an actual HTML DOM could be well implemented if it could only have public fields, and nothing private, and no methods to manipulate the private fields.

There has been an alternative approach proposed:

github.com/rust-lang/rfcs

RFC: Delegation

rust-lang:master ← elahn:delegation2018

opened 10:07AM - 06 Apr 18 UTC

elahn

+741 -0

Syntax sugar for efficient code reuse via the composition pattern. Wrapper funct…ions are generated for a struct, delegating most or all of a trait’s `impl` block to a member field that already implements the trait. [Rendered](https://github.com/elahn/rfcs/blob/delegation2018/text/0000-delegation.md) ### Please Note: This RFC is a group effort from the Rust community. Whenever an issue is raised, please edit the [RFC draft](https://hackmd.io/ZUEHoEgwRF29hbcIyUXIiw?edit) to address it as best you can. If the design needs to be bikeshedded, please do so [on this internals thread](https://internals.rust-lang.org/t/new-rfc-for-delegation-anyone-interested-in-contributing/6644). Whenever an issue or question has been resolved, [please submit a PR to this RFC](https://github.com/elahn/rfcs/tree/delegation2018). --------- Thank you, everyone for your contributions, they’ve been a big help. If we continue this collaborative style throughout the RFC process, I’ve no doubt we can address any concerns that arise and get this puppy accepted!

hamza1311 · February 11, 2023, 12:54pm

I would say it's both. Ideally, we would need to be able to define the extended struct and be able to mutate it. I'm not familiar with C++ so I don't see how object slicing is a problem. If you provide any resources that explain it, that would be great.

To give a concrete example from Yew, if we are creating wrapper components, we need to define every single attribute in the properties struct for the component. Ref: Function Components | Yew and Properties | Yew.
Think of a <button> element and <MyButton> component. The component is wrapping the element with addition functionality/styles/etc. For MyButton to be able replicate functionality, it must define all the properties can be passed. This gets really cumbersome. It can be solved if <button> defined what it can take and <MyButton> extended from it. If we use proc-macros as they as today, we end up with hundreds of structs with hundreds of fields and that has a huge hit on the compile times.

One other solution that comes to mind is a (compiler built-in) attribute macro that just extends the struct with the new fields. This has the benefit of reducing code duplication but the downside of not allowing proc-macros to be able to know what the final will look like, so crates like typed-builder wouldn't work.

I hope that makes it clear what the goal is.

hydroper1 · February 11, 2023, 8:57pm

I was looking for similiar functionality. But I want to delegate from a trait, not from a struct. That is, something like:

trait C {
    fn common(&self) -> Arc<Common>;

    delegate * to self.common();
}

mgeisler · February 12, 2023, 10:32am

Note that this paragraph gives you a solution in today's Rust: let all elements have a common_attributes: CommonAttributes field

When I read your post, I was hoping to see a discussion of why this isn't enough functionality?

nielsle · February 12, 2023, 11:57am

Have a look at the way Servo handles inheritance. It may provide some inspiration

github.com

servo/servo/blob/master/components/script/dom/bindings/inheritance.rs

/* This Source Code Form is subject to the terms of the Mozilla Public
 * License, v. 2.0. If a copy of the MPL was not distributed with this
 * file, You can obtain one at https://mozilla.org/MPL/2.0/. */

//! The `Castable` trait.

pub use crate::dom::bindings::codegen::InheritTypes::*;

use crate::dom::bindings::conversions::get_dom_class;
use crate::dom::bindings::conversions::{DerivedFrom, IDLInterface};
use crate::dom::bindings::reflector::DomObject;
use std::mem;

/// A trait to hold the cast functions of IDL interfaces that either derive
/// or are derived from other interfaces.
pub trait Castable: IDLInterface + DomObject + Sized {
    /// Check whether a DOM object implements one of its deriving interfaces.
    fn is<T>(&self) -> bool
    where
        T: DerivedFrom<Self>,

This file has been truncated. show original

FZs · February 12, 2023, 4:25pm

You can store an instance of the base struct inside the child structs, and implement the Deref trait to return that. That way it will automatically upcast to its parent struct whenever necessary.

Doing this is a common practice for newtypes, but can be used to extend the struct with additional fields as well.

You can also have many structs, all Derefing to the same base struct.

Here's an example:

Now the boilerplate becomes implementing Deref, but that can be eliminated by creating a macro for that.

I think this can do most of the things that you could do with subclassing.

Topic		Replies	Views
Pre-RFC: sum-enums language design	86	7944	April 16, 2019
Was this idea for struct/enum unification considered? ideas (deprecated)	7	1894	March 25, 2019
Sum types in product types language design	27	1209	February 11, 2024
Efficient code reuse via "type composition"? ideas (deprecated)	1	2017	March 25, 2019
Unify structs, tuples and funciton calls ideas (deprecated)	13	4636	March 25, 2019

[Pre-RFC] sum/union types

Summary

Motivation

Proposed solution

Prior art

Related topics