Proposal + Bikeshed: Rename C-like Enumeration

Havvy · November 9, 2017, 11:24pm

If there is no data attached to any of the variants of an enumeration and there is at least one variant then it is called a c-like enumeration.

This name is terrible. It refers to the C programming language’s enum construct and to the fact that, in some cases [1], such an enum can be passed to C through FFI. But in reality, it’s also useful in non-FFI code. You also have to know C’s enums to understand the name.

As such, I propose renaming this term to something more meaningful.

@scottmcm suggested “discriminant-only enumeration” on IRC when I asked last night, though I am opening this up for bikeshedding.

[1]: Specifically, when the enum has a C representation.

hanna-kruppe · November 9, 2017, 11:56pm

+1 from me on replacing the term.

To add another reason why this name is misleading: A C enum is mostly a typedef plus some named constants, i.e., an object of an enum type can hold any integer value. This is often used for bitflags, for example. In contrast, Rust’s C-like enums (even those with repr(C)) can only legally hold the discriminant values; everything else is UB. Therefore, using a C-like Rust enum in FFI to model a C enum is often outright wrong and risky even at best.
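
To make the bitflag point concrete, here is a minimal sketch of one common alternative: a newtype over the integer instead of a Rust enum. The C enum `Perm`, its constants, and the choice of `u32` as the underlying type are all hypothetical and only serve as illustration; the real underlying type of a C enum depends on the C compiler.

```rust
// Hypothetical C declaration being modelled:
//   enum Perm { PERM_READ = 1, PERM_WRITE = 2, PERM_EXEC = 4 };
// C code may return PERM_READ | PERM_WRITE (3), which is not a named constant.

/// Newtype over the integer: every bit pattern is a valid value,
/// so receiving 3 from C is fine (assumes the C side uses 32 bits).
#[repr(transparent)]
#[derive(Clone, Copy, Debug, PartialEq, Eq)]
pub struct Perm(pub u32);

impl Perm {
    pub const READ: Perm = Perm(1);
    pub const WRITE: Perm = Perm(2);
    pub const EXEC: Perm = Perm(4);

    /// Combine two flag sets.
    pub const fn union(self, other: Perm) -> Perm {
        Perm(self.0 | other.0)
    }

    /// True if every bit of `other` is also set in `self`.
    pub const fn contains(self, other: Perm) -> bool {
        self.0 & other.0 == other.0
    }
}

fn main() {
    let rw = Perm::READ.union(Perm::WRITE);
    println!("{:?} contains WRITE: {}", rw, rw.contains(Perm::WRITE));
}
```

Unlike a Rust enum, a value such as `Perm(3)` is perfectly legal here, at the cost of losing exhaustive matching.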

kornel · November 10, 2017, 12:05am

“discriminant” sounds very much like techno jargon to me. I haven’t seen that term used outside compiler-related discussions, so I doubt it’s any more informative to the average user than “C-like”.

The gotchas with #[repr(C)] are real though. Maybe, instead of changing the name, the UB should be removed? (i.e. a C-like enum with #[repr(C)] should really behave more like C).

Havvy · November 10, 2017, 12:16am

Wheee. The bikeshed fell onto IRC. (Using Enum instead of Enumeration in this list)

Numeric Enum
Scalar Enum
Integral Enum (probably bad)
Variant-only Enum (TRPL 2e calls them variants) (less jargony than discriminant)
Data-less Enum (a reversal of the -only suggestions)
Field-less Enum (does any other documentation call them fields?)

notriddle · November 10, 2017, 12:16am

Can’t be done in a sound, backwards-compatible way. You can match on a #[repr(C)] enum exhaustively, and if you do that then you can have the match expression evaluate to any type:

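use std::mem::transmute;

#[repr(C)]
enum Test {
    A = 1,
}

fn main() {
    // 3 is not a discriminant of Test, so this transmute is UB.
    let test: Test = unsafe { transmute(3i32) };
    // Exhaustive match: the compiler assumes `test` can only be Test::A.
    let result = match test { Test::A => 2 };
    println!("{}", result);
}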

What should the above code do? Because right now, it compiles and it invokes UB.

hanna-kruppe · November 10, 2017, 12:19am

I don’t see a realistic way to achieve that, but in any case that would be an entirely different discussion (see, e.g., the thread “C-style enums in FFI (and a proposal to lump in with unions)”).

kornel · November 10, 2017, 12:23am

AFAIK UB here is by Rust’s implementation choice, and can be eliminated by arbitrarily defining how other values are interpreted (for example, that the last match arm is taken). There’s no way to make it sensible, but at least it doesn’t have to cause nasal demons.

jmst · November 10, 2017, 11:53am

Could just panic/abort instead of UB (at a small performance cost).
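
This is not the compiler-inserted check suggested above, but a similar effect can be had in library code today by validating the integer before it ever becomes an enum. A minimal sketch, reusing the `Test` enum from the earlier example and assuming the raw value arrives as an `i32`:

```rust
use std::convert::TryFrom;

#[repr(C)]
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
enum Test {
    A = 1,
}

impl TryFrom<i32> for Test {
    // On failure, hand back the rejected raw value.
    type Error = i32;

    fn try_from(raw: i32) -> Result<Self, Self::Error> {
        match raw {
            1 => Ok(Test::A),
            other => Err(other),
        }
    }
}

fn main() {
    // Valid discriminant: converts cleanly.
    println!("{:?}", Test::try_from(1));
    // Invalid discriminant: an error value instead of UB.
    println!("{:?}", Test::try_from(3));
}
```

A caller who prefers the panic behaviour can call `.expect(...)` on the result; either way no invalid `Test` value is ever created, so no `unsafe` is needed.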

gasche · November 12, 2017, 11:34am

Other ideas:

simple enumeration
flat enumeration
immediate enumeration (a reference to compiler-slang using “immediate” for scalar values.)
numeration (these enumerations are isomorphic to an integer interval [1; n], except that names are more readable than hardcoded numbers)

(You can also use simple/flat/immediate for constructors/variants: a flat constructor takes no parameters, and a flat enumeration has only flat constructors.)

In type theory we say “sum type” for the disjoint-sum construction that underlies variants, and in ML languages we say “variant type” for what is called in Rust an “enumeration” (which is a sum-of-products type).

(Why is there a special requirement that c-like enumerations should have at least one variant? What goes wrong if we have empty c-like enumerations with no variants?)

FaultyRAM · November 13, 2017, 1:47am

A few ideas from me:

Tag-only enumerations
Trivial enumerations
Plain enumerations

retep998 · November 13, 2017, 8:52pm

#[repr(C)] discriminant-only enums are not necessarily the same size as the equivalent C enum. Even when they are the same size, holding any value other than the defined variants is UB, unlike the equivalent C enums which are often used as bitflags. Using Rust enums in FFI bindings is generally a bad idea.

Therefore I am strongly in favor of moving away from the “c-like enum” terminology.

QuietMisdreavus · November 14, 2017, 10:17pm

We had a quick chat about this idea at today’s docs-team meeting. Of the suggestions so far, we seem to like either “plain/simple enums” or “data-less enums”/“enums without fields”. The latter is pertinent because it was mainly proposed in terms of removing the need for a fancy name altogether. Enums are enums; some have fields and some do not. The ability to set numerical values to the discriminant in these enums is just a property of not having fields. (Or “data”, or whatever we’ve decided the general term for values other than the discriminant enclosed in an enum variant is.)

So, I guess, the options we’re narrowing it down to are:

“simple enums”, as a subset of “enums”
“enums with fields/data” as opposed to “enums without fields/data” (with “field-less”/“data-less” as a shorthand)

If it were strictly between those two, what would you pick?

hanna-kruppe · November 14, 2017, 10:31pm

+1 for “enums without fields”

sunjay · November 15, 2017, 5:01pm

Maybe “name-only” or “enums without fields”?

jesskay · November 15, 2017, 5:01pm

“data-less”/“field-less” (and their longer forms where more appropriate) strikes the ideal balance for me - it’s simpler and shorter, without needing the reader to figure out what “simple” actually implies (not that it’s a particularly hard one to figure out with minor context, but it still seems slightly worse than the term which outright says what it means)

bstrie · November 15, 2017, 5:02pm

Why does this even need an official name at all? We don’t call enums that benefit from NonZero “nonzero enums”; Rust does all sorts of enum optimizations under the hood, why privilege this one?

If we must call it something, avoid jargon like “discriminant” at all costs. “Simple enums” would suffice.
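
For readers unfamiliar with the NonZero optimization mentioned above, here is a small sketch. It uses the stable `std::num::NonZeroU32` type, which postdates this thread, so treat the exact type name as an assumption relative to the 2017 discussion.

```rust
use std::mem::size_of;
use std::num::NonZeroU32;

fn main() {
    // Option<NonZeroU32> reuses the forbidden value 0 to represent None,
    // so it needs no extra space for a tag.
    assert_eq!(size_of::<Option<NonZeroU32>>(), size_of::<u32>());

    // Option<u32> has no spare bit pattern, so it must store a tag as well.
    assert!(size_of::<Option<u32>>() > size_of::<u32>());

    println!("Option<NonZeroU32>: {} bytes", size_of::<Option<NonZeroU32>>());
    println!("Option<u32>: {} bytes", size_of::<Option<u32>>());
}
```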

Havvy · November 15, 2017, 5:10pm

I personally avoid terms like “simple” and “plain” to describe anything because they aren’t really self-describing. Granted, that’s an improvement over a misleading name.

We want a name for it because they interact with other features (e.g. casting to integers) in ways that enums with data do not. They need at least one variant because those without variants are called zero-variant enums, and we want to avoid creating inheritance hierarchies in names.
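
A quick sketch of the casting difference, with hypothetical `Direction` and `Shape` enums made up for illustration:

```rust
// An enum without fields: every variant is just a discriminant.
#[allow(dead_code)]
enum Direction {
    North,     // implicit discriminant 0
    East = 10, // explicit discriminant
    South,     // continues from the previous one: 11
}

// An enum with fields.
#[allow(dead_code)]
enum Shape {
    Circle(f64),
    Square(f64),
}

fn main() {
    // Enums without fields can be cast to integers with `as`.
    assert_eq!(Direction::North as i32, 0);
    assert_eq!(Direction::East as i32, 10);
    assert_eq!(Direction::South as u8, 11);

    // The same cast on an enum with fields is rejected at compile time:
    // let n = Shape::Circle(1.0) as i32; // error: non-primitive cast
    println!("casts succeeded");
}
```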

ciphergoth · November 15, 2017, 6:49pm

+1 to “flat enumeration” or “flat enum”

notriddle · November 15, 2017, 7:30pm

Instead of saying “+1,” let’s use an actual poll:

simple enum
degenerate enum
flat enum
discriminant-only enum
numeric enum
scalar enum
integral enum
variant-only enum
immediate enum
trivial enum
tag-only enum
plain enum
field-less enum (enum without fields)
data-less enum (enum without data)
c-like enum

Witolve · November 15, 2017, 8:01pm

+1 for c-like enum, though I see that only the first few have some votes. Gotta go now because I need to check the site. Said I need to take because of my health problems, but I will come back with some questions about some of them. Could you help??
