Refactoring std for ultimate portability

jethrogb · November 3, 2016, 4:34pm

We could use weak linking here, and allow memchr to be overriden by libc at link time. I don't know which platforms support it.

brson · November 3, 2016, 8:46pm

I expect Error can be handled with the inner-type pattern, where pal_common defines a common Error type with less features than the one in std.

I'm not sure, but I suspect there are semantic differences that would be exposed.

This would be a reasonable approach.

It's hard to guess whether that is sufficient. io::Error is used for many purposes in std that are not associated with those traits.

It depends on what you mean by platform-specific. The example here showed that it is platform-specific in that the implementation uses platform-specific features.

Those features are just optimizations of convenience though because libc happens to have efficient implementations of memchr (and one other function I can't recall off-hand). There is nothing about the definition of C strings that is tied to an operating system (though the definition of c_char is ABI-specific, c_char is one of the few C types std defines), so one could implement them without calling into libc, and in that sense CStrings are platform-independent.

I'd be interested to know your motivation for not having CString (I inderstand not wanting to link to libc). If it's just because you don't need it that's fine.

There are various ways one could imagine not implementing and/or exposing CStrings. Scenarios are one.

(Looks like CStrings have been well discussed in this thread so I won't comment further).

grep for panic! is the simple answer. I'm not sure if there's any particularly better strategy, except where there are useful defaults. One must either stub out functionality or implement everything at once.

I mentioned this in passing as potential future work. Any appropriate division here will become more clear as the work progresses.

dckc · November 3, 2016, 9:39pm

Any chance of factoring out the parts of std that use ambient authority? I’ve been working on this since 2012, but not making much progress.

I’m interested to at least design a lint for it… with that and a call-graph tool, I think I could make good headway.

futile · November 3, 2016, 9:59pm

grep for panic! is the simple answer.

Using a custom macro like pal_default_panic! would make the situation even more obvious. panic! might be used in other parts of the code, so grepping would give more than the relevant results. But that's just a detail.

notriddle · November 4, 2016, 12:57am

unimplemented! already exists, and shouldn’t be used in std for anything other than the PAL defaults.

Ericson2314 · November 4, 2016, 10:41pm

Extrapolating from those two blanket impls, there's the general principle of defining functions with a minimal error, and using where std::io:Error: From<MyMinimalError> in the std wrapper.

I expect Error can be handled with the inner-type pattern, where pal_common defines a common Error type with less features than the one in std.

Maybe that's still needed, but works just fine in conjunction to mine. Impls like Cursor<T>, &mut [u8], Buf* and the other wrappers ought to be perfectly portable and live below the PAL. I think that just lives the standard streams of things in std::io to be defined in the PAL.

RalfJung · November 9, 2016, 10:06am

I feel I have to mention that PAL is a great name. It’s the German name for the SEP from the Hitchhiker’s Guide to the Galaxy, and indeed for most Rust programmers, the PAL will be somebody else’s problem.

naicode · November 10, 2016, 2:59pm

I would love to see a future where everyone uses rust, or at last one where it is much easier to port rust to new platforms .

Wrt.:

and

some bikshedding/idea

I think both can be done with some think like singular (module?) interfaces , basically interfaces, which have exact one implementation chosen at compiler time at any time (ignoring the "multiple versions of the same crate" case). They can't be used wrt. generics but are only meant to say (at module level?) that some think of a certain structure exists (applicable to structs, enums, modules etc. inclusive possible partial variations of them). A crate could require such a interface to be satisfied (through other crates, possible depending on this crate) or could provide a implementation satisfying.

EDIT: just to clarify singular interfaces are also not meant to be used as trait objects (they are no traits) and don't have to specify the complete public interface the implementation satisfying them has, partial is ok as long as it doesn't conflict.

And while I think such interfaces would be nice for some cases of dependency inversion in combination with platform specify code I'm not sure if I would want see usage of it outside of this cases. And both the UI case and the test harness case mentioned by @DanielKeep can be solved differently, I think.

jethrogb · November 10, 2016, 4:02pm

naicode:

I think both can be done with some think like singular (module?) interfaces , basically interfaces, which have exact one implementation chosen at compiler time at any time (ignoring the "multiple versions of the same crate" case). They can't be used wrt. generics but are only meant to say (at module level?) that some think of a certain structure exists (applicable to structs, enums, modules etc. inclusive possible partial variations of them). A crate could require such a interface to be satisfied (through other crates, possible depending on this crate) or could provide a implementation satisfying.

EDIT: just to clarify singular interfaces are also not meant to be used as trait objects (they are no traits) and don't have to specify the complete public interface the implementation satisfying them has, partial is ok as long as it doesn't conflict.

And while I think such interfaces would be nice for some cases of dependency inversion in combination with platform specify code I'm not sure if I would want see usage of it outside of this cases. And both the UI case and the test harness case mentioned by @DanielKeep can be solved differently, I think.

Related: Pre-RFC: providable

naicode · November 10, 2016, 4:55pm

Thanks @jethrogb. The main difference to my idea is, that my idea is not bound to traits and doesn't use impl. With this you can use it for parts, requiring structs. E.g. defining that there has to be a struct PathBuf which's (non trait) impl has a method fn new() -> PathBuf etc. Also that std::path::PathBuf should be a valid path to access it.

naicode · November 10, 2016, 5:06pm

is there a way, to make symbols public wrt. to building std. But private wrt. to using it. It probably don't work when rust parts of std are dynamically liked in, but if a single, possible dynamic linked in, std artifact is produced it could work, or? (I'm not saying we should do it, I just wonder if we could do it if it is needed)

jethrogb · November 10, 2016, 5:16pm

When building a shared object you can choose which symbols to expose. So yes, if you were to build the entire runtime into a single shared object a lot of internal symbols would disappear. However, you'd of course lose all the pluggability you'd gain from using separate objects.

susurrus · November 10, 2016, 6:05pm

Could this functionality be provided by an annotation, using the #[] syntax? Maybe like #[provides(item.path)] for the implementation and #[providable] for the definition.

Maybe I misunderstand those annotations (I don’t know what they’re called in Rust).

naicode · November 10, 2016, 8:11pm

I think so, you could see #[provides(item.path)] as some how analog to #[lang="some_lang_item"] but more general. An use #[providable] on a struct, struct+impl or trait. (through there might be some corner cases)

I will move this part of the discussion to the mentioned pre-rfc

Also while I think that this kind of mechanic can be a grate help for std and other bigger libraries where the logical seperation of components is not allways on the same plane as the os-specific/unspecific seperation, I don’t know if it would be good for the general rust language.

bjorn3 · November 10, 2016, 8:18pm

I think per function providables are a good idea.

jethrogb · December 2, 2016, 2:19am

@brson I’d like to keep moving on this. What would be a path forward on this? Reading this thread there weren’t really any objections to your proposal. Does someone just need to do the work to implement your proposal? Do we need to do some more design first? Do we need an RFC?

Yohanesu75 · December 7, 2016, 1:11am

Let’s do this so we can more easily add CloudABI to Rust

jethrogb · December 14, 2016, 5:23pm

@brson another ping now you’re back from your conference

brson · December 20, 2016, 1:41am

@jethrogb Hi, I’m so glad you are interested, and I’m sorry for not following up sooner. I don’t think there are any obstacles to proceeding, as refactoring in this direction seems fairly unobjectionable.

There are a few ways to proceed, but I think getting to the point where you can implement a custom std out of tree will be a long road.

For refactoring std, the easiest thing to get started is by cutting down entries in the tidy whitelist, where std is not using conditions in the right modules. These are easy patches to make and review.

Someone could also be bold and create the pal_common / pal_unix / pal_windows crates and wire them up to the build system. I suspect that would get past review with some debate. After that there will be a long slog of puzzling out how to pull pieces out. I mentioned some other next steps in my op. And remember my linked branch has uncommitted refactoring steps that, while bitrotted, demonstrate some sequences that work to untangle dependencies.

As far as your end goal of creating a std port that excludes large chunks of std, for now I would focus on creating a port that fits into the model described here, where the customization is at the sys/pal layer. Today, I would suggest that, instead of trying to compile out parts of std your port doesn’t use, settle for panicking instead. In the longer term, when std has scenarios, then we can actually remove the unsupported features. This might involve a custom platform under the libstd/sys directory, and ultimately a custom pal crate.

Ericson2314 · January 7, 2017, 8:40pm

@jethrogb Any progress? I’d love to help.

Topic		Replies	Views
A vision for portability in Rust	56	8881	March 25, 2019
Rust needs to run, and be easy to use, on every platform announcements	8	10387	March 25, 2019
Rust platform portability and maintenance	13	4225	March 25, 2019
Libsystem, or, the great libstd refactor libs	37	12451	March 25, 2019
[discussion / pre-pre-pre-RFC] out-of-tree platform support for libstd libs	2	880	March 25, 2019

Refactoring std for ultimate portability

Related topics