Where to document allocation size upper bound?

Question

Where to document allocation size upper bound?

joshlf opened this issue 10 months ago · comments

Joshua Liebow-Feeser commented 10 months ago

It's common knowledge that a "Rust object" or "Rust allocation" can't have a size which overflows isize. This is relied upon in a lot of APIs such as the raw pointer add method. However, the only place it seems to be documented as a general property is on the reference page for numeric types:

The isize type is a signed integer type with the same number of bits as the platform's pointer type. The theoretical upper bound on object and array size is the maximum isize value. This ensures that isize can be used to calculate differences between pointers into an object or array and can address every byte within an object along with one byte past the end.

I see a few issues with this description:

Being on the page for numeric types makes this text not that discoverable for people looking for general guarantees about types and allocations.
It uses imprecise language: what are "objects" and "arrays"? Is the latter used in the sense of the type system - a [T; N]?
Some allocations are neither native Rust objects nor native Rust arrays. E.g., the allocation backing a Vec doesn't have a type (at least not in its API). If it's guaranteed that &T can't refer to an object of more than isize bytes, then vec.as_slice() can't return a reference which violates this guarantee. However, that doesn't prevent the addresses of vec[0] and vec[N] from being more than isize bytes apart. Various Vec APIs strongly hint that this is impossible, but none actually guarantee it, and the Vec top-level docs make no guarantee about the interactions between various APIs (such as the vec[0]/vec[N] problem).

Based on my understanding of the current state of the language, here's a stab at a more complete set of guarantees; do these sound reasonable?

For all T, given t: T, the size of t is guaranteed to fit in an isize
For all T, given t: &T, the size of t's referent is guaranteed to fit in an isize
With respect to non-builtin types like Vec, I could see a few approaches:
- Each such type documents its own guarantees
- We define a formal notion of an "allocation", and document that t: T and t: &T are instances of allocations. Beyond that, we leave it up to non-builtin types to document their own guarantees by making reference to the docs for "allocation".
- We define a formal notion of an "allocation", and make it clear in the definition itself that it covers non-builtin things like Vec. That seems iffy; I'm not sure how you'd formally specify the set of objects that are covered by a definition like this (e.g., do we want to make guarantees about the memory backing a HashMap?).

cc @jswrenn

Ralf Jung · Answer 1 · Fri Sep 29 2023 05:50:16 GMT+0800 (China Standard Time)

Don't have time for a full response right now, but note that this is also documented in slice::from_raw_parts, since that is the most obvious place where someone would violate this guarantee.

Christopher Durham · Answer 2 · Fri Sep 29 2023 06:14:51 GMT+0800 (China Standard Time)

It may not be formalized yet, but in discussing I've been using the concept of a Rust Allocated Object to refer to the thing which exists in the abstract machine, provides memory addresses which can be pointed to, and defines what pointer offsets are "inbounds." The RAO refers just to the chunk of allocated memory; remember that memory is untyped and types only exist for typed copies between memory (as far as the AM is concerned).

The usual way to create a RAO is via std::alloc, and those APIs communicate the size <= isize::MAX requirement, and (mostly) enforce it via Layout¹. Memory which is allocated externally to Rust but is still dereferencable requires the FFI code to logically create a RAO; the implementation/# target likely provides some (usually implicit) way to promote a region of read/write system memory to a RAO. The limit is also mentioned in size_of_val_raw (unstable).

What we do need to document though is whether creating a RAO with size > isize::MAX is immediate UB or merely unsound, with UB occurring when doing a layout calculation / field projection of overlarge size. (Allocating such RAO is definitely UB with the std allocation functions, thus only possible via FFI.) For simplicity of the model, I would argue to make it immediate UB, and this might be required for targeting LLVM, which will happily merge two inbounds offsets assuming the combined offset won't overflow isize. Plus, such a large allocation can't even be addressed with 64-bit page tables.

I did this smile and made some of my own layout polyfilling code unsound in the process smile ↩

Joshua Liebow-Feeser · Answer 3 · Fri Sep 29 2023 06:31:40 GMT+0800 (China Standard Time)

It may not be formalized yet, but in discussing I've been using the concept of a Rust Allocated Object to refer to the thing which exists in the abstract machine, provides memory addresses which can be pointed to, and defines what pointer offsets are "inbounds." The RAO refers just to the chunk of allocated memory; remember that memory is untyped and types only exist for typed copies between memory (as far as the AM is concerned).

The reason I mention t: T and t: &T is that in a lot of the unsafe code I write, that's the starting point: I'm trying to do something with pointers that are derived from a T or &T, and I need to be able to say something like "safe Rust can't produce a T or &T larger than isize, so my code is guaranteed that all offsets.... blah blah blah". I don't actually care about the T itself, just the fact that Rust makes certain guarantees about all values or references-to-values.

For more context, the place this has come up for me recently is in this PR. It adds a Ptr<'a, T> type which is somewhere in between a NonNull<T> and a &'a T or &'a mut T in terms of its invariants. One of its invariants is that the referenced memory region has a size which fits in isize. This is required in places such as this one, where it's a prerequisite for the pointer add method. We need guarantees about T and &T in order to ensure that we're satisfying Ptr's invariants when we construct it here.

Christopher Durham · Answer 4 · Fri Sep 29 2023 07:07:11 GMT+0800 (China Standard Time)

It's unquestionably a soundness requirement that values described by a Rust type (including ?Sized ones) have a size <= isize::MAX. Doing size_of_val for an oversized type is documented to be UB.

I don't really know anywhere better to document this soundness requirement other than size_of_val_raw, slice_from_raw_parts, alloc/Layout, and perhaps the various feature(ptr_metadata) APIs. While the precise validity requirement for RAO carefully managed by pointer is technically undecided, the requirement on types/references is reasonably well documented in all the places it could potentially be violated.

If the primary question here is w.r.t. soundness guarantees, it could merit a docs issue, but it's not particularly actionable for UCG let alone T-opsem.

(But it doesn't hold if you don't point to an actual allocation; it's safe to construct a raw pointer to an oversized slice.)

Joshua Liebow-Feeser · Answer 5 · Fri Sep 29 2023 07:39:38 GMT+0800 (China Standard Time)

IMO it’s not sufficient to document it on specific APIs like size_of_val. There are plenty of ways code might need to rely on the size property besides specific APIs - e.g., knowing that two pointers which are both synthesized from references into the same T aren’t more than isize bytes apart. I’d advocate for there being a general guarantee somewhere that safe Rust will never produce a T or &T whose size exceeds isize.

…

On Thu, Sep 28, 2023 at 4:07 PM Christopher Durham ***@***.***> wrote: It's unquestionably a *soundness* requirement that values described by a Rust type (including ?Sized ones) have a size <= isize::MAX. Doing size_of_val for an oversized type is documented to be UB. I don't really know anywhere better to document this soundness requirement other than size_of_val_raw, slice_from_raw_parts, alloc/Layout, and perhaps the various feature(ptr_metadata) APIs. While the precise validity requirement for *RAO* carefully managed by pointer is technically undecided, the requirement on types/references is reasonably well documented in all the places it could potentially be violated. If the primary question here is w.r.t. soundness guarantees, it could merit a docs issue, but it's not particularly actionable for UCG let alone T-opsem. — Reply to this email directly, view it on GitHub <#465 (comment)>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/AAH7ML36WGQ7OWJXNSHDI7DX4X7KVANCNFSM6AAAAAA5LRMFEQ> . You are receiving this because you authored the thread.Message ID: ***@***.***>

Ralf Jung · Answer 6 · Fri Sep 29 2023 13:55:40 GMT+0800 (China Standard Time)

size_of_val seems like the natural place to put the answer to your first two questions. By saying that function will never return a value exceeding isize::MAX, you should have everything you need -- even if you don't physically call that function, you can now rely on the size of any object you hold where you could safely call that function to not exceed isize::MAX.

I'm not sure where to put the answer to the third question.

It may not be formalized yet, but in discussing I've been using the concept of a Rust Allocated Object to refer to the thing which exists in the abstract machine, provides memory addresses which can be pointed to, and defines what pointer offsets are "inbounds."

In #464 we are calling it an "allocation".

What we do need to document though is whether creating a RAO with size > isize::MAX is immediate UB or merely unsound, with UB occurring when doing a layout calculation / field projection of overlarge size.

I think it has to be UB; creating such an allocation (and giving Rust access to it) is a case of mutating the Rust-visible state in a way that is not possible from Rust. The Rust AM has an invariant that all allocations are at most isize::MAX in size; violating such an invariant must be immediate UB.

But of course one could say that when such an allocation is created, really it's just created with size isize::MAX, and the UB occurs on the first access outside that range.

Joshua Liebow-Feeser · Answer 7 · Tue Oct 10 2023 04:28:43 GMT+0800 (China Standard Time)

size_of_val seems like the natural place to put the answer to your first two questions. By saying that function will never return a value exceeding isize::MAX, you should have everything you need -- even if you don't physically call that function, you can now rely on the size of any object you hold where you could safely call that function to not exceed isize::MAX.

I don't think that's sufficient because size_of_val can panic. It's not documented on size_of_val itself, but size_of_val_raw's docs say:

an (unstable) extern type, then this function is always safe to call, but may panic or otherwise return the wrong value, as the extern type’s layout is not known. This is the same behavior as size_of_val on a reference to a type with an extern type tail.

I was originally going to put up a PR to add the following to size_of_val's docs:

/// # Safety
///
/// It is guaranteed that `size_of_val` will always return a value which fits in
/// an `isize`. `unsafe` code may rely on this guarantee for its soundness. Note
/// that this amounts to a guarantee that, for all types, `T`, and for all values
/// `t: &T`, `t` references a value whose size can be encoded in an `isize`. This
/// holds because, given a `t: &T`, it is always valid to call `size_of_val(t)`.

However, I realized that this argument is unsound: If size_of_val can panic, then given t: &T, you only know that if size_of_val(t) returns, it will return a value which fits in isize. But you don't know that it will return.

Ralf Jung · Answer 8 · Tue Oct 10 2023 14:30:31 GMT+0800 (China Standard Time)

However, I realized that this argument is unsound: If size_of_val can panic, then given t: &T, you only know that if size_of_val(t) returns, it will return a value which fits in isize. But you don't know that it will return.

So, we could say

When there are (unstable) extern types involved, the function may panic or otherwise return the wrong value.
If the function doesn't panic (even if extern types lead to a wrong value), we guarantee that the result fits in an isize.

Until the extern type situation is resolved, this seems the best we can do? Well actually we could do better, we could guarantee that it will panic (probably this has to be a non-unwinding panic) on extern type, and never just return nonsense. IMO that's what we should do, but currently the "panic" part hasn't been implemented yet I think.

Joshua Liebow-Feeser · Answer 9 · Thu Oct 12 2023 01:03:41 GMT+0800 (China Standard Time)

Unfortunately I don't think that's sufficient because there are cases where you never actually call size_of_val(t) - you just know that you could. If you actually called size_of_val(t), you could at least argue that your code would diverge rather than misbehaving. But in code that doesn't call it, you can't rely on that argument.

E.g., consider this type:

pub(crate) struct Ptr<'a, T: 'a + ?Sized> {
    // INVARIANTS:
    // - `ptr` addresses a byte range which is not longer than `isize::MAX`
    // (other invariants removed for brevity)
    ptr: NonNull<T>,
    _lifetime: PhantomData<&'a ()>,
}

It has this impl:

impl<'a, T: 'a + ?Sized> From<&'a T> for Ptr<'a, T> {
    fn from(t: &'a T) -> Ptr<'a, T> {
        Ptr { ptr: NonNull::from(t), _lifetime: PhantomData }
    }
}

In order to construct an instance of Ptr which satisfies the field invariant on the ptr field, we need a guarantee that NonNull::from(t) results in a pointer whose referent is a memory region whose length fits in isize. We'd like to say something like "we know t refers to a memory region of no more than isize::MAX bytes because we could call size_of_val(t), which in turn promises to return a size no greater than isize::MAX." However, that argument doesn't work if size_of_val can panic.

Given this limitation, I think we still need a separate location to document the size maximum (unless we can make a stable promise that size_of_val will never panic, in which case this type of reasoning would be sufficient).

Ralf Jung · Answer 10 · Thu Oct 12 2023 01:19:55 GMT+0800 (China Standard Time)

I see, fair. The argument here should be that there simply are no memory regions larger than isize, so that does not even involve size_of_val. But where could that be documented? The "alloc" module would make sense but that is really only about heap allocations and we are stating a fact about all allocations...

Joshua Liebow-Feeser · Answer 11 · Thu Oct 12 2023 03:31:56 GMT+0800 (China Standard Time)

But where could that be documented? The "alloc" module would make sense but that is really only about heap allocations and we are stating a fact about all allocations...

Yeah so I've been talking this over with @jswrenn, and the conclusion we've come to is that the thing that makes the most sense is to make it a bit validity constraint on &T for all T: ?Sized. Our rationale is that what we're trying to do is the following:

fn foo<T: ?Sized>(t: &T) {
    let ptr = NonNull::from(t);
    // SAFETY: <what do we write here?>
    unsafe { requires_ptr_whose_referent_size_fits_in_isize(ptr) }
}

We need to be able to make an argument whose premise is t: &T and whose conclusion is that t refers to no more than isize::MAX bytes. At first we considered a weaker guarantee like "safe Rust code will never produce a &T which refers to more than isize::MAX bytes", but this on its own isn't sufficient - it doesn't guarantee that unsafe code won't synthesize such a reference. We need to also ban unsafe code from doing this, which is basically what it means to have a bit validity constraint.

We're thinking something like:

For all T: ?Sized, it is unsound to produce a value, t: &T, whose referent is more than isize::MAX bytes in size. Unsafe code may assume that any such t: &T will refer to no more than isize::MAX bytes.

Ralf Jung · Answer 12 · Thu Oct 12 2023 03:39:29 GMT+0800 (China Standard Time)

"referring to more than isize::MAX bytes" is basically an ill-typed statement. At least if you mean "refer to" in the sense of "there is that much dereferenceable memory behind this pointer". This is independent of whether it's a raw pointer or a reference. There just can't be a contiguous memory range larger than isize::MAX.

A &[u8] with a size of more than isize::MAX is already invalid today because it is dangling, and dangling references are UB. So this doesn't need docs changes.

Joshua Liebow-Feeser · Answer 13 · Thu Oct 12 2023 03:42:54 GMT+0800 (China Standard Time)

There just can't be a contiguous memory range larger than isize::MAX.

I agree that this is true in practice, but is it guaranteed anywhere? IIUC that's exactly what we're trying to guarantee here.

A &[u8] with a size of more than isize::MAX is already invalid today because it is dangling, and dangling references are UB. So this doesn't need docs changes.

Oh interesting, I'm not sure where this comes from. How is such a reference dangling?

Ralf Jung · Answer 14 · Thu Oct 12 2023 03:44:55 GMT+0800 (China Standard Time)

Oh interesting, I'm not sure where this comes from. How is such a reference dangling?

It's dangling because there can't be a memory range large enough for it to point to that would make it non-dangling. :)

I agree that this is true in practice, but is it guaranteed anywhere? IIUC that's exactly what we're trying to guarantee here.

That's what I was asking above -- where should such docs go? This property has nothing to do with references so stating it about references makes no sense. It's a property about what the Rust Abstract Machine considers an "allocated object".

Ralf Jung · Answer 15 · Thu Oct 12 2023 03:46:47 GMT+0800 (China Standard Time)

We could add it here maybe? That defines "allocated object". If we say that allocated objects have a maximal size of isize::MAX that should basically cover it?

Joshua Liebow-Feeser · Answer 16 · Thu Oct 12 2023 03:56:30 GMT+0800 (China Standard Time)

Oh interesting, I'm not sure where this comes from. How is such a reference dangling?

It's dangling because there can't be a memory range large enough for it to point to that would make it non-dangling. :)

Ah gotcha :)

I agree that this is true in practice, but is it guaranteed anywhere? IIUC that's exactly what we're trying to guarantee here.

That's what I was asking above -- where should such docs go? This property has nothing to do with references so stating it about references makes no sense.

Yeah so I think we're actually on the same page here. I definitely agree that what we're really talking about is a property of "allocations" or "memory regions" or some similar concept. However, The Reference doesn't currently define these concepts, while it already defines the concept of a reference. Our thinking behind making this a bit validity invariant on &T is just that it'd require a much smaller change to The Reference as compared to introducing and defining an entirely new concept.

We could add it here maybe? That defines "allocated object". If we say that allocated objects have a maximal size of isize::MAX that should basically cover it?

Ah interesting, yeah that's a good start! I'll be honest it feels weird that that's in ptr rather than in the Reference since it's documenting a property that applies to all of Rust rather than just to things in the ptr module, but that's roughly what we need. In terms of content, do you think it'd be appropriate to just beef up that section by adding something like the following?

It is guaranteed that an allocated object never spans more than isize::MAX bytes. For all types, T: ?Sized, and for all t: &T, it is guaranteed that t refers to a subset of a single allocated object.

Obviously let me know if you think there are better words/phrases to use for concepts like "spans" and "refers to", etc.

Ralf Jung · Answer 17 · Thu Oct 12 2023 15:57:50 GMT+0800 (China Standard Time)

We have many things in the lib docs that probably should also be in the reference -- but my general assumption is that hardly anyone reads the reference, so libs docs is usually where improvements go. Ongoing examples of that are rust-lang/rust#115476 and rust-lang/rust#115577.

Joshua Liebow-Feeser · Answer 18 · Fri Oct 13 2023 04:34:45 GMT+0800 (China Standard Time)

We have many things in the lib docs that probably should also be in the reference -- but my general assumption is that hardly anyone reads the reference, so libs docs is usually where improvements go. Ongoing examples of that are rust-lang/rust#115476 and rust-lang/rust#115577.

Yeah that makes sense. Obviously completely orthogonal to the present discussion, but I wonder whether it'd be reasonable to put things in the Reference but then refer to them in the obvious places so they're just as discoverable.

It is guaranteed that an allocated object never spans more than isize::MAX bytes. For all types, T: ?Sized, and for all t: &T, it is guaranteed that t refers to a subset of a single allocated object.

Does this seem like reasonable language for the ptr module docs section on allocated objects? I can put up a PR.

Ralf Jung · Answer 19 · Fri Oct 13 2023 05:52:07 GMT+0800 (China Standard Time)

The first sentence sounds like something we could add there, yes.

The part about references is a consequence of that. That seems to be better located here (but that page doesn't seem to talk about validity requirements at all currently) and/or here. (And of course it should be generalized to also apply to &mut.)

Yeah that makes sense. Obviously completely orthogonal to the present discussion, but I wonder whether it'd be reasonable to put things in the Reference but then refer to them in the obvious places so they're just as discoverable.

Reasonable, yes. That's just less convenient to do.^^ It needs a reference PR, waiting for the submodule to be updated, and then a libs PR... and often it's also not at all clear where in the reference something would go. The standard library has these nice keyword and primitive type documentation pages, do we really want to duplicate all that in the reference? E.g. for unsafe we currently have some stuff in the reference and some in the keyword docs and they are covering the same material to a large extent and it's all rather messy...

Joshua Liebow-Feeser · Answer 20 · Fri Oct 13 2023 06:35:56 GMT+0800 (China Standard Time)

The first sentence sounds like something we could add there, yes.

Sounds good; put up a PR: rust-lang/rust#116675

The part about references is a consequence of that. That seems to be better located here (but that page doesn't seem to talk about validity requirements at all currently) and/or here. (And of course it should be generalized to also apply to &mut.)

Sounds good; put up a PR: rust-lang/rust#116677

Yeah that makes sense. Obviously completely orthogonal to the present discussion, but I wonder whether it'd be reasonable to put things in the Reference but then refer to them in the obvious places so they're just as discoverable.

Reasonable, yes. That's just less convenient to do.^^ It needs a reference PR, waiting for the submodule to be updated, and then a libs PR... and often it's also not at all clear where in the reference something would go. The standard library has these nice keyword and primitive type documentation pages, do we really want to duplicate all that in the reference? E.g. for unsafe we currently have some stuff in the reference and some in the keyword docs and they are covering the same material to a large extent and it's all rather messy...

Yeah, that's very understandable. I think my general worry is that, with concepts spread out across a lot of documentation, we risk losing track of what we've technically promised and thus breaking promises made in the past. The more that code authors have to language lawyer as opposed to just cite a straightforward guarantee made by documentation, the more likely that a guarantee is technically implied by the docs but not in a way that's obvious to editors of those docs or language/compiler authors. It'd be ideal if terms were formally defined where possible (where "formally" just means "a definition exists at a place that can be linked to") and uses of those terms were always linked so that broken links could be caught automatically. It sounds like we're far away from that state, though.

Ralf Jung · Answer 21 · Fri Oct 13 2023 13:43:41 GMT+0800 (China Standard Time)

Yeah, that's very understandable. I think my general worry is that, with concepts spread out across a lot of documentation, we risk losing track of what we've technically promised and thus breaking promises made in the past.

Fully agreed, I wasn't claiming that what we are doing is good.

Jubilee · Answer 22 · Fri Oct 13 2023 16:12:22 GMT+0800 (China Standard Time)

Documenting the keywords in std's API docs was a fairly arbitrary choice. They could just as easily be documented in the reference.

Joshua Liebow-Feeser · Answer 23 · Fri Oct 27 2023 10:52:09 GMT+0800 (China Standard Time)

Follow-up question that's related to this discussion: do we intend to guarantee that, for a slice DST, an instance with 0 elements always has a size which fits in isize::MAX? This came up today in rust-lang/rust#69835 (comment)

If we intend to guarantee this, where would be a good place to document it? I can put up a PR.

Matthew House · Answer 24 · Fri Oct 27 2023 12:48:44 GMT+0800 (China Standard Time)

As a note, I was performing some tests to ensure that the status quo is that 0-length slice ZSTs are always within isize::MAX bytes (i.e., both the header and the trailing padding are counted in the compiler's maximum-type-size check), but I unexpectedly ran into rust-lang/rust#117265, finding that padding isn't counted even for regular types.

Ralf Jung · Answer 25 · Fri Oct 27 2023 14:42:39 GMT+0800 (China Standard Time)

I don't know if we currently guarantee that (the layout computation code is largely a black box to me), but it does sound like a sensible thing to guarantee.

Joshua Liebow-Feeser · Answer 26 · Sat Oct 28 2023 03:56:55 GMT+0800 (China Standard Time)

I don't know if we currently guarantee that (the layout computation code is largely a black box to me), but it does sound like a sensible thing to guarantee.

Do you have opinions on where such a thing should be documented?

Ralf Jung · Answer 27 · Sat Oct 28 2023 04:01:29 GMT+0800 (China Standard Time)

The [T] type maybe? After all it's about types that have that as their tail.

Joshua Liebow-Feeser · Answer 28 · Wed Nov 01 2023 09:25:54 GMT+0800 (China Standard Time)

Okay, I put up a PR: rust-lang/rust#117474

Another place we could document it would be on the Dynamically Sized Types page in the Reference.

Ralf Jung · Answer 29 · Mon May 27 2024 20:43:09 GMT+0800 (China Standard Time)

@joshlf https://doc.rust-lang.org/nightly/std/ptr/index.html now documents that an allocation is at most isize::MAX bytes. Could you remind me again what the motivation is for further documenting in rust-lang/reference#1482 that the "minimal size" of all types is at most isize::MAX? That guarantee seems to be hard to write down precisely (since it depends on what exactly we monomorphize), so I am trying to figure out why that even is a question we have to make a guarantee about.

Joshua Liebow-Feeser · Answer 30 · Wed May 29 2024 09:04:34 GMT+0800 (China Standard Time)

TLDR: To support layout_for_ptr

Breadcrumbs (specifically look for isize):

rust-lang/rust#69835 (comment)
rust-lang/rust#69835 (comment)
rust-lang/rust#69835 (comment)
...which led to: rust-lang/rust#69835 (comment)

Matthew House · Answer 31 · Wed May 29 2024 10:23:49 GMT+0800 (China Standard Time)

Could you remind me again what the motivation is for further documenting in rust-lang/reference#1482 that the "minimal size" of all types is at most isize::MAX?

To lay it out explicitly: suppose you have a custom slice DST, and you want to find the byte offset of the unsized tail from the beginning of the struct.

Unfortunately, offset_of!() does not support DST tails: it only supports fields within the static prefix. Therefore, you must form a value of the DST somewhere in memory (of any length), take a pointer to the unsized tail, and manually compute the byte offset.

However, you cannot simply do this with a null DST pointer, since taking an invalid addr_of!() is forbidden. Therefore, you must create a valid allocation with the proper size.

However, you can't stably obtain a large-enough allocation without already having an instance, since that requires size_of_val(). Therefore, you must wait for size_of_val_raw() to stabilize, then use that on a null DST pointer to obtain the allocation size.

However, calling size_of_val_raw() is UB if the size of the entire value is greater than isize::MAX. Therefore, you must hope that if you make your null DST pointer as small as possible (length 0), it will be small enough to pass into size_of_val_raw(). Then you can use that to create an actual allocation, take the addr_of!() the unsized tail, and compute its byte offset.

(Note that this guarantee still isn't sufficient for dynamically constructing repr(Rust) slice DSTs of length greater than 0, since they can have an unbounded amount of padding!)

Ralf Jung · Answer 32 · Wed May 29 2024 19:45:56 GMT+0800 (China Standard Time)

Wow, what a rabbit hole.^^

To lay it out explicitly: suppose you have a custom slice DST, and you want to find the byte offset of the unsized tail from the beginning of the struct.

What exactly is a "custom slice DST"? We don't have custom DST. Do you mean an unsized type whose unsized tail is a slice?

Unfortunately, offset_of!() does not support DST tails: it only supports fields within the static prefix.

Since the alignment is statically known for slices, it should be fairly easy to add support for unsized fields with slice tail to offset_of!. If that avoids the need for rust-lang/reference#1482 then that honestly seems worth it.^^

However, calling size_of_val_raw() is UB if the size of the entire value is greater than isize::MAX. Therefore, you must hope that if you make your null DST pointer as small as possible (length 0), it will be small enough to pass into size_of_val_raw(). Then you can use that to create an actual allocation, take the addr_of!() the unsized tail, and compute its byte offset.

Ah, so the entire point of rust-lang/reference#1482 is to ensure that the precondition for size_of_val_raw holds?

That is easier to guarantee than rust-lang/reference#1482. For instance we could just say that if the slice metadata is 0, then size_of_val_raw is safe to call -- either the call will be optimized out entirely or the compiler guarantees that the total size fits in isize::MAX.

Jubilee · Answer 33 · Thu May 30 2024 11:59:38 GMT+0800 (China Standard Time)

agreed that "I want to do something that should be obvious with the standard library but I can't because (stuff)" seems like a request for a library enhancement first and a justification for that enhancement second.

Matthew House · Answer 34 · Fri May 31 2024 11:08:10 GMT+0800 (China Standard Time)

What exactly is a "custom slice DST"? We don't have custom DST. Do you mean an unsized type whose unsized tail is a slice?

Whatever you want to call it. I mean a user-defined type that is a DST with a (possibly wrapped) slice tail, not a type that is a DST with user-defined metadata. If such a type is repr(Rust), or if it is repr(C) and you do not know all of its fields, you end up with this dilemma, where you must know the layout beforehand to learn anything about the layout soundly. (Except that fields before the tail can now be queried with offset_of!().)

Ah, so the entire point of rust-lang/reference#1482 is to ensure that the precondition for size_of_val_raw holds?

That is the primary purpose, from my understanding of the zerocopy use case.

That is easier to guarantee than rust-lang/reference#1482. For instance we could just say that if the slice metadata is 0, then size_of_val_raw is safe to call -- either the call will be optimized out entirely or the compiler guarantees that the total size fits in isize::MAX.

Regardless of the wording, this only fixes the narrow case of determining the offset of the slice tail. With such a guarantee, one still can't soundly create repr(Rust) DSTs with slice tails of length greater than 0, except via unsizing coercions. To allow this, either the additional padding following the tail of a repr(Rust) DST could be determined by a specified algorithm (i.e., pad to the alignment of the prefix), or a fallible version of size_of_val_raw could be added (as I initially suggested in rust-lang/rust#69835 (comment)).

Ralf Jung · Answer 35 · Fri May 31 2024 13:48:12 GMT+0800 (China Standard Time)

Whatever you want to call it. I mean a user-defined type that is a DST with a (possibly wrapped) slice tail, not a type that is a DST with user-defined metadata.

I see, thanks. I usually call them something like "slice-tail DST" or "unsized type with a slice tail" or so. I don't know of short standard terminology for them.

Regardless of the wording, this only fixes the narrow case of determining the offset of the slice tail. With such a guarantee, one still can't soundly create repr(Rust) DSTs with slice tails of length greater than 0, except via unsizing coercions.

To create one without unsizing coercions, you need to make layout assumptions anyway, don't you?
If you are willing and able to do that, then you can compute the total size of the type as the size with length 0, plus N * the element size of the slice tail. Or does that not work because of things being rounded up to alignment somewhere?

With the length=0 guarantee, you can implement offset_of! in userland. I don't see how rust-lang/reference#1482 gives you any more than that. So if that's not sufficient, how would rust-lang/reference#1482 be sufficient?

Matthew House · Answer 36 · Sat Jun 01 2024 03:07:26 GMT+0800 (China Standard Time)

If you are willing and able to do that, then you can compute the total size of the type as the size with length 0, plus N * the element size of the slice tail. Or does that not work because of things being rounded up to alignment somewhere?

Indeed, padding is the problem, as I demonstrated at rust-lang/rust#69835 (comment). The layout for a DST with a slice tail looks like:

The fields of the static prefix, with possible padding before and between them.
At least enough padding to match the alignment of the slice tail.
The consecutive elements of the slice tail, with no padding between.
At least enough padding to match the alignment of the entire DST.

If the DST is repr(C), then the padding is as small as possible, so you can compute the overall size for any given length. However, if the DST is repr(Rust), then you obtain as few guarantees as possible.

Before offset_of!() was added, this technically meant that you don't know whether the tail appears before or after the prefix, since the layout could vary for each length. However, the stabilization of a constant offset_of!() for fields in the prefix implies that the tail is always at the end, even for repr(Rust) DSTs.

The bigger issue is the final padding after the slice tail. Since repr(Rust) has few guarantees, there is no guarantee on how little or much padding might appear in this position. The amount of padding also varies dynamically, depending on the slice length. Therefore, a malicious repr(Rust) layout algorithm could make every slice length past 1 require so much padding that the overall size would become greater than isize::MAX.

Thus, one solution would be to say that the padding after the slice tail (or perhaps any DST tail) is the minimum necessary to reach the alignment of the entire DST, even if it is repr(Rust). Combined with the ability to take the offset_of!() the tail, this would be sufficient to calculate the proper layout.

Ralf Jung · Answer 37 · Sat Jun 01 2024 05:17:18 GMT+0800 (China Standard Time)

Given that you're worried about trailing padding for the non-empty-slice case, how would rust-lang/reference#1482 help?

Christopher Durham · Answer 38 · Sat Jun 01 2024 06:26:31 GMT+0800 (China Standard Time)

(Using struct WithTail<T: ?Sized> { /* fields */, pub tail: T })

Unsizing coercions already require WithTail<[T; N]> and WithTail<[T]> to have identical layout, so forwarding that guarantee makes sense. Although I suppose that alone still doesn't preclude extra padding after the tail slice that monotonically increases¹ with slice length.

Existence of after-tail padding also raises an interesting follow-up question of if/when it is valid to split &mut WithTail<[T]> into (&mut WithTail<[T]>, &mut [T]) — if WithTail<[T]> has trailing padding that aliases the following T, that's a problem for the aliasing requirements.

For manipulation of user defined slice-tailed struct to be in any way reasonable, I think it would be a good idea to guarantee that the layout of WithTail<[T]> is exactly Layout::from_size_align(offset_of!(WithTail<[T; 0]>, tail), align_of::<WithTail<[T; 0]>>())?.extend(Layout::array::<T>(len))?.pad_to_align(), also extending that same guarantee to non-generic slice-tailed structures.

Where/how to communicate that guarantee I'm not sure, but that's effectively the exact guarantee needed to make manual manipulation of slice-tailed types possible.

Monotonicity is desirable to be able to slice &WithTail<[T; {N+M}]> to &WithTail<[T; N]>. ↩

Ralf Jung · Answer 39 · Sat Jun 01 2024 14:12:53 GMT+0800 (China Standard Time)

For manipulation of user defined slice-tailed struct to be in any way reasonable, I think it would be a good idea to guarantee that the layout of WithTail<[T]> is exactly Layout::from_size_align(offset_of!(WithTail<[T; 0]>, tail), align_of::<WithTail<[T; 0]>>())?.extend(Layout::array::(len))?.pad_to_align(), also extending that same guarantee to non-generic slice-tailed structures.

I don't think that property is even true. Consider (on x86-64)

#[repr(C)]
struct MyDST(u64, u8, [u8]);

This has alignment 8, and the last field is at offset 9. With length 0, the total size is 16. With length 7, the total size is still 16.

Christopher Durham · Answer 40 · Sat Jun 01 2024 15:29:26 GMT+0800 (China Standard Time)

I don't see any contradiction there. Note that I specifically built the prefix layout with size set as the offset of the tail field, not the size of the type with zero-sized tail. A zero length tail ends up as Layout(9, 8).extend(Layout(0, 1)).pad_to_align() == Layout(16, 8) and a 7 length tail Layout(9, 8).extend(Layout(7, 1)).pad_to_align() == Layout(16, 8).

Ralf Jung · Answer 41 · Sat Jun 01 2024 15:49:05 GMT+0800 (China Standard Time)

Oh I see, sorry. Makes sense. That is probably right. It does not answer my question about how the reference PR I keep citing helps.

Matthew House · Answer 42 · Sat Jun 01 2024 20:22:06 GMT+0800 (China Standard Time)

Given that you're worried about trailing padding for the non-empty-slice case, how would rust-lang/reference#1482 help?

It is necessary to determine the alignment of the entire DST. With repr(Rust), a struct can be arbitrarily more-aligned than its fields. (Or we might be writing a macro that isn't aware of all the fields in the prefix.) Therefore, to determine the alignment of the DST, align_of_val_raw() is needed. But that function has the same isize::MAX restriction that size_of_val_raw() does, so we need the guarantee that it's safe to probe the layout at length 0.

From there, the alignment of the entire DST, the byte offset of the tail, and the layout of the tail's elements are sufficient to compute the size of the entire DST for any length, assuming that trailing padding is minimized.

Ralf Jung · Answer 43 · Sun Jun 02 2024 14:04:44 GMT+0800 (China Standard Time)

Okay so there's still an assumption here that the alignment doesn't change as the length goes up?
That looks like the "size_of_val_raw is fine for length 0" approach would suffice, though? Above you seemed to say that is not the case. But then I still don't see how rust-lang/reference#1482 helps if "size_of_val_raw is fine for length 0" is not enough.

Honestly I'd rather spend all this time and effort on the proper solution(s) -- offset_of! for slice tail fields, and a checked layout computation method that takes <T as Pointee>::Metadata. (But adding "size_of_val_raw is fine for length 0" is relatively cheap so I am not opposed to that either.)

Matthew House · Answer 44 · Tue Jun 04 2024 00:41:47 GMT+0800 (China Standard Time)

That looks like the "size_of_val_raw is fine for length 0" approach would suffice, though? Above you seemed to say that is not the case. But then I still don't see how rust-lang/reference#1482 helps if "size_of_val_raw is fine for length 0" is not enough.

Sorry if I've been unclear. For the offset of slice tail fields (the zerocopy use case), either "proper offset_of!() support", "out-of-bounds addr_of!()", or "length-0 size_of_val_raw()" is sufficient. For constructing DSTs with not-completely-known layout, "fallible Layout::for_value_raw()" (i.e., your Metadata computation) is sufficient, and "length-0 align_of_val_raw()" + "minimal trailing padding" is also sufficient.

Ralf Jung · Answer 45 · Wed Jun 05 2024 17:23:12 GMT+0800 (China Standard Time)

Okay, so I propose someone prepare a PR for length-0 size_of_val_raw. offset_of on slice tail fields and fallible layout computation are reasonable feature requests; it may be worth filing issues for them.

Ralf Jung · Answer 46 · Sat Jun 08 2024 17:39:13 GMT+0800 (China Standard Time)

I have opened two PRs for these steps:

I will close the issue now, since the original question in the issue title has been answered and the issue description is quite outdated at this point. Please open a new issue if there is still an open question to resolve/track.

Where to document allocation size upper bound?

Footnotes

Footnotes