jamesmunns / bbqueue

An SPSC, lockless, no_std, thread-safe queue based on BipBuffers

bbqueue use on shared memory

timvisee opened this issue · comments

I'm looking into using this on shared memory for IPC. Do you have any input on this, whether this might work or not?

I couldn't really figure out whether the actual buffer is part of the struct (ConstBBBuffer) or whether it uses a pointer. If it is part of the struct, I think copying/transmuting the buffer onto shared memory would be all right, wouldn't it? Sorry for the possibly vague message, I'm somewhat uneducated on this topic.

Also, if this can easily be done, providing (unsafe) functions to initialize and attach to a bip buffer might be useful. I believe using a bip buffer on shared memory is quite a common use case.

Yes, the actual buffer is part of the struct: it's the buf field, whose A type parameter is a GenericArray<u8, N>, where N is a type-level integer that determines the size of the array.
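To illustrate the point about inline storage, here is a minimal sketch (not the real ConstBBBuffer, whose actual fields differ) of a struct whose backing array is a field rather than a pointer, so the bookkeeping and the data share one contiguous allocation:

```rust
// Hypothetical sketch: the storage array is an inline field of the struct,
// so the whole queue -- bookkeeping plus data -- is one contiguous object.
// The real ConstBBBuffer uses GenericArray<u8, N>; a const-generic array
// stands in for it here.
struct SketchBuffer<const N: usize> {
    buf: [u8; N], // storage lives inside the struct, no indirection
    write: usize, // bookkeeping fields sit alongside it
    read: usize,
}

fn main() {
    // The struct's size includes the N-byte array itself, confirming there
    // is no pointer to a separately allocated buffer.
    assert!(core::mem::size_of::<SketchBuffer<64>>() >= 64);
    println!("storage is inline");
}
```

This inline layout is what makes the "place the whole struct in shared memory" approach plausible: mapping the struct maps the data with it.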

I'm trying to do a similar thing, making the buffer use a specific SRAM at a specific address. I think I could do this with linker sections, but I don't want to use any of the SRAM memory for the bookkeeping variables.

I'm going to try to hack in the ability to create a bbqueue using a pointer+length. Hopefully we can figure it out and get it upstreamed.

I'm not 100% sure this would be a great idea with bbqueue as-is, though it may be possible with the correct cross-process synchronization. I admit I don't know what is necessary to achieve this, at least on hosted (Windows/Mac/Linux) platforms.

Currently, yes, the storage is part of the struct.

I did some experimentation, and it seems to work quite well (easily reaching 280 Gbps through it across processes, if you're wondering). I'm just placing the BBBuffer struct on shared memory and dereferencing it in a different process. This involves some unsafe logic, of course.

The only real problem I have is accessing the producer and consumer. You can't split twice, even though each of the producer and consumer ends up being used only once. This is because of the already_split flag.

Having two separate flags (one for the producer, one for the consumer) and an additional two functions would solve this. What do you think about this? May I implement this change? (It is, of course, quite specific to shared-memory usage.)
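The two-flag idea could be sketched as follows. This is a hypothetical illustration (SplitFlags, try_take_producer, and try_take_consumer are invented names, not bbqueue API): each side is claimed independently with its own atomic flag, so one process can take the producer while another later takes the consumer.

```rust
use core::sync::atomic::{AtomicBool, Ordering};

// Hypothetical sketch of per-side claim flags replacing a single
// already_split flag. Each side can be taken exactly once, independently.
struct SplitFlags {
    producer_taken: AtomicBool,
    consumer_taken: AtomicBool,
}

impl SplitFlags {
    const fn new() -> Self {
        Self {
            producer_taken: AtomicBool::new(false),
            consumer_taken: AtomicBool::new(false),
        }
    }

    // Returns true exactly once: the first caller claims the producer side.
    fn try_take_producer(&self) -> bool {
        self.producer_taken
            .compare_exchange(false, true, Ordering::AcqRel, Ordering::Acquire)
            .is_ok()
    }

    // Independent of the producer flag: claiming one side does not
    // prevent another process/thread from claiming the other.
    fn try_take_consumer(&self) -> bool {
        self.consumer_taken
            .compare_exchange(false, true, Ordering::AcqRel, Ordering::Acquire)
            .is_ok()
    }
}

fn main() {
    let flags = SplitFlags::new();
    assert!(flags.try_take_producer());
    assert!(!flags.try_take_producer()); // a second producer claim fails
    assert!(flags.try_take_consumer()); // the consumer side is unaffected
    println!("sides claimed independently");
}
```

The compare_exchange makes each claim a one-shot operation even under concurrent access, which is what the single already_split flag provides today for the combined pair.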

@timvisee sure! I have an issue for that already actually: #40

I'd definitely take a PR for this.

I've submitted a draft PR for this, see: #78

Wasn't aware of that issue, thanks for linking.

Hey @mattico and @timvisee, I'm currently working on the "Next Gen" version of BBQueue with const generics, and I was wondering if you would be okay with the "bring your own memory" constructor to require &mut [u8; N], instead of &mut [u8]. This would allow me to keep all of the array lengths compile-time known, which I think would reduce some potential overhead.

I wanted to see if this was acceptable for your use case, or whether you would prefer to have a &mut [u8] constructor.

I'm currently not actively working on a project that uses bbqueue, and I'm not sure whether that would be acceptable.

I do feel that a fixed-size array can be quite annoying to work with. This next gen version sounds great, though!

Would it be an option to support both? Or would that increase complexity a lot?

Right now I'm looking at the trait that would abstract over storage, and it seems that I have two options:

  • Always use *mut [u8; N], which means I can always keep the array length const-knowable
  • Always use my own "cell" that tracks a (*mut u8, usize), which means const-knowable items must always create this intermediary cell

I honestly don't know what kind of overhead this would introduce in practice, but I can't think of a way to "mix and match" these two approaches. The first option would look something like this:

use core::ptr::NonNull;

// BBHeader holds the queue's bookkeeping (read/write indices, flags).
trait BBGetter<const N: usize>: Clone {
    fn get_header(&self) -> &BBHeader;
    fn get_storage(&self) -> NonNull<[u8; N]>;
}

/// A backing structure for a BBQueue. Can be used to create either
/// a BBQueue or a split Producer/Consumer pair
pub struct BBBuffer<const N: usize, STO>
where
    STO: BBGetter<N>,
{
    buf: STO,
    hdr: BBHeader,
}

The only use case I can think of for the latter option would be a case where you need to create a bbqueue of a size that is determined at runtime, e.g. you ask the user to input "how large would you like your bbqueue"?

I think for all use cases where the size should be statically knowable, it would be reasonable to have a Try constructor (or helper function) that checks whether (&mut [u8]).len() is >= N, and returns an error if it is not. Basically something like this:

fn try_to_array<const N: usize>(buf: &mut [u8]) -> Option<&mut [u8; N]> {
    if buf.len() >= N {
        // SAFETY: we just checked the slice holds at least N bytes, and
        // [u8; N] has the same (trivial) alignment as [u8].
        Some(unsafe { &mut *buf.as_mut_ptr().cast::<[u8; N]>() })
    } else {
        None
    }
}

Or something to that extent, which I think should be sound.

EDIT: It seems like this behavior already exists as part of TryInto?

EDIT2: Yup, it already works.
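For reference, the standard-library conversion is slightly stricter than the sketch above: TryFrom/TryInto between &mut [u8] and &mut [u8; N] succeeds only when the slice length is exactly N, so a longer slice must be truncated to N bytes first. A small demonstration:

```rust
use core::convert::TryInto;

fn main() {
    let mut buf = [0u8; 10];

    // Slice down to exactly 4 bytes, then convert: this succeeds because
    // the lengths match.
    let slice: &mut [u8] = &mut buf;
    let arr: &mut [u8; 4] = (&mut slice[..4]).try_into().unwrap();
    arr[0] = 42;
    assert_eq!(buf[0], 42);

    // Converting the full 10-byte slice to &mut [u8; 4] fails cleanly,
    // since TryFrom requires an exact length match, not merely len >= N.
    let whole: &mut [u8] = &mut buf;
    let wrong: Result<&mut [u8; 4], _> = whole.try_into();
    assert!(wrong.is_err());
}
```

So the "len >= N" helper and the std TryInto impl differ only in whether surplus bytes are tolerated.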

So after talking to @Dirbaio, I realized a second motivating factor for the "Cell" approach: it would likely allow for a reduction in monomorphization bloat if you use the "borrowed storage" variant, which might be useful for some folks.

I'll probably plan on doing that for now. Thanks for the feedback @timvisee!

This would likely allow for a reduction in monomorphization bloat if you use the "borrowed storage" variant, which might be useful for some folks.

Is such bloat really an issue when you might only use a single array type (meaning the same size for all bbqueues), or am I missing something here? I'm not too familiar with that topic in the context of generics.

Appreciate you working on improving this, @jamesmunns! My use case only involves arrays whose size is fixed and known at compile time. You could perhaps use an inner function to work around the monomorphization bloat; you'll have a clearer idea of the implementation tradeoffs.
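The "inner function" workaround mentioned above could look roughly like this (a hypothetical sketch with invented names, not bbqueue code): the generic outer function is tiny and gets duplicated once per N, while the real work lives in a single non-generic inner function shared by every instantiation.

```rust
// Sketch of the inner-function trick for limiting monomorphization bloat.
// `fill::<4>` and `fill::<8>` are separate instantiations, but both are
// thin wrappers around the one shared, non-generic `inner`.
fn fill<const N: usize>(buf: &mut [u8; N], byte: u8) {
    fn inner(buf: &mut [u8], byte: u8) {
        for b in buf.iter_mut() {
            *b = byte;
        }
    }
    inner(buf, byte); // &mut [u8; N] coerces to &mut [u8]
}

fn main() {
    let mut a = [0u8; 4];
    let mut b = [0u8; 8];
    fill(&mut a, 0xAA); // two instantiations of `fill`,
    fill(&mut b, 0xBB); // but only one compiled copy of `inner`
    assert_eq!(a, [0xAA; 4]);
    assert_eq!(b, [0xBB; 8]);
}
```

Only the coercion from the sized array to a slice is monomorphized per size; the loop body is compiled once, which is the same effect the "Cell" storage variant is after.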