RethinkRobotics-opensource / rosnodejs

Client library for writing ROS nodes in JavaScript with nodejs

Home Page: http://wiki.ros.org/rosnodejs/overview


Bad performance when deserializing long built-in-type arrays

panshengjie opened this issue · comments

rosnodejs deserializes built-in-type array data one element at a time using deserializePrimitive(). Can we use TypedArrays instead?

As far as I'm aware, the uint8 arrays for pre-generated messages are already specially deserialized, so we'd just be extending that case.
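
For reference, a minimal sketch of the two strategies for a float32[] field (function names are illustrative, not rosnodejs internals), assuming a little-endian serialized Buffer:

      // Per-element deserialization: one bounds-checked read call per element.
      function readFloat32ArrayOneByOne(buf: Buffer, offset: number, len: number): number[] {
        const out: number[] = new Array(len)
        for (let i = 0; i < len; i++) {
          out[i] = buf.readFloatLE(offset + i * 4)
        }
        return out
      }

      // TypedArray view: zero-copy, but it aliases the buffer's memory and
      // only works when (buf.byteOffset + offset) is a multiple of 4.
      function readFloat32ArrayAsView(buf: Buffer, offset: number, len: number): Float32Array {
        return new Float32Array(buf.buffer, buf.byteOffset + offset, len)
      }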

Is this a duplicate of #117?

@chfritz Not the same. I think we should use TypedArrays everywhere they're supported; it would be a great performance improvement. But it may lead to compatibility issues.

A few things I was trying to determine (and hadn't managed to yet)...

  • Are TypedArrays strictly a superset of normal Arrays? That is, do they officially support all the same properties/methods as normal Arrays, just with additional ones?
  • How does this impact the older versions of Node that we currently support? Are there Babel plugins for it?
  • Do we need some flag to indicate that generated messages should deserialize to TypedArrays instead of normal Arrays? This probably depends on the previous two answers...

We'd also just need some perf tests to show the improvement we expect (mostly to guard against bugs in the implementation, assuming TypedArrays would be a performance improvement in general).
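
Something like this rough micro-benchmark could serve (sizes and labels are illustrative, not an actual rosnodejs test); the one-by-one loop pays a read call per element, while the TypedArray path is one view plus one bulk copy:

      const N = 1_000_000
      const buf = Buffer.alloc(N * 4) // zero-filled float32 payload, offset 0 so 4-byte aligned

      console.time('one-by-one')
      const plain: number[] = new Array(N)
      for (let i = 0; i < N; i++) plain[i] = buf.readFloatLE(i * 4)
      console.timeEnd('one-by-one')

      console.time('typed-array')
      const view = new Float32Array(buf.buffer, buf.byteOffset, N) // zero-copy view
      const typed = new Float32Array(view)                         // one bulk copy
      console.timeEnd('typed-array')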

It's quite certain that TypedArray is not a superset of normal Arrays; it was introduced for the binary operations that JavaScript previously lacked. Replacing all arrays with TypedArrays is impossible, especially arrays of objects.
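
A quick illustration of the mismatch:

      const ta = new Float32Array(2)
      // ta.push(1)           // TypeError at runtime: TypedArrays have no push()
      ta[5] = 1               // out-of-bounds write is silently ignored
      console.log(ta.length)  // 2: TypedArrays are fixed-length

      // And arrays of objects (e.g. geometry_msgs/Point[]) have no TypedArray
      // equivalent at all, so they must stay plain Arrays.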

A good option, I think, would be to introduce a flag that lets the programmer manually handle heavy binary data like image/video/sound.

Besides, heavy object arrays are still tough, typically LaserScan/PointCloud messages in ROS.

I started looking into doing this but discovered that TypedArray views require the byte offset to be a multiple of the element size (see: StackOverflow). In specific circumstances (e.g. for certain message definitions) it can be guaranteed that this holds, but turning on TypedArray deserialization for specific topics or messages and not others would probably be a more invasive change than what I'd been imagining...
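
The constraint in practice:

      const ab = new ArrayBuffer(16)
      const aligned = new Float32Array(ab, 4, 3)      // fine: 4 is a multiple of 4
      // const unaligned = new Float32Array(ab, 1, 3) // throws RangeError:
      // the byte offset must be a multiple of the element size (4 here)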

The general way around the byte-offset issue for ArrayBuffers seems to be using a DataView, but with that API we'd end up with basically the same code we currently have using Node.js Buffers (which are themselves Uint8Array views these days).
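
For example:

      const ab = new ArrayBuffer(16)
      const dv = new DataView(ab)
      const a = dv.getFloat32(1, true) // unaligned read is fine with DataView

      // The Node.js Buffer equivalent; Buffer.from(arrayBuffer) shares memory:
      const buf = Buffer.from(ab)
      const b = buf.readFloatLE(1)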

I'll look into it a bit more... I could be missing something in TypedArrays that would get around this, or I could be overcomplicating the per-topic deserialization case.

Any plans? This would be a substantial performance improvement. Node.js has supported ArrayBuffer/DataView for a long time now. I would prefer to receive an ArrayBuffer or DataView and do my own custom reading of the data rather than receive an Array with no control over the performance.

Another idea: a config option to rosnodejs like {maxBufferSize: 10000} could tell rosnodejs to allocate an ArrayBuffer of that size and, every time data is received, return views into that buffer for anything that fits within it, with the caveat that the data is overwritten on each message (the user must be sure to read any necessary data each time a message callback is called). This would be the most performant.
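
A minimal sketch of how that could look (maxBufferSize and deserializeInto are hypothetical here, not existing rosnodejs APIs):

      // One pre-allocated scratch buffer, sized by the proposed option; its
      // contents are only valid until the next message arrives.
      const scratch = new ArrayBuffer(10000) // rosnodejs would own this

      function deserializeInto(payload: Buffer): Uint8Array {
        if (payload.length > scratch.byteLength) {
          return new Uint8Array(payload) // oversized message: fall back to a fresh copy
        }
        const view = new Uint8Array(scratch, 0, payload.length)
        view.set(payload) // overwrite the shared buffer in place
        return view       // caller must consume this before the next callback
      }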

On my end, I'm doing the following, which is similar to what rosnodejs could do. Note that in my example it doesn't make rosnodejs any more efficient, only my own code, but it shows how a single ArrayBuffer can be re-used without allocating new memory:

      let imageData: Uint8Array
      let imageBlob: Blob
      let imageUrl: string
      let imageBlobParts: BlobPart[] = []
      const img = document.getElementById('cameraImage') as HTMLImageElement

      const INITIAL_SIZE = 1024 * 1024 // assumed starting capacity; grown below as needed
      let buffer = new ArrayBuffer(INITIAL_SIZE)

      const callback = (msg) => {
        if (imageUrl) URL.revokeObjectURL(imageUrl)

        const dataLength = msg.data.length

        if (dataLength > buffer.byteLength) buffer = new ArrayBuffer(dataLength + 10000)

        // TODO make an object pool of Uint8Array objects so we don't have to
        // re-allocate them when we encounter an image that had the same size as
        // a previous image.
        imageData = new Uint8Array(buffer, 0, dataLength)
        imageData.set(msg.data)

        imageBlobParts[0] = imageData // re-use objects as much as possible!
        imageBlob = new Blob(imageBlobParts, {type: 'image/jpeg'})
        imageUrl = URL.createObjectURL(imageBlob)

        img.src = imageUrl // render the image
      }

As you can tell there, there is a single spot in memory (a single ArrayBuffer), and each time I receive a message, I am re-using the memory and overwriting the values.

If I do the following, the performance is much worse (4x slower):

      let imageData: Uint8Array
      let imageBlob: Blob
      let imageUrl: string
      let imageBlobParts: BlobPart[] = []
      const img = document.getElementById('cameraImage') as HTMLImageElement

      const callback = (msg) => {
        if (imageUrl) URL.revokeObjectURL(imageUrl)

        // The Uint8Array constructor already copies when given a typed array,
        // so this allocates and copies anew on every message (very bad).
        imageData = new Uint8Array(msg.data)

        imageBlobParts[0] = imageData // re-use objects as much as possible!
        imageBlob = new Blob(imageBlobParts, {type: 'image/jpeg'})
        imageUrl = URL.createObjectURL(imageBlob)

        img.src = imageUrl // render the image
      }

In this last example we are not re-using memory, and the callback is 4x slower.

I think it would be great if rosnodejs would allow us to re-use memory and greatly improve performance.

Excluding on-the-fly messages, rosnodejs already avoids allocating additional memory for uint8 arrays. For most other message types, I think the current API does what users expect, though point clouds and some other data structures may have issues.

The easiest way around this is to allow users to do custom deserialization from the buffer. A longer-term solution could be another deserialization option that wraps DataView.
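
A hypothetical shape for such a hook (the rawMessage option is invented for illustration; it is not a real rosnodejs flag):

      import * as rosnodejs from 'rosnodejs'

      rosnodejs.initNode('/perf_listener').then((nh) => {
        // Instead of a parsed message object, the callback would receive the
        // raw serialized Buffer and read only the fields it needs.
        nh.subscribe('/points', 'sensor_msgs/PointCloud2', (buf: Buffer) => {
          const dv = new DataView(buf.buffer, buf.byteOffset, buf.length)
          // ... allocation-free reads via dv.getUint32/dv.getFloat32 ...
        }, { rawMessage: true }) // invented option
      })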

I'll open a PR tomorrow with my approach; it's 5x faster on topic listen.