matrix/mat64: question: re-use of memory for mat64.Vector pointers seems confusing

ChristopherRabotin opened this issue 7 years ago · comments

In the following code, it seems that vec1 is not re-initialized on each loop iteration. Is that correct, and if so, is that the intended behavior? I was incredibly confused by this behavior for several hours last night while trying to debug a piece of code.

The following tests were run with go version go1.7.1 linux/amd64.

Re-use of memory example

In the following, since vec1 is created inside the loop, I would expect each new iteration to start from scratch and overwrite the vec1 pointer entirely. However, it behaves as if vec1 were defined once, before the start of the loop.

Code

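package main

import (
	"fmt"

	"github.com/gonum/matrix/mat64"
)

func main() {
	// values is declared once, outside the loop.
	values := []float64{1, 2, 3}
	for i := 0; i < 3; i++ {
		vec1 := mat64.NewVector(3, values)
		vec2 := mat64.NewVector(3, []float64{4, 5, 6})
		vec1.SubVec(vec1, vec2)
		fmt.Printf("%+v\n", vec1)
	}
}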

Output

&{mat:{Inc:1 Data:[-3 -3 -3]} n:3}
&{mat:{Inc:1 Data:[-7 -8 -9]} n:3}
&{mat:{Inc:1 Data:[-11 -13 -15]} n:3}

Fix for previous situation

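package main

import (
	"fmt"

	"github.com/gonum/matrix/mat64"
)

func main() {
	for i := 0; i < 3; i++ {
		// values is now re-created on every iteration.
		values := []float64{1, 2, 3}
		vec1 := mat64.NewVector(3, values)
		vec2 := mat64.NewVector(3, []float64{4, 5, 6})
		vec1.SubVec(vec1, vec2)
		fmt.Printf("%+v\n", vec1)
	}
}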

Output

&{mat:{Inc:1 Data:[-3 -3 -3]} n:3}
&{mat:{Inc:1 Data:[-3 -3 -3]} n:3}
&{mat:{Inc:1 Data:[-3 -3 -3]} n:3}

Brendan Tracey · Answer 1 · Thu Feb 16 2017 06:19:42 GMT+0800 (China Standard Time)

It is the intended behavior. NewVector (as well as NewDense and the like) uses the slice you provide directly as its memory. That is, it does not allocate and copy.

What could be done to avoid confusion? It's a use case we need to support for a variety of reasons. It's also clearly documented; from NewVector: "If len(data) == n, data is used as the backing data slice."
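For readers who hit the same surprise, one call-site workaround is to hand the constructor a private copy of the slice. The following is only a sketch with plain slices: cloneFloats is a hypothetical helper (not part of mat64), and subInto stands in for vec1.SubVec(vec1, vec2). With mat64 the same idea would presumably read mat64.NewVector(3, cloneFloats(values)).

```go
package main

import "fmt"

// subInto stands in for vec1.SubVec(vec1, vec2): it writes a-b into dst.
func subInto(dst, a, b []float64) {
	for i := range dst {
		dst[i] = a[i] - b[i]
	}
}

// cloneFloats is a hypothetical helper that returns a private copy of s.
func cloneFloats(s []float64) []float64 {
	c := make([]float64, len(s))
	copy(c, s)
	return c
}

func main() {
	values := []float64{1, 2, 3}
	for i := 0; i < 3; i++ {
		v := cloneFloats(values) // fresh backing storage on every iteration
		subInto(v, v, []float64{4, 5, 6})
		fmt.Println(v, values) // prints "[-3 -3 -3] [1 2 3]" each time
	}
}
```

The cost is one extra allocation per iteration, which is exactly what the aliasing constructor lets callers avoid when they do want to reuse memory.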

Dan Kortschak · Answer 2 · Thu Feb 16 2017 07:01:02 GMT+0800 (China Standard Time)

The equivalent with plain Go slices is https://play.golang.org/p/M4l99eafAV

package main

import (
	"fmt"
)

func main() {
	values := [...]float64{1, 2, 3}
	for i := 0; i < 3; i++ {
		vec1 := values[:]
		vec2 := []float64{4, 5, 6}
		subVecInto(vec1, vec1, vec2)
		fmt.Printf("%+v\n", vec1)
	}
}

func subVecInto(dst, a, b []float64) {
	if len(dst) != len(a) || len(dst) != len(b) {
		panic("length mismatch")
	}
	for i := range a {
		dst[i] = a[i] - b[i]
	}
}

I think this is a commonly used and well documented approach within Go idiom. Maybe the documentation could be more explicit, though. I see that NewDense is actually less clear on this than NewVector, NewSymDense and NewTriDense, which all explicitly state that data/mat will be used (maybe changing mat -> data would be good too).

Brendan Tracey · Answer 3 · Fri Feb 17 2017 00:05:57 GMT+0800 (China Standard Time)

Yes. I wonder if there's some value in having a NewVectorFrom([]float64) where the data is copied? It is true that NewXxx goes against our normal convention that data is copied.

Dan Kortschak · Answer 4 · Fri Feb 17 2017 02:18:46 GMT+0800 (China Standard Time)

No sorry, I meant the docs differ. I think the behaviour is correct here.

Brendan Tracey · Answer 5 · Fri Feb 17 2017 02:20:15 GMT+0800 (China Standard Time)

Sorry, I agree the behavior is correct, but I was wondering if there's value in having a second function which copies the data instead.
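To make the two options concrete, here is a rough sketch of an aliasing constructor next to a copying one. Everything in it is hypothetical: vector, newVector, newVectorFrom and scale are illustrative stand-ins written for this example, not the mat64 types or API, and no copying constructor was added as a result of this thread.

```go
package main

import "fmt"

// vector is a hypothetical stand-in for mat64.Vector, not the gonum type.
type vector struct {
	data []float64
}

// newVector mimics the aliasing behaviour discussed in this thread:
// data is used directly as the backing slice.
func newVector(n int, data []float64) *vector {
	if len(data) != n {
		panic("length mismatch")
	}
	return &vector{data: data}
}

// newVectorFrom sketches the proposed copying variant.
func newVectorFrom(data []float64) *vector {
	c := make([]float64, len(data))
	copy(c, data)
	return &vector{data: c}
}

// scale multiplies every element in place.
func (v *vector) scale(f float64) {
	for i := range v.data {
		v.data[i] *= f
	}
}

func main() {
	shared := []float64{1, 2, 3}
	newVector(3, shared).scale(2)
	fmt.Println(shared) // [2 4 6]: the caller's slice was modified

	private := []float64{1, 2, 3}
	newVectorFrom(private).scale(2)
	fmt.Println(private) // [1 2 3]: the caller's slice is untouched
}
```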

Dan Kortschak · Answer 6 · Fri Feb 17 2017 02:22:36 GMT+0800 (China Standard Time)

No, I don't think so. Maybe there is something we could do in the docs or in the wiki documentation we had been planning.

Chris · Answer 7 · Fri Feb 17 2017 06:34:26 GMT+0800 (China Standard Time)

NewVector's doc has "data is used as the backing data slice". I guess that I didn't interpret this correctly when I read that a few months back. Maybe change "backing data slice" to "data storage", or just add "(i.e. operations will rewrite the initial slice)".

Dan Kortschak · Answer 8 · Fri Feb 17 2017 06:53:40 GMT+0800 (China Standard Time)

I think taking wording from the slice blog post and slice internals blog post would be the best. The semantics are identical as seen above.
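For reference, the slice semantics in question can be shown in a few lines. This is a small illustration rather than wording taken from those posts, and detach is a hypothetical helper name: a slice is a view onto an underlying array, so writes through the view reach the array, while a copy gets storage of its own.

```go
package main

import "fmt"

// detach is a hypothetical helper returning a copy of s with its own storage.
func detach(s []float64) []float64 {
	return append([]float64(nil), s...)
}

func main() {
	arr := [4]float64{1, 2, 3, 4}

	view := arr[1:3] // shares arr's storage: len 2, cap 3
	view[0] = 20
	fmt.Println(arr, len(view), cap(view)) // [1 20 3 4] 2 3

	own := detach(view) // independent storage
	own[0] = -1
	fmt.Println(arr, own) // [1 20 3 4] [-1 3]
}
```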