matrix/mat64: add `Exp' to TriDense

Question

matrix/mat64: add `Exp' to TriDense

ChristopherRabotin opened this issue 8 years ago · comments

Upper and lower triangular matrices can be assimilated to dual numbers, herein represented as a+b\epsilon. There's an interesting property with dual numbers where exp(b\epislon)=1+b\epsilon (where \epsilon is a N dimensional representation of a dual number), cf. this paper. This can be trivially grown to any (2n)x(2n) triangular matrix, but is not trivially applicable to triangular matrices of even rows/columns.

Hence, implementing Exp on the Triangular struct would ensure an extremely simple computation as described below in pseudo code:

let A be a lower triangular;
if diagonal is non-nil:
    extract diagonal as a scaled identity matrix
    expDiag <- trivially compute its exponential;
endif
expLower <- expDiag * (identity + A)

If the matrix is an upper triangular, perform the same computation on the transpose of A (and return the transpose).

If there is no objection to this feature, I volunteer to implement it.

Edit: I forgot to mention that trivial operations also include sums, products and divisions.

Second edit: Looking a bit more into what is currently available in TriDense, I see that part of what is in this issue would involve enhancing MulTri for the specific case where there is an even number of rows/columns.

Brendan Tracey · Answer 1 · Fri Dec 30 2016 21:25:08 GMT+0800 (China Standard Time)

I don't understand what's being suggested. The original title is to add Exp to Triangular. That seems fine. By "arithmetic functions" do you mean Add etc? That would be fine too, though it would probably better to have a separate tracking issue.

Is the part about dual numbers an implementation suggestion for Exp, or a suggestion to modify the implementation of TriDense to support the use case of a triangle of dual numbers?

(I do have https://github.com/btracey/hyperdual if you need it)

Chris · Answer 2 · Sat Dec 31 2016 01:44:10 GMT+0800 (China Standard Time)

The suggestion is only to add an Exp function to TriDense. Subsequently, I think it could be interesting to add Sub to TriDense and slightly modify Add of TriDense to simplify the operation if the provided matrix has an even number of rows.

There will be no support for dual numbers, I'll just be using some of their properties as a tool to simplify the number of operations needed.

(P.S.: I think there's a bug in your hyperdual implementation in the Add function. I think line 44 should be a.data[i] += v.)

Chris · Answer 3 · Sat Dec 31 2016 03:48:03 GMT+0800 (China Standard Time)

Sorry for the multiple comments here. I'm encountering an oddity in the implementation, and I suspect that it's due to my total misunderstanding of how reuseAs works. Any hints would be appreciated.

It seems to me that reuseAs prevents modifications of the underlying data, as if there was a deferred function which resets it to the original value.

Here's an output from a basic test that I'm running: the first matrix shown is printed within the new Exp function, and the second one is printed from the main (input matrix is mat64.NewTriDense(2, matrix.Lower, []float64{0, 0, 2, 0})).

⎡1  0⎤
⎣2  1⎦
⎡0  0⎤
⎣0  0⎦

Below is the code executed in this test:

func (t *TriDense) Exp(a Triangular) {
	n, kind := a.Triangle()
	t.reuseAs(a.Triangle())

	if n%2 == 0 {
		// Exponential is trivial. Calculation performed on lower triangular.
		aP := NewTriDense(n, kind, nil)
		isUpper := kind == matrix.Upper
		if isUpper {
			aP.Copy(a.T())
		} else {
			aP.Copy(a)
		}
		computeKind := matrix.Lower
		t = NewTriDense(n, computeKind, nil)
		diag := NewTriDense(n, computeKind, nil)
		eye := NewTriDense(n, computeKind, nil)
		nilDiag := true
		for i := 0; i < n; i++ {
			eye.SetTri(i, i, 1)
			v := aP.At(i, i)
			if v != 0 {
				nilDiag = false
			}
			diag.SetTri(i, i, v)
			aP.SetTri(i, i, 0)
		}
		// TODO(ChristopherRabotin): Use (*TriDense).Add(a, b) once written.
		// Compute exponential which is simply diag*(eye + aP) (if diag != [0]).
		for i := 0; i < n; i++ {
			for j := 0; j < n; j++ {
				if i >= j {
					t.SetTri(i, j, eye.At(i, j)+aP.At(i, j))
				}
			}
		}
		if !nilDiag {
			t.MulTri(diag, t)
		}
		if isUpper {
			t.Copy(t.T())
		}
		fmt.Printf("%+v\n", Formatted(t))
	} else {
		// Convert to Dense then convert back to Triangle.
		// The exponential of a triangular matrix is always a triangular matrix
		// of the same kind.
		var triExp Dense
		triExp.Exp(DenseCopyOf(t))
		t.Copy(&triExp) // Store result in receiver.
	}
}

Brendan Tracey · Answer 4 · Sat Dec 31 2016 03:58:26 GMT+0800 (China Standard Time)

Did you figure out the internal issue (I can describe if not)?

Brendan Tracey · Answer 5 · Sat Dec 31 2016 04:11:27 GMT+0800 (China Standard Time)

reuseAs doesn't do anything with copying and defer. reuseAs checks the size of the receiver if it is non-zero, and makes it the correct size if it is non-zero.

I'm not exactly sure where you're printing, but t.Copy(t.T()) is wrong. Copy can't be used with itself as a receiver. This is a bug in TriDense.Copy, it should panic like the Dense version does.

Brendan Tracey · Answer 6 · Sat Dec 31 2016 04:13:11 GMT+0800 (China Standard Time)

As a side note, you should probably working with upper triangular matrices. They're faster in gonum since we use row-major storage.

Chris · Answer 7 · Sat Dec 31 2016 04:30:47 GMT+0800 (China Standard Time)

Okay, I'll switch to using upper triangular and not use copy. I have fixed the internal call issue I originally mentioned (I did a dirty symbolic link but it does the trick). Concerning the main bug, the first matrix printed is the Printf line in Exp, at the end of the scope for even dimensional triangular matrices. The second print is from the actual object on which Exp is called, and after the call. So what happens is that the pointer receiver is never actually modified. I still have yet to understand this.

…

On Fri, Dec 30, 2016, 13:13 Brendan Tracey ***@***.***> wrote: As a side note, you should probably working with upper triangular matrices. They're faster in gonum since we use row-major storage. — You are receiving this because you authored the thread. Reply to this email directly, view it on GitHub <#419 (comment)>, or mute the thread <https://github.com/notifications/unsubscribe-auth/AEma6DivFLel7EVpcPCFwp6GKbSbmiw1ks5rNWXZgaJpZM4LYBf7> .

Dan Kortschak · Answer 8 · Sat Dec 31 2016 04:44:39 GMT+0800 (China Standard Time)

So what is the code you have now?

Brendan Tracey · Answer 9 · Sat Dec 31 2016 04:52:32 GMT+0800 (China Standard Time)

The problem is t = NewTriDense(n, computeKind, nil). The pointer itself is passed by value (copied). This doesn't modify what the pointer points to, it changes the pointer altogether. What you really want is *t = *NewTriDense, you want to assign the value the pointer points to for the code to have the intended effect.

Chris · Answer 10 · Sat Dec 31 2016 11:29:41 GMT+0800 (China Standard Time)

Closed as per #420 (comment).