gen.matrix

Question

ggrothendieck opened this issue 3 years ago · comments

This works but it might be more convenient if a gen.matrix existed.

 as.matrix(gen.data.frame(gen.vector(i+j, i = 1:4), j = 1:4))
##      V1 V2 V3 V4
## [1,]  2  3  4  5
## [2,]  3  4  5  6
## [3,]  4  5  6  7
## [4,]  5  6  7  8

Patrick Roocks · Answer 1 · Sun Apr 25 2021 17:17:57 GMT+0800 (China Standard Time)

That's an excellent idea! I just added this functionality in my last commit (v 0.2.1).

But instead of just treating gen.matrix as a short-cut for as.matrix(gen.data.frame(...)), I decided that auto-generated column names are not used in the matrix. Explicitly defined column names are kept, however:

> gen.matrix(gen.vector(i+j, i = 1:4), j = 1:4)
     [,1] [,2] [,3] [,4]
[1,]    2    3    4    5
[2,]    3    4    5    6
[3,]    4    5    6    7
[4,]    5    6    7    8
> gen.matrix(gen.named.vector('col{i}', i+j, i = 1:4), j = 1:4)
     col1 col2 col3 col4
[1,]    2    3    4    5
[2,]    3    4    5    6
[3,]    4    5    6    7
[4,]    5    6    7    8

ggrothendieck · Answer 2 · Mon Apr 26 2021 19:23:55 GMT+0800 (China Standard Time)

Given that we know the result has 2 dimensions, would it be possible to eliminate the need for gen.vector and allow something like:

gen.matrix(i+j, i=1:4, j=1:4)

gen.matrix(+(i == j), i=1:4, j=1:4) # diagonal matrix

gen.matrix(+(i == j + 1), i=1:4, j=1:4)
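For comparison, here is a minimal base R sketch of the three matrices these calls describe. It does not use gen.matrix itself, and it assumes that i is the row index and j is the column index; the variable names are only illustrative.

```r
# Base R only; gen.matrix is not called here.
# Assumption: i indexes rows and j indexes columns.
idx <- 1:4

sums    <- outer(idx, idx, "+")                           # entry [i, j] is i + j
ident   <- outer(idx, idx, function(i, j) +(i == j))      # ones on the main diagonal
subdiag <- outer(idx, idx, function(i, j) +(i == j + 1))  # ones directly below the diagonal

print(sums)
print(ident)
print(subdiag)
```

The unary plus converts the logical comparison to 0/1 integers, exactly as in the proposed calls.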

Patrick Roocks · Answer 3 · Thu Apr 29 2021 04:11:09 GMT+0800 (China Standard Time)

Another nice idea!

Everything implemented with the last commit.

ggrothendieck · Answer 4 · Thu Apr 29 2021 09:48:33 GMT+0800 (China Standard Time)

I would have expected the first to give a column matrix and the second to give a row matrix, but it is the other way around.

> gen.matrix(i, i = 1:4, j = 1)
     [,1] [,2] [,3] [,4]
[1,]    1    2    3    4
> gen.matrix(i, i = 1, j = 1:4)
     [,1]
[1,]    1
[2,]    1
[3,]    1
[4,]    1

Patrick Roocks · Answer 5 · Fri Apr 30 2021 01:13:37 GMT+0800 (China Standard Time)

I had in mind that gen.matrix(i+j, i = 1:4, j = 1:3) is a short-cut for gen.matrix(gen.vector(i+j, i = 1:4), j = 1:3), and thus the order was gen.matrix(expr, col_var, row_var).

But I totally agree with you that it is very counter-intuitive to specify the columns before the rows. It's a mathematical convention that the row index precedes the column index.

I decided to change it according to your suggestion. Fixed with the last commit.
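The row-before-column convention can be illustrated in base R with outer(), whose first argument spans the rows and whose second spans the columns. This is only a sketch of the convention; it does not call gen.matrix, and the names col_mat and row_mat are made up for the example.

```r
# Base R only; illustrates "row index first, column index second".
col_mat <- outer(1:4, 1, function(i, j) i)  # i varies over rows: 4 x 1 column matrix
row_mat <- outer(1, 1:4, function(i, j) i)  # i is fixed at 1:    1 x 4 row matrix

print(dim(col_mat))
print(dim(row_mat))
```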

ggrothendieck · Answer 6 · Fri Apr 30 2021 04:59:50 GMT+0800 (China Standard Time)

I suppose there could be an argument that specifies the order. `matrix(...)` has such an argument.

Patrick Roocks · Answer 7 · Fri Apr 30 2021 13:45:12 GMT+0800 (China Standard Time)

You mean the byrow parameter?
To be honest, I don't really like the idea of additional parameters in the gen.* functions of this package. The parametrization should be as lightweight as possible.

Instead of writing something like gen.matrix(i+j, i=1:3, j=1:2, byrow = FALSE), I suggest considering t(gen.matrix(i+j, i=1:3, j=1:2)) (using the transpose function t(...) from base R) as the canonical alternative.
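The same trade-off exists in base R, where a fill-order argument and an explicit transpose are interchangeable. A small sketch using only matrix() and t(), not gen.matrix; the variable names are illustrative.

```r
# Base R only: a fill-order flag versus an explicit transpose.
x <- 1:6

filled_by_row <- matrix(x, nrow = 2, byrow = TRUE)  # fill a 2 x 3 matrix row by row
via_transpose <- t(matrix(x, ncol = 2))             # fill 3 x 2 by column, then transpose

# Both routes give the same 2 x 3 matrix.
stopifnot(identical(dim(filled_by_row), dim(via_transpose)))
stopifnot(all(filled_by_row == via_transpose))
```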

ggrothendieck · Answer 8 · Sat May 01 2021 01:53:09 GMT+0800 (China Standard Time)

There is also crossprod and tcrossprod as another way of doing this. This eliminates the cost of the transpose by generating it directly.
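For reference, this is what the two base R functions compute: crossprod(a, b) is t(a) %*% b and tcrossprod(a, b) is a %*% t(b), without the transpose having to be formed explicitly by the caller. The matrices below are arbitrary examples, and how this would map onto gen.matrix is not specified in this thread.

```r
# Base R only: crossprod / tcrossprod versus explicit transposes.
a <- matrix(1:6, nrow = 3)   # 3 x 2
b <- matrix(7:12, nrow = 3)  # 3 x 2

cp  <- crossprod(a, b)       # same values as t(a) %*% b, a 2 x 2 matrix
tcp <- tcrossprod(a, b)      # same values as a %*% t(b), a 3 x 3 matrix

stopifnot(all(cp == t(a) %*% b))
stopifnot(all(tcp == a %*% t(b)))
```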

Patrick Roocks · Answer 9 · Sat May 08 2021 20:14:45 GMT+0800 (China Standard Time)

The bycol parameter (cf. #4) solves the issue:

> gen.matrix(i+j, i=1:3, j=1:2, bycol = TRUE)
     [,1] [,2] [,3]
[1,]    2    3    4
[2,]    3    4    5

ggrothendieck · Answer 10 · Sun May 09 2021 07:03:35 GMT+0800 (China Standard Time)

That's great, but shouldn't the default be TRUE, since the main data structures in R, matrices and data frames, are stored column by column?
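The column-by-column storage mentioned here can be checked directly in base R. A short sketch; the objects m and df are just examples.

```r
# Base R only: matrices are stored column by column (column-major order).
m <- matrix(1:6, nrow = 2)   # default byrow = FALSE fills column by column

flat  <- as.vector(m)        # the underlying data, one column after another
third <- m[3]                # linear index 3 is the element m[1, 2]

# A data frame is a list with one element per column.
df <- data.frame(a = 1:2, b = c("x", "y"))

stopifnot(all(flat == 1:6))
stopifnot(third == m[1, 2])
stopifnot(is.list(df), length(df) == 2)
```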

Patrick Roocks · Answer 11 · Tue May 18 2021 02:05:13 GMT+0800 (China Standard Time)

I renamed bycol to byrow. But byrow = TRUE still means that the inner index refers to the rows.

Anyway, I think it is more "canonical" because it matches exactly how the analogous gen.vector result is converted to a matrix:

> matrix(gen.vector(i+j, i=1:3, j=1:2), ncol = 3, byrow = TRUE)
     [,1] [,2] [,3]
[1,]    2    3    4
[2,]    3    4    5


> gen.matrix(i+j, i=1:3, j=1:2, byrow = TRUE)
     [,1] [,2] [,3]
[1,]    2    3    4
[2,]    3    4    5