cholesky_cpu: U(2,2) is zero, singular U.

Question

cholesky_cpu: U(2,2) is zero, singular U.

jiejunnan opened this issue 5 years ago · comments

using my dataset,i have tried many solutions to solve it,but all failed

Takanori Hasegawa · Answer 1 · Mon Aug 24 2020 10:57:55 GMT+0800 (China Standard Time)

@jiejunnan Hello, this issue comes from that cov_diag degenerates to 0. So, you should set large value of lambda_cov_diag (i.e. from 0.1 to 1.0, default is 0.005).

I also faced the same issue, and I avoided this error after I set lambda_cov_diag = 1.0 with 10 epochs. (when over 20 epochs, loss value goes to nan... in my dataset)

The paper says your issue in p.6

DAGMM also has the singularity problem as in GMM: trivial solutions are triggered when the diagonal entries in covariance matrices degenerate to 0.

Then, the author decided to penalize 1 / cov_k. I hope you will succeed in your dataset.

Dr. AmmarKamoona · Answer 2 · Fri Mar 19 2021 15:19:35 GMT+0800 (China Standard Time)

@jiejunnan Hello, this issue comes from that cov_diag degenerates to 0. So, you should set large value of lambda_cov_diag (i.e. from 0.1 to 1.0, default is 0.005).

I also faced the same issue, and I avoided this error after I set lambda_cov_diag = 1.0 with 10 epochs. (when over 20 epochs, loss value goes to nan... in my dataset)

The paper says your issue in p.6

DAGMM also has the singularity problem as in GMM: trivial solutions are triggered when the diagonal entries in covariance matrices degenerate to 0.

Then, the author decided to penalize 1 / cov_k. I hope you will succeed in your dataset.

Hi there,

I have the same problem on my dataset. the latent dimension I use is high 8, 16.

Takanori Hasegawa · Answer 3 · Fri Mar 19 2021 15:40:21 GMT+0800 (China Standard Time)

In my experience, a high latent dimension tends to result in this error. I recommend using one or two dimensions for latent space to avoid this error.
According to the paper, the authors used one dimension for latent space.

Fig.1 in the original paper shows the relationship between one dimension latent feature and reconstruction error. Before applying this model, you could draw the same figure( one-dim latent feature and reconstruction error). If you find no difference or cannot train an autoencoder-based model, you should not take this model.

Thanks for reading.