Possibly a bug in ElasticNet implementation

Question

Possibly a bug in ElasticNet implementation

buntec opened this issue 2 years ago · comments

Describe the bug
In ElasticNet.java the raw response vector y is augmented with p zeros before calling LASSO.train, where it is centered. Consequently, the model is trained with a response vector not having p zeros at the end (instead it contains p times the negative of the mean of the augmented y). If I'm not mistaken, this means that the ridge penalty is not taken into account correctly, and that the intercept b is computed incorrectly.

This is just from looking at the code. I don't have numerical evidence to back this up. Apologies if I'm missing something here!

Expected behavior
I think the response vector y should be centered before augmenting it with zeros. (This would probably require moving the centering of y out of LASSO.train.)

Actual behavior
The model is trained with a response vector y not having p zeros at the end.

Code snippet
N/A

Input data
N/A

Additional context
N/A

Add any other context about the problem here.

Haifeng Li · Answer 1 · Fri Nov 04 2022 22:50:35 GMT+0800 (China Standard Time)

Thanks for callout. I center first n elements of y2 now before calling LASSO. Please try master branch.