haifengl / smile

Statistical Machine Intelligence & Learning Engine

Home Page:https://haifengl.github.io

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Takes more memory for LSH model in NearestNeighborSearch

manju22412 opened this issue · comments

I am using lsh nearest neighbors, it takes around 86mb for model storage by training with 20000 records.
Is there any way to reduce the sze..?

Can anyone please reply on the above issue please..

Multi probe LSH uses more memory by design. If memory is a concern, try plain LSH.

i got this issue when i tried with plain LSH only..
And one more thing, it shows "Input vector sizes are different" sometimes when i try to inferencing the model. kindly mention why its happening..?

Your inference data vector size is different from training data's.

i have used the libsvm format for training and inferencing. And getting output for only few cases and remaining it shows error like "Input vector sizes are different". When i try to pass the value other than 1017881 on below line, same error was displaying..

LSH<double[]> lsh = new LSH<>(x, x, 4.0, 1017881);

The parameter w (1017881) is nothing to do with your data vector size.

but, it is issue now..