Not scalable

Question

Not scalable

saurabh11baghel opened this issue 5 years ago · comments

@jmschrei @domoritz I want to select a subset of 100 samples from a dataset of total 100000 samples and 25 features. The FeatureBasedSelection method is taking forever without doing anything at all.

data_subset,labels_subset = FeatureBasedSelection(100,verbose=True).fit_transform(data,labels)

it is showing verbose output as following for the past one hour.
0%| | 0/100 [00:00<?, ?it/s]

What do you thing is wrong?

Jacob Schreiber · Answer 1 · Fri Mar 06 2020 12:54:13 GMT+0800 (China Standard Time)

Sorry for missing this. I don't really know what is wrong. Can you try downloading the latest patch and trying it again? Also try using optimizer='stochastic', which should be significantly faster but not the exact greedy solution.

Jacob Schreiber · Answer 2 · Sun Mar 22 2020 04:58:30 GMT+0800 (China Standard Time)

Please re-open if you are still encountering issues.