Huge memory allocations from `predict.sbo_preds`

Question

Huge memory allocations from `predict.sbo_preds`

vgherard opened this issue 4 years ago · comments

The current (C++) implementation of the predict.sbo_preds() method has two big issues:

Every call to predict() makes a copy of the entire k-gram prediction tables. This is memory expensive and slow if predict() is called in a non-vectorized way (as would happen e.g. in interactive text prediction).
The look-up method in prediction tables is very slow, and causes huge memory allocations/deallocations for large vector input, which slow down a lot model evaluations in eval_sbo_preds(). Maybe #8 could partially fix this?

Valerio Gherardi · Answer 1 · Thu Nov 12 2020 02:51:00 GMT+0800 (China Standard Time)

After messing around for a while with this: the easiest solution is probably to base the whole sbo_preds object on a pure C++ class. This applies also to issues #8 and #19 . Closing.