Allow KeyBERT to pass `batch_size` to `llm.encode()` method
adhadseKavida opened this issue
I'm using SentenceTransformer with KeyBERT. `SentenceTransformer.encode()` (which is called via `self.model.encode()`) accepts a `batch_size` parameter, but that parameter cannot be changed through `KeyBERT.extract_keywords()`.
I would suggest passing `**kwargs` through to the function so this can be set. Increasing the batch size hugely decreases inference time by maximizing GPU memory utilization.
I know we can send the doc and word embeddings ourselves, but that doesn't seem intuitive.
I'm open to discussion regarding possible alternatives.
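For illustration, the `**kwargs` passthrough proposed above could look like the following minimal sketch. The `DummyBackend` and `DummyKeyBERT` classes are stand-ins invented here, not KeyBERT's actual implementation; the real `SentenceTransformer.encode()` accepts `batch_size` in the same way.

```python
# Sketch of the proposed **kwargs passthrough (illustrative names only).

class DummyBackend:
    """Stand-in for a sentence-transformers model."""

    def __init__(self):
        self.last_batch_size = None

    def encode(self, docs, batch_size=32):
        # Record the batch size so we can verify the kwarg arrived.
        self.last_batch_size = batch_size
        return [[0.0] * 4 for _ in docs]  # placeholder embeddings


class DummyKeyBERT:
    """Stand-in showing how extract_keywords could forward kwargs."""

    def __init__(self, model):
        self.model = model

    def extract_keywords(self, docs, **encode_kwargs):
        # Forward any extra keyword arguments straight to encode().
        return self.model.encode(docs, **encode_kwargs)


kb = DummyKeyBERT(DummyBackend())
kb.extract_keywords(["doc one", "doc two"], batch_size=128)
print(kb.model.last_batch_size)  # -> 128
```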
Thanks for sharing this! I would prefer to avoid the use of `**kwargs` as much as possible, for two reasons. First, it is less explicit, so it is not clear to users what it exactly does. Second, it would be a `**kwargs` for the sole purpose of a specific backend, which seems a rather big change for a small feature.
Instead, I think opening up `SentenceTransformerBackend` might fit a bit better here, since your suggestion relates to sentence-transformers only. There, we could simply add another parameter, `encode_kwargs`, that takes in your suggested changes.
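A rough sketch of that idea: store `encode_kwargs` on the backend at construction time and splat it into every `encode()` call. The `StubSentenceTransformer` below is a test double standing in for the real sentence-transformers model, and this backend shape is an assumption about how the feature could be wired up, not the merged implementation:

```python
class StubSentenceTransformer:
    """Stand-in for sentence_transformers.SentenceTransformer."""

    def __init__(self):
        self.calls = []

    def encode(self, docs, batch_size=32):
        self.calls.append({"batch_size": batch_size})
        return [[0.0] * 4 for _ in docs]  # placeholder embeddings


class SentenceTransformerBackend:
    """Sketch: a backend that remembers user-supplied encode kwargs."""

    def __init__(self, embedding_model, **encode_kwargs):
        self.embedding_model = embedding_model
        # Saved once at construction, applied to every encode() call.
        self.encode_kwargs = encode_kwargs

    def embed(self, documents):
        return self.embedding_model.encode(documents, **self.encode_kwargs)


model = StubSentenceTransformer()
backend = SentenceTransformerBackend(model, batch_size=256)
backend.embed(["doc a", "doc b", "doc c"])
print(model.calls[-1]["batch_size"])  # -> 256
```

Keeping the kwargs on the backend (rather than on every `extract_keywords` call) means the change stays local to the sentence-transformers backend, which is what makes it a smaller change than a global `**kwargs`.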
That seems a much better suggestion!
Thanks! Unfortunately, I do not have much time these days to work on this, so it might take a couple of weeks at least. If you, or someone else, wants to work on this then that would be appreciated!
I'll look into it and submit a PR as soon as possible.
Opened up a PR for review!