zylon-ai / private-gpt

Interact with your documents using the power of GPT, 100% privately, no data leaks

Home Page: https://privategpt.dev

Qdrant RAM usage (allow quantisation and memmap?)

Smensink opened this issue

Hello, I am attempting to create a Qdrant database from a large collection of medical guidelines. Although the raw text is only 800 MB, the database rapidly grows to 15+ GB before the privateGPT instance crashes, and RAM usage climbs to 40-50 GB+. I am aware that Qdrant supports quantisation and keeping the database on disk instead of in RAM, but I cannot get this working with privateGPT (the vector store is not loaded the same way as in the Qdrant documentation). Would it be possible to allow these features?

If the collection has already been created, you can update it to use quantization and on-disk vectors. This is documented at https://qdrant.tech/documentation/concepts/collections/#update-collection-parameters.
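
For reference, roughly what that looks like with the Python client (a sketch, not tested against privateGPT: it assumes a Qdrant server at localhost:6333 rather than the embedded local mode, and "my_collection" stands in for whatever collection privateGPT created; you can list collections with client.get_collections()):

```python
from qdrant_client import QdrantClient, models

# Assumes a Qdrant server; the embedded local (path-based) mode
# may not honour these collection parameters.
client = QdrantClient(url="http://localhost:6333")

client.update_collection(
    collection_name="my_collection",  # substitute the collection privateGPT created
    # Keep the full-precision vectors on disk (memmap) instead of in RAM.
    # "" addresses the default unnamed vector.
    vectors_config={"": models.VectorParamsDiff(on_disk=True)},
    # Put the HNSW index on disk as well.
    hnsw_config=models.HnswConfigDiff(on_disk=True),
    # Add int8 scalar quantization; the small quantized vectors stay in RAM for speed.
    quantization_config=models.ScalarQuantization(
        scalar=models.ScalarQuantizationConfig(
            type=models.ScalarType.INT8,
            quantile=0.99,
            always_ram=True,
        )
    ),
)
```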

There's also a tutorial about loading a large number of vectors:
https://qdrant.tech/documentation/tutorials/bulk-upload/#bulk-upload-a-large-number-of-vectors
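
The gist of that tutorial, sketched below, is to disable HNSW indexing while you ingest and turn it back on afterwards. Same assumptions as above (a Qdrant server, a placeholder collection name), and the exact keyword may be optimizers_config or optimizer_config depending on your qdrant-client version:

```python
from qdrant_client import QdrantClient, models

client = QdrantClient(url="http://localhost:6333")

# Disable indexing while inserting so the upload doesn't pay the HNSW build cost.
client.update_collection(
    collection_name="my_collection",
    optimizers_config=models.OptimizersConfigDiff(indexing_threshold=0),
)

# ... run the privateGPT ingestion here ...

# Re-enable indexing once ingestion is done (20000 is Qdrant's default threshold).
client.update_collection(
    collection_name="my_collection",
    optimizers_config=models.OptimizersConfigDiff(indexing_threshold=20000),
)
```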

Huge, that's very helpful, thanks. I'll try it tomorrow. Is there any plan to include these options natively in the repo?

> Is there any plan to include these options natively in the repo?

I'm not sure, but it would make sense to call them externally, since they're not very common operations.

It works! Still more RAM usage than I would have thought, but it seems to level out around 18 GB.