Dicklesworthstone / swiss_army_llama

A FastAPI service for semantic text search using precomputed embeddings and advanced similarity measures, with built-in support for various file types through textract.

Does it require 40 GB of RAM to work?

sujantkumarkv opened this issue

I'm on a MacBook Air M2 with 16 GB of RAM.

I tried running it locally with Python, following this line from the README:
"To run it natively (not using Docker) in a Python venv, you can use these commands:"

It failed a few times because of missing packages such as greenlet, but the error it gives now is about RAM allocation.

The error it raises is:
raise ValueError(f"Cannot allocate {RAMDISK_SIZE_IN_GB}G for RAM Disk. Total system RAM is {total_ram_gb:.2f}G.")
ValueError: Cannot allocate 40G for RAM Disk. Total system RAM is 16.00G.

And no, it doesn't need 40 GB of RAM. If you turn off the RAM disk, it just needs enough memory to load whatever model you have selected. You can focus on smaller models that easily fit in your RAM budget.
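
To illustrate why turning off the RAM disk avoids the error, here is a minimal sketch of that kind of check. It is not the project's actual code. The function name `validate_ramdisk` and the `use_ramdisk` flag are hypothetical, and the size comparison is an assumption inferred from the error message. Check the README or the project's configuration file for the real setting name.

```python
def validate_ramdisk(ramdisk_size_gb: float, total_ram_gb: float,
                     use_ramdisk: bool = True) -> bool:
    """Return True if a RAM disk should be set up, False if it is disabled.

    Hypothetical helper: names and logic are assumptions, not the
    project's real implementation.
    """
    if not use_ramdisk:
        # RAM disk disabled: skip the size check entirely.
        return False
    if ramdisk_size_gb > total_ram_gb:
        # Same message format as the error quoted in this issue.
        raise ValueError(
            f"Cannot allocate {ramdisk_size_gb}G for RAM Disk. "
            f"Total system RAM is {total_ram_gb:.2f}G."
        )
    return True


if __name__ == "__main__":
    # With the RAM disk disabled, a 40G request on a 16G machine is ignored.
    print(validate_ramdisk(40, 16.0, use_ramdisk=False))
```

With the flag off, the requested RAM disk size no longer matters, so the only memory constraint left is the model itself.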

Thanks for your work and contribution.

Does it work with a GPU? You should document that either way.