zyang37 / VectorSearch_ShardIndex

Efficient Serving of Large-scale Vector Search with Sharded Indexes

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Efficient Serving of Large-scale Vector Search with Sharded Indexes

A prototype system built on top of Faiss for efficient vector search on large datasets with sharded indexes.

Setup

pip install -r requirements.txt

Generate synthetic index shards

mkdir shards

python create_shard_idx.py

Query sharded indexes

python query_shard_idx.py

Visualize Logs

Once query index is done, by default logs will be generated called logs/app.log. To visualize the logs, run:

python vislogs/vislogs.py --log logs/app.log

Then, check out vislogs_tmp.pdf!

About

Efficient Serving of Large-scale Vector Search with Sharded Indexes


Languages

Language:Jupyter Notebook 81.9%Language:Python 16.7%Language:Shell 1.3%