Eric Zhu's repositories

datasketch

MinHash, LSH, LSH Forest, Weighted MinHash, HyperLogLog, HyperLogLog++, LSH Ensemble and HNSW

Language: Python | License: MIT | Stargazers: 2305 | Issues: 48 | Issues: 161

SetSimilaritySearch

All-pair set similarity search on millions of sets in Python and on a laptop

Language: Python | License: Apache-2.0 | Stargazers: 585 | Issues: 20 | Issues: 10

lshensemble

LSH index for approximate set containment search

Language: Go | License: MIT | Stargazers: 55 | Issues: 5 | Issues: 3

go-fasttext

Facebook fastText database in SQLite with Go API

Language: Go | License: MIT | Stargazers: 32 | Issues: 4 | Issues: 0

llm_maze_agent

Navigating a maze using LLM agent

Language: Python | License: MIT | Stargazers: 32 | Issues: 2 | Issues: 0

go-set-similarity-search

Efficient set similarity search algorithms implemented in Go

Language: Go | License: Apache-2.0 | Stargazers: 29 | Issues: 4 | Issues: 0

minhash-lsh

Minhash LSH in Golang

Language: Go | License: MIT | Stargazers: 25 | Issues: 4 | Issues: 3

josie

Code and Benchmarks for JOSIE (SIGMOD 2019)

WhatGPT

A ChatGPT clone made with ChatGPT (GPT-4)

Language: JavaScript | License: MIT | Stargazers: 5 | Issues: 2 | Issues: 0

rfc6266

Content-Disposition header support for Python

Language: Python | License: LGPL-3.0 | Stargazers: 2 | Issues: 3 | Issues: 0

chatgpt-data-analysis-examples

Examples of using ChatGPT with Code Interpreter Plugin for data analysis

Language: Python | License: MIT | Stargazers: 1 | Issues: 2 | Issues: 0

FLAML

A fast library for AutoML and tuning. Join our Discord: https://discord.gg/Cppx2vSPVP.

Language: Jupyter Notebook | License: MIT | Stargazers: 1 | Issues: 1 | Issues: 0

gpt_index

GPT Index (LlamaIndex) is a project consisting of a set of data structures designed to make it easier to use large external knowledge bases with LLMs.

Language: Python | License: MIT | Stargazers: 1 | Issues: 1 | Issues: 0

autogen

Enable Next-Gen Large Language Model Applications. Join our Discord: https://discord.gg/pAbnFJrkgZ

Language: Jupyter Notebook | License: CC-BY-4.0 | Stargazers: 0 | Issues: 0 | Issues: 0

big-ann-benchmarks

Framework for evaluating ANNS algorithms on billion scale datasets.

Language: Jupyter Notebook | License: MIT | Stargazers: 0 | Issues: 1 | Issues: 0

differential-privacy

Google's C++ differential privacy library.

Language: C++ | License: Apache-2.0 | Stargazers: 0 | Issues: 2 | Issues: 0

DiskANN

Graph-structured Indices for Scalable, Fast, Fresh and Filtered Approximate Nearest Neighbor Search

Language: C++ | License: NOASSERTION | Stargazers: 0 | Issues: 1 | Issues: 0

hnswlib

Header-only C++/python library for fast approximate nearest neighbors

Language: C++ | License: Apache-2.0 | Stargazers: 0 | Issues: 1 | Issues: 0

joey

baby quokka

Language: Python | License: NOASSERTION | Stargazers: 0 | Issues: 1 | Issues: 0

langchain

⚡ Building applications with LLMs through composability ⚡

Language: Python | License: MIT | Stargazers: 0 | Issues: 1 | Issues: 0
Language: Jupyter Notebook | Stargazers: 0 | Issues: 2 | Issues: 0

luceneutil

Various utility scripts for running Lucene performance tests

Language: JavaScript | License: Apache-2.0 | Stargazers: 0 | Issues: 1 | Issues: 0

nmslib

Non-Metric Space Library (NMSLIB): An efficient similarity search library and a toolkit for evaluation of k-NN methods for generic non-metric spaces.

Language: C++ | Stargazers: 0 | Issues: 2 | Issues: 0
Language: Python | Stargazers: 0 | Issues: 2 | Issues: 0

paper-gpt

Paper utilities using LLM

License: MIT | Stargazers: 0 | Issues: 2 | Issues: 0

sqlify

Create a SQLite database from an Excel spreadsheet

Language: Python | License: MIT | Stargazers: 0 | Issues: 1 | Issues: 0

toolformer-pytorch

Implementation of Toolformer, Language Models That Can Use Tools, by MetaAI

Language: Python | License: MIT | Stargazers: 0 | Issues: 1 | Issues: 0

TPCH-sqlite

SQLite TPCH database and SQL queries

Language: Python | License: MIT | Stargazers: 0 | Issues: 1 | Issues: 0
Language: Java | Stargazers: 0 | Issues: 2 | Issues: 0