There are 14 repositories under locality-sensitive-hashing topic.
MinHash, LSH, LSH Forest, Weighted MinHash, HyperLogLog, HyperLogLog++, LSH Ensemble and HNSW
FAst Lookups of Cosine and Other Nearest Neighbors (based on fast locality-sensitive hashing)
Open source audio fingerprinting in .NET. An efficient algorithm for acoustic fingerprinting written purely in C#.
Selected Machine Learning algorithms for natural language processing and semantic analysis in Golang
Elasticsearch plugin for nearest neighbor search. Store vectors and run similarity search using exact and approximate algorithms.
Go to: https://github.com/alexklibisz/elastiknn
Scalable and Sustainable Deep Learning via Randomized Hashing
An implementation of efficient LSH inspired by fruit fly brain
Dice.com repo to accompany the dice.com 'Vectors in Search' talk by Simon Hughes, from the Activate 2018 search conference, and the 'Searching with Vectors' talk from Haystack 2019 (US). Builds upon my conceptual search and semantic search work from 2015
Near-duplicate image detection using Locality Sensitive Hashing
One-Shot Learning using Nearest-Neighbor Search (NNS) and Locality-Sensitive Hashing LSH
locality sensitive hashing (LSHASH) for Python3
Fast and precise comparison of genomes and metagenomes (in the order of terabytes) on a typical personal laptop
SetSketch: Filling the Gap between MinHash and HyperLogLog
ProbMinHash – A Class of Locality-Sensitive Hash Algorithms for the (Probability) Jaccard Similarity
Locality Sensitive Hashing In R
A simple audio fingerprinting system
Software for exploration of gene expression data from single-cell RNA sequencing.
Query-Aware LSH for Approximate NNS (PVLDB 2015 and VLDBJ 2017)
BagMinHash - Minwise Hashing Algorithm for Weighted Sets
A tutorial on scalable retrieval of matrix factorization recommendations
TLSH (Trend Micro Locality Sensitive Hash) library for Ruby
semantic-sh is a SimHash implementation to detect and group similar texts by taking power of word vectors and transformer-based language models (BERT).
Implementing various machine learning algorithm from scratch
Minhash and maxhash library in Python, combining flexibility, expressivity, and performance.
A fast high dimensional near neighbor search algorithm based on group testing and locality sensitive hashing