There are 0 repository under lsh-algorithm topic.
一个基于 fasttext + faiss 的商品内容相关推荐实现,nginx+uwsgi+flask / gunicorn+uvicorn+fastapi 提供api查询接口,增加Spark实现 Ansj+Word2vec+LSH+Phoenix
Locality Sensitive Hashing, fuzzy-hash, min-hash, simhash, aHash, pHash, dHash。基于 Hash值的图片相似度、文本相似度
ProbMinHash – A Class of Locality-Sensitive Hash Algorithms for the (Probability) Jaccard Similarity
A Query Efficient Natural Language Attack in a Black Box Setting
TreeMinHash: Fast Sketching for Weighted Jaccard Similarity Estimation
Search your object with hash
Build content-based image retrieval system using deep learning, applied some large scale similarity search technicals like Kdtree, LSH, Faiss.
Recommendation System on cryptocurrency, using data collected from users' tweets + 10-Fold Cross Validation ( Based on the cryptocoins from each user's tweets, the program runs algorithms on the data, resulting in the recommendation of other cryptocoins for each user) ( readme in greek but soon to be translated in English )
使用线程池的高并发 LSH 算法, C++ 实现
An implementation of LSH Forrest based off of the following paper (http://infolab.stanford.edu/~bawa/Pub/similarity.pdf).
Lab assignments for the course ID2222-Data Mining at KTH
Implementation of algorithms for big data using python, numpy, pandas.
Repository for all assignments of the course COL761: Data Mining (Fall 2020), taught at IIT Delhi
Homeworks for Advanced Data Mining and Language Technology (DMT) at La Sapienza University of Rome
Nilsimsa implementation as a swift package
A Robust Library in C# for Similarity Estimation
Vectors - Nearest neighbor search and Clustering using LSH, Hypercube (and Lloyd's only at the clustering) algorithms with L2 metric.
Coursera's Natural Language Processing specialization
This repository contains simple and funny Data Mining projects in Python.
Example on the Local Sensitive Hashing (LSH) algorithm. Relevant for Big Data
Applied the LSH algorithm (developed from scratch) for finding similar texts.
LSH algorithm made with C++
📈|Time Series - Nearest neighbor search and Clustering using LSH, Hypercube (and Lloyd's only at the clustering) algorithms with metrics: L2, Discrete and Continuous Fréchet.
MDLE First Assignment - The objective of this project was to implement the A-Priori algorithm to obtain the most frequent itemsets for a list of conditions for a large set of patients, obtaining then associations between conditions by extracting some rules, and also to implement and apply LSH to identify similar news articles from a dataset.
SpellChecker: an application to check for spell errors.
This repo shows research paper upon which I worked during my summer research intern - 2022.
The assignment comprises two main tasks: implementing LSH to identify similar businesses based on user ratings and developing various collaborative filtering recommendation systems to predict user ratings for businesses.
Homework_4 for Algorithmic Methods for Data Mining (ADM), MSc in Data Science at La Sapienza University of Rome
Finding Similar Items: Textually Similar Documents
Autoencoder dimensionality reduction, EMD-Manhattan metrics comparison and classifier based clustering on MNIST dataset.
Finding similar documents using LSH with MapReduce on multi-node Spark Cluster