There are 2 repositories under similarity-measures topic.
Implementation of various string similarity and distance algorithms: Levenshtein, Jaro-winkler, n-Gram, Q-Gram, Jaccard index, Longest Common Subsequence edit distance, cosine similarity ...
Up to 200x Faster Inner Products and Vector Similarity — for Python, JavaScript, Rust, C, and Swift, supporting f64, f32, f16 real & complex, i8, and binary vectors using SIMD for both x86 AVX2 & AVX-512 and Arm NEON & SVE 📐
📚 String comparison and edit distance algorithms library, featuring : Levenshtein, LCS, Hamming, Damerau levenshtein (OSA and Adjacent transpositions algorithms), Jaro-Winkler, Cosine, etc...
A .NET port of java-string-similarity
Scalable Time Series Data Analytics
中文智能客服机器人demo,包含闲聊和专业问答2个部分,支持自定义组件(Chinese intelligent customer chatbot Demo, including the gossip and the professional Q&A(FAQ) , support for custom components!)
Quantify the difference between two arbitrary curves in space
DataGene - Identify How Similar TS Datasets Are to One Another (by @firmai)
Python library for processing (tandem) mass spectrometry data and for computing spectral similarities.
金融时间序列(预测分析 / 相似度 / 数据处理)
Free hands-on course with the implementation (in Python) and description of several computational, mathematical and statistical algorithms.
Information Theory and Distance Quantification with R
Reference implementation of the paper VERSE: Versatile Graph Embeddings from Similarity Measures
vips-powered ruby gem to measure images similarity, implementing dHash and IDHash algorithms
building a recommendation system using graph search methodologies. We will be comparing these different approaches and closely observe the limitations of each.
Formed trajectories of sets of points.Experimented on finding similarities between trajectories based on DTW (Dynamic Time Warping) and LCSS (Longest Common SubSequence) algorithms.Modeled trajectories as strings based on a Grid representation.Benchmarked KNN, Random Forest, Logistic Regression classification algorithms to classify efficiently trajectories.
Romanian WordNet (Data + API for Python)
Code for the paper: Don't Settle for Average, Go for the Max: Fuzzy Sets and Max-Pooled Word Vectors, ICLR 2019.
A GP-GPU/CPU Dynamic Time Warping (DTW) implementation for the analysis of Multivariate Time Series (MTS).
Speech (audio) subjective evaluation system
Code for the papers: Correlation Coefficients and Semantic Textual Similarity, NAACL-HLT 2019 & Correlations between Word Vector Sets, EMNLP-IJCNLP 2019.
Fuzzy string matching in R. Inspired by Python's thefuzz (but without the Python).
[IROS 2021] Implementation of "Similarity-Aware Fusion Network for 3D Semantic Segmentation"
Extended edit similarity measurement for high dimensional discrete-time series signal (e.g., multi-unit spike-train).
Tool to estimate deltas for sequence sets and answer questions about relative contribution
Similarity and distance measures for clustering and record linkage applications in R
Repository containing all the codes created for the lab sessions of CSE3018 Content Based Image and Video Retrieval at VIT University Chennai Campus
TreeMinHash: Fast Sketching for Weighted Jaccard Similarity Estimation