There are 2 repositories under the similarity-measures topic.
Implementation of various string similarity and distance algorithms: Levenshtein, Jaro-Winkler, n-Gram, Q-Gram, Jaccard index, Longest Common Subsequence edit distance, cosine similarity ...
Up to 200x Faster Inner Products and Vector Similarity — for Python, JavaScript, Rust, and C, supporting f64, f32, f16 real & complex, i8, and binary vectors using SIMD for both x86 AVX2 & AVX-512 and Arm NEON & SVE 📐
A .NET port of java-string-similarity
Scalable Time Series Data Analytics
Chinese intelligent customer-service chatbot demo with two parts, chit-chat and professional Q&A (FAQ), with support for custom components
Quantify the difference between two arbitrary curves in space
DataGene - Identify How Similar TS Datasets Are to One Another (by @firmai)
Free hands-on course with the implementation (in Python) and description of several computational, mathematical and statistical algorithms.
Information Theory and Distance Quantification with R
vips-powered Ruby gem to measure image similarity, implementing the dHash and IDHash algorithms
Building a recommendation system using graph search methodologies. We compare the different approaches and closely observe the limitations of each.
Formed trajectories of sets of points. Experimented with finding similarities between trajectories using the DTW (Dynamic Time Warping) and LCSS (Longest Common SubSequence) algorithms. Modeled trajectories as strings based on a grid representation. Benchmarked KNN, Random Forest, and Logistic Regression classifiers for efficient trajectory classification.
Romanian WordNet (Data + API for Python)
Code for the paper: Don't Settle for Average, Go for the Max: Fuzzy Sets and Max-Pooled Word Vectors, ICLR 2019.
A GP-GPU/CPU Dynamic Time Warping (DTW) implementation for the analysis of Multivariate Time Series (MTS).
Speech (audio) subjective evaluation system
Code for the papers: Correlation Coefficients and Semantic Textual Similarity, NAACL-HLT 2019 & Correlations between Word Vector Sets, EMNLP-IJCNLP 2019.
Tool to estimate deltas for sequence sets and answer questions about relative contribution
Similarity and distance measures for clustering and record linkage applications in R
Repository containing all the code created for the lab sessions of CSE3018 Content Based Image and Video Retrieval at VIT University Chennai Campus
compimg - Python package for computing similarity between images
TreeMinHash: Fast Sketching for Weighted Jaccard Similarity Estimation