There are 2 repositories under similarity-score topic.
DataGene - Identify How Similar TS Datasets Are to One Another (by @firmai)
String Distances in Julia
Tika-Similarity uses the Tika-Python package (Python port of Apache Tika) to compute file similarity based on Metadata features.
Document Similarity using Word2Vec
Neural network based similarity scoring for diarization (pytorch implementation of "LSTM based Similarity Measurement with Spectral Clustering for Speaker Diarization")
Make plagiarism detection easier. This script will find similar sentences between given files and highlight them in a side by side comparison.
Keras implementation of "SimGNN: A Neural Network Approach to Fast Graph Similarity Computation". Includes synthetic GED data.
A perceptual hash is a fingerprint of a multimedia file derived from various features from its content. Unlike cryptographic hash functions which rely on the avalanche effect of small changes in input leading to drastic changes in the output, perceptual hashes are "close" to one another if the features are similar.
This repository consists of all the code required for similar 2-D pose detection in dance videos. This can used for any type of pose estimation application to find the similarity.
Code for NLPCC2016 Chinese Word Similarity Task
Symmetric Delete spelling correction algorithm using Java
A MatchMaker Exchange server
Parallel all-pairs similarity search algorithms in ocaml #ocaml
A NLP algorithm I developed to determine the similarity or relation between two documents/Wikipedia articles. Inspired by the cosine similarity algorithm and built from WordNet.
A text similarity metric library, e.g. from edit distance's (Levenshtein, Gotoh, Jaro, etc) to other metrics, (e.g Soundex, Chapman). This library is compiled based on the .NET standard with a lot of useful extension methods.
Quantifying Pairwise Chemical Similarity for Polymers
A perfume recommendation system
A movie recommender served with a Flask-restful app
EleKit2 computes the electrostatic complementarity between a docked ligand and its protein receptor
A tool for semantic textual similarity annotation
Recommendation system built using multiple ML models that aim to predict users' interests based on their past behavior and preferences.
This JavaScript implementation detects the areas where two DNA/RNA/protein sequences are similar to each other. All symbols from UTF-8 are accepted by this algorithm.
EleKit measures the similarity of electrostatic potentials between a small molecule and a protein.
Digits Recognizer using correlation and similarity methods in MNIST Letters dataset.
A Python console application that calculates the similarity rate between 3 images. Using OpenCV and Matplotlib libraries.
Context-Aware Semantic Similarity Measurement for Unsupervised Word Sense Disambiguation
All NLP related courses on DataCamp
Build a model to Enrich the Customer Master Data by searching for each Hotel Restaurant Cafe and Fast Food outlet its corresponding entry in TripAdvisor.
An ML API to compute similarity scores between shingled sentence examples.