There are 1 repository under jaccard topic.
:dart: String metrics and phonetic algorithms for Scala (e.g. Dice/Sorensen, Hamming, Jaccard, Jaro, Jaro-Winkler, Levenshtein, Metaphone, N-Gram, NYSIIS, Overlap, Ratcliff/Obershelp, Refined NYSIIS, Refined Soundex, Soundex, Weighted Levenshtein).
Uncertainty-aware Semantic Segmentation of LiDAR Point Clouds for Autonomous Driving
set of functions and operators for executing similarity queries
Hierarchical, iterative clustering for analysis of transcriptomics data in R
Document similarity algorithms experiment - Jaccard, TF-IDF, Doc2vec, USE, and BERT.
A fuzzy matching string distance library for Scala and Java that includes Levenshtein distance, Jaro distance, Jaro-Winkler distance, Dice coefficient, N-Gram similarity, Cosine similarity, Jaccard similarity, Longest common subsequence, Hamming distance, and more..
String similarity functions, String distance's, Jaccard, Levenshtein, Hamming, Jaro-Winkler, Q-grams, N-grams, LCS - Longest Common Subsequence, Cosine similarity...
SetSketch: Filling the Gap between MinHash and HyperLogLog
Out-of-the-box analysis and reporting tools for twitter
Catalyst.Segmentation
Implementation of string distance algorithms in Dart
Distance/Similarity functions for Bag of Words, Strings, Vectors and more.
Skin lesion segmentation is one of the first steps towards automatic Computer-Aided Diagnosis of skin cancer. Vast variety in the appearance of the skin lesion makes this task very challenging. The contribution of this paper is to apply a power foreground extraction technique called GrabCut for automatic skin lesion segmentation in HSV color space with minimal human interaction. Preprocessing was performed for removing the outer black border. Jaccard Index was measured to evaluate the performance of the segmentation method. On average, 0.71 Jaccard Index was achieved on 1000 images from ISIC challenge 2017 Training Dataset.
TreeMinHash: Fast Sketching for Weighted Jaccard Similarity Estimation
Clustering similar tweets using K-means clustering algorithm and Jaccard distance metric
iamge Retrieva
PPJoin and P4Join Python 3 implementation
Simple API to recommend songs
calculate jaccard similarity using mapreduce framework
An ML+NLP solution for linking misspelled titles with the true titles
Code for Quora Competition on Kaggle
Highly optimized search for similar multisets
Rust jieba
Evaluation and agreement scripts for the DISCOSUMO project. Each evaluation script takes both manual annotations as automatic summarization output. The formatting of these files is highly project-specific. However, the evaluation functions for precision, recall, ROUGE, Jaccard, Cohen's kappa and Fleiss' kappa may be applicable to other domains too.
Testing Jaccard similarity and Cosine similarity techniques to calculate the similarity between two questions.
Just some implementations of word distance functions.
Implementation of some intern and extern clustering indexes
A tool to approximate the Jaccard similarity of bigBed files from functional genomic datasets
A utility library for comparing strings via the Jaccard similarity algorithm
This is the implementation of an algorithm that finds traceability links in two graphs such that the other graph is a perturbed version of the original graph.
Python package for fast MinHash calculation and operations