text-similarity

There are 23 repositories under text-similarity topic.

Resume-Matcher
srbhr / Resume-Matcher
Improve your resumes with Resume Matcher. Get insights, keyword suggestions and tune your resumes to job descriptions.
applicant-tracking-system ats hacktoberfest machine-learning natural-language-processing nextjs python resume resume-builder resume-parser text-similarity typescript vector-search word-embeddings
Language:Python 23856
text2vec
shibing624 / text2vec
text2vec, text to vector. 文本向量表征工具，把文本转化为向量矩阵，实现了Word2Vec、RankBM25、Sentence-BERT、CoSENT等文本表征、文本相似度计算模型，开箱即用。
embeddings nlp sentence-embeddings similarity text-similarity text2vec word2vec
Language:Python 4877
CLUEbenchmark / CLUEDatasetSearch
搜索所有中文NLP数据集，附常用英文NLP数据集
nlp datasets chinese ner qa match text-classification machine-translation knowledge-graph corpus machine-reading-comprehension sentiment-analysis text-similarity text-summarization
Language:Python 4391
NTMC-Community / awesome-neural-models-for-semantic-match
A curated list of papers dedicated to neural text (semantic) matching.
deep-learning information-retrieval neu-ir question-answering semantic-matching text-similarity
Language:HTML 781
murray-z / text_analysis_tools
中文文本分析工具包（包括- 文本分类 - 文本聚类 - 文本相似性 - 关键词抽取 - 关键短语抽取 - 情感分析 - 文本纠错 - 文本摘要 - 主题关键词-同义词、近义词-事件三元组抽取）
text-ana text-classification text-clustering text-similarity key-words sentiment-analysis spell-corrector text-summatizer event-extraction topic-keywords
Language:Python 721
SeanLee97 / AnglE
Train and Infer Powerful Sentence Embeddings with AnglE | 🔥 SOTA on STS and MTEB Leaderboard
dense-retrieval embeddings information-retrieval llama llama2 llm mteb rag retrieval-augmented-generation semantic-similarity semantic-textual-similarity sentence-embedding sentence-embeddings sentence-vector sts stsbenchmark text-embedding text-similarity text-vector text2vec
Language:Python 558
fanghon / antiplag
作业查重软件，它实现了程序代码、文档文本、图片之间的相似度检查。a code-similarity, text-similarity and image-similarity computation software for the codes, documents and images of assignment.
assignment code-similarity java jplag moss phash plagiarism text-similarity
Language:Java 400
nlpodyssey / cybertron
Cybertron: the home planet of the Transformers in Go
bart bert machine-translation question-answering zero-shot-classification bert-as-service transformers huggingface text-classification named-entity-recognition summarization text-categorization text-similarity translation deep-learning machine-learning natural-language-processing nlp
Language:Go 319
dolos
dodona-edu / dolos
:detective: Source code plagiarism detection
academic-dishonesty code-similarity collusion-detection dodona education fuzzy-matching hacktoberfest learn-to-code online-learning plagiarism plagiarism-checker plagiarism-checking plagiarism-detection plagiarism-detector plagiarism-prevention software-plagiarism source-code-analysis text-similarity
Language:TypeScript 310
cjymz886 / sentence-similarity
对四种句子/文本相似度计算方法进行实验与比较
sentence-similarity text-similarity word2vec cosinesimilarity bm25 idf
Language:Python 291
amansrivastava17 / lstm-siamese-text-similarity
⚛️ It is keras based implementation of siamese architecture using lstm encoders to compute text similarity
siamese-network deep-learning keras text-similarity sentence-similarity lstm lstm-neural-networks bidirectional-lstm
Language:Python 288
padeoe / cail2019
法研杯2019相似案例匹配第二名解决方案（附数据集和文档）,CAIL2020/2021司法考试赛道冠军队伍
bert text-similarity competition
Language:Python 252
awslabs / aws-ai-solution-kit
Machine Learning APIs for common use cases, include: General OCR (Simplified/Traditional Chinese), Custom OCR, Image Similarity, Object Recognition, Face Detection, Face Comparison, Human Image Segmentation, Human Attribute Recognition, Pornography Detection, Image Super Resolution, Text Similarity, Car License Plate, etc.
car-license-plate-recognition chinese-ocr deep-learning face-recognition human-segmentation image-similarity machine-learning ocr ocr-recognition optical-character-recognition simplified-chinese super-resolution text-similarity traditional-chinese
Language:Python 183
tlatkowski / multihead-siamese-nets
Implementation of Siamese Neural Networks built upon multihead attention mechanism for text semantic similarity task.
siamese-neural-network multihead-attention semantic-similarity deep-neural-networks siamese-lstm attention deep-architectures deep-learning text-similarity paraphrase paraphrase-identification nlp natural-language-processing quora-question-pairs snli multihead-attention-networks sentence-similarity tensorflow python3 siamese-cnn
Language:Jupyter Notebook 182
kiwirafe / xiangsi
中文文本相似度计算器
cosine-similarity minhash simhash text-similarity
Language:Python 162
lonePatient / TorchBlocks
A PyTorch-based toolkit for natural language processing
pytorch nlp text-classification triplet-loss siamese-network text-similarity multilabel-classification advertising bert transformers relation-classification named-entity-recognition
Language:Python 159
yaoxiaoyuan / mimix
Mimix: A Text Generation Tool and Pretrained Chinese Models
chinese-chatbot chinese-nlp gpt-2 poetry-generation question-generation seq2seq summarization text-similarity comment-generation essay-generation generative-qa product-description-generation product-review-generation pretrained-models novel-generation chinese-english-translator tag-generation spelling-correction vit clip
Language:Python 158
nityansuman / marvin
Web app to automatically generate subjective or an objective test and evaluate user responses without any human intervention in an efficient and automatic manner using machine learning and natural language processing.
flask-application final-year-project examination-system machine-learning natural-language-processing text-similarity nltk examination python flask
Language:CSS 115
IDEA-CCNL / GTS-Engine
GTS Engine: A powerful NLU Training System。GTS引擎（GTS-Engine）是一款开箱即用且性能强大的自然语言理解引擎，聚焦于小样本任务，能够仅用小样本就能自动化生产NLP模型。
natural-language-processing nli nlp pretrained-models python pytorch text text-classification text-similarity
Language:Python 93
ddangelov / RESTful-Top2Vec
Expose a Top2Vec model with a REST API.
rest-api top2vec semantic-search semantic-search-engine topic-modeling document-embedding word-embedding text-search text-similarity fastapi restful-api topic-model
Language:Python 92
adhaamehab / textblob-ar
Arabic support for textblob
nlp natural-language-processing machine-learning sentiment-analysis part-of-speech-tagger text-classification arabic-language arabic-nlp textblob text-similarity spelling-correction word-embeddings
Language:Python 86
hellonlp / sentence-similarity
文本相似度，语义向量，文本向量，text-similarity，similarity, sentence-similarity，BERT，SimCSE，BERT-Whitening，Sentence-BERT, PromCSE, SBERT
bert-embeddings similarity text-similarity bert sentence-embeddings sentence-similarity simcse sentence-bert whitening sbert promcse
Language:Python 75
zake7749 / CIKM-AnalytiCup-2018
[ACM-CIKM] 2nd place solution at CIKM AnalytiCup 2018, a task for determining short text similarities.
natural-language-understanding text-similarity semantic-matching semantic-similarity keras natural-language-processing cikm
Language:Python 75
Auto-Research
sidphbot / Auto-Research
Generate custom detailed survey paper with topic clustered sections and proper citations, from just a single query in just under 30 mins !!
nlp pytorch python research-software-engineering research-tool research-and-development research-data-management scientific-research scientific-publications arxiv arxiv-api text-generation text-clustering title-generation summarization pdf-document-processor topic-modeling text-similarity ocr
Language:Python 59
Lipairui / textgo
Text preprocessing, representation, similarity calculation, text search and classification. Let's go and play with text!
text-preprocessing nlp text-classification text-search text-similarity text-representation bert
Language:Python 45
xiaorancs / text-similarity
使用不同的方法计算相似度
text-similarity python
Language:Python 42
themaximalist / vectordb.js
Simple in-memory vector database for text similarity in Node.js
embeddings feature-extraction hnsw nodejs openai text-similarity vectordb
Language:HTML 39
giacbrd / python-dandelion-eu
A python client for connecting to all the services provided by https://dandelion.eu
python entity-extraction entity-linking machine-learning text-classification sentiment-analysis language-detection api-client api-wrapper api wikipedia wikipedia-api wikification text-similarity text-mining text-analysis semantic-analysis semantic-similarity
Language:Python 36
simphile-text-similarity-nlp
brianrisk / simphile-text-similarity-nlp
Python Text Similarity NLP Libray
library nlp text-classification text-similarity
Language:Python 34
siddgood / podcast-recommendation-engine
:microphone: Building a content-based podcast recommender system using NLP
recommendation recommender-system content-based-recommendation nlp similarity embeddings word2vec glove podcasts text-similarity text-analysis apple-podcasts itunes python nltk genism
Language:Jupyter Notebook 32
KeremZaman / semantic-sh
semantic-sh is a SimHash implementation to detect and group similar texts by taking power of word vectors and transformer-based language models (BERT).
simhash word-vectors fasttext bert locality-sensitive-hashing transformer text-similarity text-clustering text-search
Language:Python 28
yongzhuo / near-synonym
near-synonym, 基于大模型LLM的中文反义词/近义词(antonyms/synonyms)工具包. 也可计算词语相似度/句子相似度/文本相似度等。
antonym antonyms near-antonym near-antonyms near-synonym near-synonyms sentence-similarity similarity synonyms text-similarity word-similarity
Language:Python 28
ZhengZixiang / chip2019_task2_question_pairs_matching
CHIP 2019平安医疗科技疾病问答迁移学习比赛baseline，rank7
bert quora-question-pairs natural-language-inference text-similarity semantic-similarity
Language:Python 28
amansrivastava17 / bns-short-text-similarity
📖 Use Bi-normal Separation to find document vectors which is used to compute similarity for shorter sentences.
bns text-vectorization nlp cosine-similarity text-similarity text-classification bns-vectorizer tf-idf term-frequency short-text-semantic-similarity
Language:Python 27
Shivamrai15 / Text-Similarity
Two-part information retrieval system: 1) Pre-process text files, generate TF-IDF matrix and inverted index. 2) Retrieve relevant documents ranked by cosine similarity for given queries.
cosine-similarity inverted-index preprocessing python sklearn stemming text-similarity tf-idf tfidf
Language:Python 24
piotrmaciejbednarski / text-similarity-node
High-performance and memory efficient native C++ text similarity algorithms for Node.js
cosine-similarity cpp17 cpp20 damerau-levenshtein hamming jaccard jaro jaro-winkler levenshtein nlp nodejs npm-package sorensen-dice string-similarity text-similarity tversky textdistance
Language:C++ 21

text-similarity

srbhr / Resume-Matcher

shibing624 / text2vec

CLUEbenchmark / CLUEDatasetSearch

NTMC-Community / awesome-neural-models-for-semantic-match

murray-z / text_analysis_tools

SeanLee97 / AnglE

fanghon / antiplag

nlpodyssey / cybertron

dodona-edu / dolos

cjymz886 / sentence-similarity

amansrivastava17 / lstm-siamese-text-similarity

padeoe / cail2019

awslabs / aws-ai-solution-kit

tlatkowski / multihead-siamese-nets

kiwirafe / xiangsi

lonePatient / TorchBlocks

yaoxiaoyuan / mimix

nityansuman / marvin

IDEA-CCNL / GTS-Engine

ddangelov / RESTful-Top2Vec

adhaamehab / textblob-ar

hellonlp / sentence-similarity

zake7749 / CIKM-AnalytiCup-2018

sidphbot / Auto-Research

Lipairui / textgo

xiaorancs / text-similarity

themaximalist / vectordb.js

giacbrd / python-dandelion-eu

brianrisk / simphile-text-similarity-nlp

siddgood / podcast-recommendation-engine

KeremZaman / semantic-sh

yongzhuo / near-synonym

ZhengZixiang / chip2019_task2_question_pairs_matching

amansrivastava17 / bns-short-text-similarity

Shivamrai15 / Text-Similarity

piotrmaciejbednarski / text-similarity-node