MOMO8966

MOMO8966

Geek Repo

Github PK Tool:Github PK Tool

MOMO8966's starred repositories

hybrid-jaccard

Implementation of hybrid jaccard similarity

Language:PythonLicense:Apache-2.0Stargazers:5Issues:0Issues:0

multi-head-selft-attention-lstm

在sts数据集上用多头注意力机制上进行测试。 pytorch torchtext 代码简练,非常适合新手了解多头注意力机制的运作。不想transformer牵扯很多层 multi-head attention + one layer lstm

Language:PythonStargazers:15Issues:0Issues:0

semantic_similarity

Comparing TF-IDF, fastText, LASER, Sentence-BERT & USE for semantic similarity. One test with STS Benchmark and one test with self-made sentences.

Stargazers:8Issues:0Issues:0

NLP-beginner-Task3

基于注意力机制的文本匹配

Language:PythonStargazers:5Issues:0Issues:0

chat

基于seq2seq的聊天系统,使用LSTM/GRU+注意力机制。使用框架pytorch。

Language:PythonStargazers:11Issues:0Issues:0

IR-project

try different method on TaipeiQA and LCQMC datasets

Language:PythonStargazers:2Issues:0Issues:0

Chinese-sentence-pair-modeling

Use deep models including BiLSTM, ABCNN, ESIM, RE2, BERT, etc. and evaluate on 5 Chinese NLP datasets: LCQMC, BQ Corpus, ChineseSTS, OCNLI, CMNLI

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:74Issues:0Issues:0

CHlikelihood

用于比较两个中文句子相似度的工具

Language:PythonLicense:MITStargazers:30Issues:0Issues:0

stopwords

中文常用停用词表(哈工大停用词表、百度停用词表等)

Stargazers:4548Issues:0Issues:0

BertSimilarity

Computing similarity of two sentences with google's BERT algorithm。利用Bert计算句子相似度。语义相似度计算。文本相似度计算。

Language:PythonStargazers:489Issues:0Issues:0

document-similarity

Document Similarity using Word2Vec

Language:PythonLicense:MITStargazers:102Issues:0Issues:0

Similarity

Calculate similarity between documents using TF-IDF weights

Language:RubyLicense:NOASSERTIONStargazers:115Issues:0Issues:0

textreuse

Detect text reuse and document similarity

Language:RStargazers:195Issues:0Issues:0

doc-similarity

Ranking documents using semantic similarity in Python

Language:Jupyter NotebookLicense:MITStargazers:35Issues:0Issues:0

document_similarity_algorithms_experiments

Document similarity algorithms experiment - Jaccard, TF-IDF, Doc2vec, USE, and BERT.

Language:PythonStargazers:83Issues:0Issues:0

sentence-similarity

对四种句子/文本相似度计算方法进行实验与比较

Language:PythonLicense:MITStargazers:289Issues:0Issues:0

Similarity

文本相似度算法

Language:PythonStargazers:39Issues:0Issues:0

text-similarity

用TF特征向量和simhash指纹计算中文文本的相似度

Language:PythonStargazers:211Issues:0Issues:0

simple-effective-text-matching-pytorch

A pytorch implementation of the ACL2019 paper "Simple and Effective Text Matching with Richer Alignment Features".

Language:PythonLicense:Apache-2.0Stargazers:304Issues:0Issues:0

Text-Similarity

Text-Similarity Method in Pytorch

Language:PythonStargazers:465Issues:0Issues:0

BiMPM

BiMPM: Bilateral Multi-Perspective Matching for Natural Language Sentences

Language:PythonLicense:Apache-2.0Stargazers:440Issues:0Issues:0

ChineseVLBert

中文领域的多模态Bert

Stargazers:45Issues:0Issues:0

Trial2Vec

Findings of EMNLP'22 | Trial2Vec: Zero-Shot Clinical Trial Document Similarity Search using Self-Supervision

Language:PythonLicense:MITStargazers:19Issues:0Issues:0

aspect-document-embeddings

Code, dataset & models for the paper Specialized Document Embeddings for Aspect-based Similarity of Research Papers (#JCDL2022)

Language:Jupyter NotebookStargazers:11Issues:0Issues:0

semeval-code

GateNLP-UShef at SemEval-2022 Task 8: Entity-Enriched Siamese Transformer for Multilingual News Article Similarity

Language:PythonStargazers:4Issues:0Issues:0

deep-siamese-text-similarity

Tensorflow based implementation of deep siamese LSTM network to capture phrase/sentence similarity using character/word embeddings

Language:PythonLicense:MITStargazers:1403Issues:0Issues:0

text2vec

text2vec, text to vector. 文本向量表征工具,把文本转化为向量矩阵,实现了Word2Vec、RankBM25、Sentence-BERT、CoSENT等文本表征、文本相似度计算模型,开箱即用。

Language:PythonLicense:Apache-2.0Stargazers:4352Issues:0Issues:0

sentence-transformers

Multilingual Sentence & Image Embeddings with BERT

Language:PythonLicense:Apache-2.0Stargazers:14678Issues:0Issues:0

BERT-whitening

简单的向量白化改善句向量质量

Language:PythonStargazers:479Issues:0Issues:0
Language:PythonStargazers:3Issues:0Issues:0