dudulry

杜杜里's starred repositories

CLUEPretrainedModels

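A collection of high-quality Chinese pre-trained models: state-of-the-art large models, the fastest small models, and models specialized for similarity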

Language: Python · 79600

FQA-question-answer

An FAQ-style question answering system based on deep learning

Language: Python · 100

nlp_paper_study_qa

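This repository mainly collects study notes on top-conference papers relevant to NLP algorithm engineers (question answering volume)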

1900

ChineseEmbedding

A collection of Chinese embeddings for natural language processing, covering five types of vectors: token (character), POS tag, pinyin, dependency, and word embeddings.

Language: Python · 44900

CoSENT_Pytorch

CoSENT, STS, SentenceBERT

Language: Python · 15900

text2vec

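text2vec, text to vector. A text vector representation tool that converts text into vector matrices. It implements text representation and text similarity models such as Word2Vec, RankBM25, Sentence-BERT, and CoSENT, and works out of the box.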

Language: Python · Apache-2.0 · 434200

text2vec

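text2vec, text to vector. A text vector representation tool that converts text into vector matrices. It implements text representation and text similarity models such as Word2Vec, RankBM25, Sentence-BERT, and CoSENT, and works out of the box.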

Apache-2.0 · 100

TextMatch

PyTorch-based Chinese semantic similarity matching models (ABCNN, Albert, Bert, BIMPM, DecomposableAttention, DistilBert, ESIM, RE2, Roberta, SiaGRU, XlNet)

Language: Python · 77800

Chinese-Word-Vectors

100+ Chinese Word Vectors: more than a hundred kinds of pre-trained Chinese word vectors

Language: Python · Apache-2.0 · 1172800

sharpened-cosine-similarity

An alternative to convolution in neural networks

Language: Python · MIT · 24700

atec2018-nlp

A write-up of the competition task from the 2018 Ant Financial "Financial Brain" contest

Language: Python · 15100

Chinese-BERT-wwm

Pre-Training with Whole Word Masking for Chinese BERT (the Chinese BERT-wwm model series)

Language: Python · Apache-2.0 · 950500

roberta_zh

Pre-trained Chinese RoBERTa models: RoBERTa for Chinese

Language: Python · 257800

ChatGLM-6B

ChatGLM-6B: An Open Bilingual Dialogue Language Model

Language: Python · Apache-2.0 · 4025000

lightNLP

A deep learning framework for natural language processing, based on PyTorch and torchtext.

Language: Python · Apache-2.0 · 82200

sentence_sim

Language: Python · 600

ShortTextMatching

A short-text matching system built on the Lucene full-text search engine

Language: Java · 400

RoFormer_pytorch

RoFormer V1 & V2 pytorch

Language: Python · Apache-2.0 · 45200

CLUEDatasetSearch

Search all Chinese NLP datasets, with commonly used English NLP datasets included

Language: Python · 405700

CLUECorpus2020

Large-scale Pre-training Corpus for Chinese: a 100G Chinese pre-training corpus

MIT · 90400

nCoV-2019-sentence-similarity

Tianchi COVID-19 similar sentence pair judgment competition, Rank 8

Language: Python · 5200

epidemic-sentence-pair

First-place solution on the online leaderboard of the Tianchi epidemic similar sentence pair judgment competition

Language: Python · 42700

kkndme_tianya

The legendary Tianya forum thread by kkndme discussing housing prices

100

text_matching

TensorFlow implementations of commonly used text matching models; the dataset is QA_corpus; continuously updated

Language: Python · Apache-2.0 · 67200

MatchZoo

Facilitating the design, comparison and sharing of deep text matching models.

Language: Python · Apache-2.0 · 382900

deep_text_matching

Implementations of several deep text matching (text similarity) models for Keras: cdssm, arc-ii, match_pyramid, mvlstm, esim, drcn, bimpm, bert, albert, roberta

Language: Python · 28700

PACKD

The official implementation of [ACMMM2022] Pay Attention to Your Positive Pairs: Positive Pair Aware Contrastive Knowledge Distillation

Language: Python · 1100

nlp-basictasks

A simple framework for building some basic NLP tasks

Language: Jupyter Notebook · MIT · 5900

SimCSE-Pytorch

An implementation of SimCSE + ESimCSE on Chinese datasets

Language: Python · MIT · 18400

chinese_sentence_embeddings

Chinese sentence embedding representations: bert_avg, bert_whitening, sbert, consert, simcse, esimcse

Language: Python · 1600