tzSmilence's repositories

Pre-trained-Models

预训练语言模型综述

2020-tianchi-ChMedNER

2020 “万创杯”中医药天池大数据竞赛——中药说明书实体识别挑战 复盘

Stargazers:0Issues:1Issues:0

Awesome-Chinese-LLM

整理开源的中文大语言模型,以规模较小、可私有化部署、训练成本较低的模型为主,包括底座模型,垂直领域微调及应用,数据集与教程等。

Stargazers:0Issues:0Issues:0

BELLE

BELLE: Be Everyone's Large Language model Engine(开源中文对话大模型)

Language:HTMLLicense:Apache-2.0Stargazers:0Issues:0Issues:0

bert_for_corrector

基于bert进行中文文本纠错

Language:PythonStargazers:0Issues:0Issues:0

ChatGLM-6B

ChatGLM-6B: An Open Bilingual Dialogue Language Model | 开源双语对话语言模型

License:Apache-2.0Stargazers:0Issues:0Issues:0

ChatGLM-Finetuning

基于ChatGLM-6B、ChatGLM2-6B、ChatGLM3-6B模型,进行下游具体任务微调,涉及Freeze、Lora、P-tuning、全参微调等

Stargazers:0Issues:0Issues:0

CLUEDatasetSearch

搜索所有中文NLP数据集,附常用英文NLP数据集

Language:PythonStargazers:0Issues:1Issues:0

CS-Book

计算机类常用电子书整理,并且附带下载链接,包括Java,Python,Linux,Go,C,C++,数据结构与算法,人工智能,计算机基础,面试,设计模式,数据库,前端等书籍

Stargazers:0Issues:1Issues:0

d2l-zh

《动手学深度学习》:面向中文读者、能运行、可讨论。中英文版被全球140所大学采用教学。

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0

Deep-learning

Share some deep learning knowledge and reproduce the model framework

Language:PythonStargazers:0Issues:0Issues:0

jieba

结巴中文分词

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

joyful-pandas

Pandas中文教程

Language:HTMLLicense:NOASSERTIONStargazers:0Issues:1Issues:0

Keras-TextClassification

中文长文本分类、短句子分类、多标签分类、两句子相似度(Chinese Text Classification of Keras NLP, multi-label classify, or sentence classify, long or short),字词句向量嵌入层(embeddings)和网络层(graph)构建基类,FastText,TextCNN,CharCNN,TextRNN, RCNN, DCNN, DPCNN, VDCNN, CRNN, Bert, Xlnet, Albert, Attention, DeepMoji, HAN, 胶囊网络-CapsuleNet, Transformer-encode, Seq2seq, SWEM, LEAM, TextGCN

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

leetcode-master

LeetCode 刷题攻略:配思维导图,将近200道经典算法题目刷题顺序、经典算法模板、共60w字的详细图解,以及难点视频题解。按照刷题攻略上的顺序来刷题,让你在算法学习上不再迷茫!🔥🔥给个star支持一下吧!🚀

Stargazers:0Issues:1Issues:0

libtorch_tokenizer

BERT Tokenizer in C++

Language:C++Stargazers:0Issues:1Issues:0

ner

命名体识别(NER)综述-论文-模型-代码(BiLSTM-CRF/BERT-CRF)-竞赛资源总结-随时更新

Stargazers:0Issues:1Issues:0

PaddleNLP

👑 Easy-to-use and powerful NLP library with 🤗 Awesome model zoo, supporting wide-range of NLP tasks from research to industrial applications, including 🗂Text Classification, 🔍 Neural Search, ❓ Question Answering, ℹ️ Information Extraction, 📄 Document Intelligence, 💌 Sentiment Analysis and 🖼 Diffusion AIGC system etc.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

pytorch-book

PyTorch tutorials and fun projects including neural talk, neural style, poem writing, anime generation (《深度学习框架PyTorch:入门与实战》)

Language:Jupyter NotebookLicense:MITStargazers:0Issues:1Issues:0

pytorch-loss

label-smooth, amsoftmax, partial-fc, focal-loss, triplet-loss, lovasz-softmax. Maybe useful

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

pytorch_chinese_lm_pretrain

pytorch中文语言模型预训练

Language:PythonStargazers:0Issues:1Issues:0

SimCSE

SimCSE有监督与无监督实验复现

Language:PythonStargazers:0Issues:0Issues:0
Language:PythonStargazers:0Issues:0Issues:0

stopwords

中文常用停用词表(哈工大停用词表、百度停用词表等)

Stargazers:0Issues:0Issues:0

team-learning-data-mining

主要存储Datawhale组队学习中“数据挖掘/机器学习”方向的资料。

Language:Jupyter NotebookStargazers:0Issues:1Issues:0

team-learning-program

主要存储Datawhale组队学习中“编程、数据结构与算法”方向的资料。

Language:Jupyter NotebookStargazers:0Issues:1Issues:0

Tech_Aarticle

主要是我是日常看过的不错的文章的资源汇总,方便自己也分享给大家。有些我看过的,就会做简单的解读,没看过的,就先罗列一下,然后之后看了把解读更新上;涉及到搜索/推荐/自然语言处理。

Stargazers:0Issues:1Issues:0

TextMatch

基于Pytorch的,中文语义相似度匹配模型(ABCNN、Albert、Bert、BIMPM、DecomposableAttention、DistilBert、ESIM、RE2、Roberta、SiaGRU、XlNet)

Language:PythonStargazers:0Issues:1Issues:0

the-gan-zoo

A list of all named GANs!

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

three-method-to-find-synonyms

【Demo】找寻近义词的三种方法

Language:PythonStargazers:0Issues:0Issues:0