Yiwen-Yang-666's starred repositories
Baichuan-7B
A large-scale 7B pretrained language model developed by BaiChuan-Inc.
CLUEDatasetSearch
Search across all Chinese NLP datasets, with commonly used English NLP datasets included.
ChineseNLPCorpus
Chinese natural language processing datasets, material for everyday experiments. Additions and pull requests are welcome.
LangChain-Chinese-Getting-Started-Guide
A Chinese-language getting-started tutorial for LangChain
moss-finetune-and-moss-finetune-int8
Implements int8 fine-tuning for MOSS and fixes the model-saving issue in the original MOSS project
mesh-transformer-jax
Model parallel transformers in JAX and Haiku
bert_and_ernie
TensorFlow code and pre-trained models for BERT and ERNIE
Chinese-BERT-wwm
Pre-Training with Whole Word Masking for Chinese BERT (Chinese BERT-wwm model series)
BERT4doc-Classification
Code and source for the paper "How to Fine-Tune BERT for Text Classification?"
GAN-BERT-CRF
An idea that takes advantage of deep learning features to use unannotated samples for NER and to identify sequences with erroneous labels.
Multi-Label-Text-Classification
Multi-label text classification based on neural networks.
scikit-multilearn
A scikit-learn based module for multi-label et al. classification
lmtc-eurlex57k
Large-Scale Multi-Label Text Classification on EU Legislation
ML_Net-1
ML-Net is a novel end-to-end deep learning framework for multi-label classification of biomedical tasks. ML-Net combines the label prediction network with a label count prediction network, which can determine the output labels based on both label confidence scores and document context in an end-to-end manner.
BlurbGenreCollection-HMC
Hierarchical multi-label text classification of the BlurbGenreCollection using capsule networks.
knowledge-net
KnowledgeNet: A Benchmark Dataset for Knowledge Base Population
NLP-progress
Repository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the most common NLP tasks.
bert-relation-classification
A PyTorch implementation of BERT-based relation classification
generative-models
Collection of generative models, e.g. GAN, VAE, in PyTorch and TensorFlow.
sequence_tagging
Named Entity Recognition (LSTM + CRF) - TensorFlow