kingfan1998's starred repositories

rnn

Some RNN implementations

Language: Python | Stargazers: 47 | Issues: 0

RWKV-LM

RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best of RNN and transformer - great performance, fast inference, saves VRAM, fast training, "infinite" ctx_len, and free sentence embedding.

Language: Python | License: Apache-2.0 | Stargazers: 12052 | Issues: 0
Language: Python | License: Apache-2.0 | Stargazers: 2470 | Issues: 0

LOMO

LOMO: LOw-Memory Optimization

Language: Python | License: MIT | Stargazers: 958 | Issues: 0
Language: Python | Stargazers: 128 | Issues: 0

UltraChat

Large-scale, Informative, and Diverse Multi-round Chat Data (and Models)

Language: Python | License: MIT | Stargazers: 2182 | Issues: 0

Document-Plugin

Plug-and-Play Document Modules for Pre-trained Models

Language: Python | Stargazers: 25 | Issues: 0

tree-of-thought-llm

[NeurIPS 2023] Tree of Thoughts: Deliberate Problem Solving with Large Language Models

Language: Python | License: MIT | Stargazers: 4449 | Issues: 0

qlora

QLoRA: Efficient Finetuning of Quantized LLMs

Language: Jupyter Notebook | License: MIT | Stargazers: 9759 | Issues: 0

CSPostgraduate-408

💯 CSPostgraduate: course materials and past exam papers for the 408 computer science postgraduate entrance exam

Language: C++ | Stargazers: 4543 | Issues: 0

AlignScore

ACL2023 - AlignScore, a metric for factual consistency evaluation.

Language: Python | License: MIT | Stargazers: 99 | Issues: 0

AlgoXY

Book of Elementary Functional Algorithms and Data structures

Language: TeX | Stargazers: 6025 | Issues: 0

text-to-text-transfer-transformer

Code for the paper "Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"

Language: Python | License: Apache-2.0 | Stargazers: 6050 | Issues: 0

sentencepiece

Unsupervised text tokenizer for Neural Network-based text generation.

Language: C++ | License: Apache-2.0 | Stargazers: 9894 | Issues: 0

Distinct-N

Compute Distinct-N metric proposed by Jiwei Li et al.

Language: Python | License: MIT | Stargazers: 110 | Issues: 0
Language: Python | Stargazers: 433 | Issues: 0

Luotuo-QA

Luotuo QA, a Chinese large language model for reading comprehension.

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 70 | Issues: 0
Language: Python | License: Apache-2.0 | Stargazers: 1237 | Issues: 0

CPT

CPT: A Pre-Trained Unbalanced Transformer for Both Chinese Language Understanding and Generation

Language: Python | Stargazers: 475 | Issues: 0

ChatGLM-6B

ChatGLM-6B: An Open Bilingual Dialogue Language Model

Language: Python | License: Apache-2.0 | Stargazers: 40148 | Issues: 0

SeqDiffuSeq

Text Diffusion Model with Encoder-Decoder Transformers for Sequence-to-Sequence Generation [NAACL 2024]

Language: Python | Stargazers: 82 | Issues: 0

DiffuSeq

Official Codebase for DiffuSeq

Stargazers: 1 | Issues: 0

zero_nlp

Chinese NLP solutions (large models, data, models, training, inference)

Language: Jupyter Notebook | License: MIT | Stargazers: 2731 | Issues: 0

ChatGLM-Finetuning

Fine-tuning of the ChatGLM-6B, ChatGLM2-6B, and ChatGLM3-6B models on specific downstream tasks, covering Freeze, LoRA, P-tuning, full-parameter fine-tuning, and more

Language: Python | Stargazers: 2588 | Issues: 0
Language: Python | Stargazers: 15 | Issues: 0

awesome-pretrained-chinese-nlp-models

Awesome Pretrained Chinese NLP Models: a collection of high-quality Chinese pretrained models, large models, multimodal models, and large language models

Language: Python | License: MIT | Stargazers: 4597 | Issues: 0

MT5ForGeneration

An implementation of a seq2seq architecture based on a slimmed-down mT5 pretrained model

Language: Python | Stargazers: 4 | Issues: 0

MT5_chinese_simplify

Streamlined Chinese code for the PyTorch version of the mT5 model

Language: Python | Stargazers: 15 | Issues: 0

turkish-question-generation

Automated question generation and question answering from Turkish texts using text-to-text transformers

Language: Python | License: MIT | Stargazers: 42 | Issues: 0

pytorch_med_T5-large_scale_pretraining_and_fientune-

Training, validation, and testing of large-scale pretrained medical NLP models based on T5 and mT5

Language: Python | License: MIT | Stargazers: 3 | Issues: 0