Zian(Andy) Zheng's starred repositories
long_llama
LongLLaMA is a large language model capable of handling long contexts. It is based on OpenLLaMA and fine-tuned with the Focused Transformer (FoT) method.
The-Art-of-Linear-Algebra
Graphic notes on Gilbert Strang's "Linear Algebra for Everyone"
ShareGPTQAExtractor-mnbvc
MNBVC项目-ShareGPT语料清洗
Awesome-Chinese-LLM
整理开源的中文大语言模型,以规模较小、可私有化部署、训练成本较低的模型为主,包括底座模型,垂直领域微调及应用,数据集与教程等。
awesome-pretrained-chinese-nlp-models
Awesome Pretrained Chinese NLP Models,高质量中文预训练模型&大模型&多模态模型&大语言模型集合
Baichuan-7B
A large-scale 7B pretraining language model developed by BaiChuan-Inc.
awesome-lm-system
Summary of system papers/frameworks/codes/tools on training or serving large model
Awesome-Mixture-of-Experts-Papers
A curated reading list of research in Mixture-of-Experts(MoE).
Awesome-Multimodal-Large-Language-Models
:sparkles::sparkles:Latest Advances on Multimodal Large Language Models
PromptPapers
Must-read papers on prompt-based tuning for pre-trained language models.
awesome-huge-models
A collection of AWESOME things about HUGE AI models.
CS224n-Reading-Notes
CS224n Reading Notes in Chinese 中文阅读笔记
Bert_related
Data preparations for training Bert
wikiextractor
A tool for extracting plain text from Wikipedia dumps
Tensorflow-101
TensorFlow Tutorials
keras_bert_multi_label_cls
本项目采用Keras和Keras-bert实现文本多标签分类任务,对BERT进行微调。
data-centric-AI
A curated, but incomplete, list of data-centric AI resources.
LLMsPracticalGuide
A curated list of practical guide resources of LLMs (LLMs Tree, Examples, Papers)