Chen Yang's starred repositories
Megatron-DeepSpeed
Ongoing research training transformer language models at scale, including: BERT & GPT-2
Megatron-LM
Ongoing research training transformer models at scale
LLM-Merging
LLM-Merging: Building LLMs Efficiently through Merging
Index-1.9B
A SOTA lightweight multilingual LLM
JupyterNotebook
服务,项目,实验 Jupyter Notebook
direct-preference-optimization
Reference implementation for DPO (Direct Preference Optimization)
LLaMA-Factory
Efficiently Fine-Tune 100+ LLMs in WebUI (ACL 2024)
Prompt-Engineering-Guide
🐙 Guides, papers, lecture, notebooks and resources for prompt engineering
openai_api_call
The original version of https://github.com/cubenlp/chatapi_toolkit
CausalLM-Lit
基于 Lightning 的训练语言模型的框架,目前限定模型架构为 DecoderOnly,方便自定义数据,并且集成 trl 强化学习框架。
CausalLM-Lit
基于 Lightning 的训练语言模型的框架,目前限定模型架构为 DecoderOnly,方便自定义数据,并且集成 trl 强化学习框架。
openai-cookbook
Examples and guides for using the OpenAI API
DecryptPrompt
总结Prompt&LLM论文,开源数据&模型,AIGC应用
TextGAN-PyTorch
TextGAN is a PyTorch framework for Generative Adversarial Networks (GANs) based text generation models.
English-to-IPA
Converts English text to IPA notation
street-fighter-ai
This is an AI agent for Street Fighter II Champion Edition.
gpt_academic
为GPT/GLM等LLM大语言模型提供实用化交互接口,特别优化论文阅读/润色/写作体验,模块化设计,支持自定义快捷按钮&函数插件,支持Python和C++等项目剖析&自译解功能,PDF/LaTex论文翻译&总结功能,支持并行问询多种LLM模型,支持chatglm3等本地模型。接入通义千问, deepseekcoder, 讯飞星火, 文心一言, llama2, rwkv, claude2, moss等。