Dr. Yong CHENG's repositories
Awesome-Chinese-LLM
整理开源的中文大语言模型,以规模较小、可私有化部署、训练成本较低的模型为主,包括底座模型,垂直领域微调及应用,数据集与教程等。
Awesome-Video-Diffusion-Models
[Arxiv] A Survey on Video Diffusion Models
calculate-flops.pytorch
The calflops is designed to calculate FLOPs、MACs and Parameters in all various neural networks, such as Linear、 CNN、 RNN、 GCN、Transformer(Bert、LlaMA etc Large Language Model)
ControlNet
Let us control diffusion models!
DeepLearningSystem
Deep Learning System core principles introduction.
DeepSpeedExamples
Example models using DeepSpeed
Efficient-LLMs-Survey
Efficient Large Language Models: A Survey
FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
FastCkpt
Python package for rematerialization-aware gradient checkpointing
Flowise
Drag & drop UI to build your customized LLM flow
langflow
⛓️ Langflow is a UI for LangChain, designed with react-flow to provide an effortless way to experiment and prototype flows.
LAVIS
LAVIS - A One-stop Library for Language-Vision Intelligence
lightllm
LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.
LLaMA-Efficient-Tuning
Easy-to-use fine-tuning framework using PEFT (PT+SFT+RLHF with QLoRA) (LLaMA-2, BLOOM, Falcon, Baichuan, Qwen, ChatGLM2)
Llama2-Chinese
Llama中文社区,最好的中文Llama大模型,完全开源可商用
llm-action
LLM 实战
llm-inference-benchmark
LLM Inference benchmark
llm-numbers
Numbers every LLM developer should know
llm-resource
LLM全栈优质资源汇总
LLMs-In-China
**大模型
LLMsPracticalGuide
A curated list of practical guide resources of LLMs (LLMs Tree, Examples, Papers)
LLMSurvey
The official GitHub page for the survey paper "A Survey of Large Language Models".
LocalAI
:robot: Self-hosted, community-driven, local OpenAI compatible API. Drop-in replacement for OpenAI running LLMs on consumer-grade hardware. Free Open Source OpenAI alternative. No GPU required. Runs ggml, GPTQ, onnx, TF compatible models: llama, gpt4all, rwkv, whisper, vicuna, koala, gpt4all-j, cerebras, falcon, dolly, starcoder, and many others
NeMo
NeMo: a toolkit for conversational AI
OpenRLHF
A Ray-based High-performance RLHF framework (for 34b+ models)
starcoder
Home of StarCoder: fine-tuning & inference!
TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
text-generation-webui
A gradio web UI for running Large Language Models like LLaMA, llama.cpp, GPT-J, OPT, and GALACTICA.