powergiant's starred repositories
LLamaTuner
Easy and efficient fine-tuning of LLMs (supports LLama, LLama2, LLama3, Qwen, Baichuan, GLM, Falcon). Efficient quantized training and deployment of large models.
LangChain-Chinese-Getting-Started-Guide
A Chinese-language getting-started tutorial for LangChain
alignment-handbook
Robust recipes to align language models with human and AI preferences
GPT-SoVITS
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
rust-autograd
Tensors and differentiable operations (like TensorFlow) in Rust
pytorch-meta-optimizer
A PyTorch implementation of Learning to learn by gradient descent by gradient descent
Chinese-LLaMA-Alpaca-2
Chinese LLaMA-2 & Alpaca-2 LLMs, phase two of the project, with 64K long-context models
baby-llama2-chinese
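A repository for pretraining from scratch and then applying SFT to a small-parameter Chinese LLaMa2; a single 24 GB GPU is enough to run it and obtain a chat-llama2 capable of simple Chinese question answering.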
hearthstone-ai
A Hearthstone AI based on Monte Carlo tree search and neural nets written in modern C++.
Multi-Agent-Reinforcement-Learning-papers
Multi-Agent Reinforcement Learning (MARL) papers
Reinforcement-Learning-Papers
📚 List of top-tier conference papers on Reinforcement Learning (RL), including NeurIPS, ICML, AAAI, IJCAI, AAMAS, ICLR, ICRA, etc.
Deep-reinforcement-learning-with-pytorch
PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and ....
reinforcement-learning-an-introduction
Python implementation of Reinforcement Learning: An Introduction
LLaMA-Factory
Efficiently Fine-Tune 100+ LLMs in WebUI (ACL 2024)
Spec-Bench
Spec-Bench: A Comprehensive Benchmark and Unified Evaluation Platform for Speculative Decoding (ACL 2024 Findings)
MARL-Papers
Paper list of multi-agent reinforcement learning (MARL)