Beast code in Giters

powergiant's starred repositories

LLM-Dojo

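Welcome to LLM-Dojo, an open-source place to learn about large language models. It uses concise, readable code to build a model training framework (supporting mainstream models such as Qwen, Llama, and GLM), an RLHF framework (DPO/CPO/KTO/PPO), and other features. 👩‍🎓👨‍🎓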

Language: Python | 26900

LLamaTuner

Easy and efficient fine-tuning of LLMs (supports LLama, LLama2, LLama3, Qwen, Baichuan, GLM, Falcon). Efficient quantized training and deployment of large models.

Language: Python | Apache-2.0 | 56900

LangChain-Chinese-Getting-Started-Guide

A Chinese-language getting-started tutorial for LangChain

735900

alignment-handbook

Robust recipes to align language models with human and AI preferences

Language: Python | Apache-2.0 | 453400

ml-mgie

Language: Python | NOASSERTION | 384100

AnyText

Official implementation code of the paper <AnyText: Multilingual Visual Text Generation And Editing>

Language: Python | Apache-2.0 | 424500

Qwen-VL

The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.

Language: Python | NOASSERTION | 488300

LLaVA

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Language: Python | Apache-2.0 | 1958500

GPT-SoVITS

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Language: Python | MIT | 3346300

ragflow

RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.

Language: Python | Apache-2.0 | 1846600

TinyDL

A deep learning framework built on the Eigen library (with CUDA acceleration support)

Language: C++ | 1600

grain

autograd mir and CUDA library for dynamic neural networks in D.

Language: D | BSL-1.0 | 6600

rust-autograd

Tensors and differentiable operations (like TensorFlow) in Rust

Language: Rust | MIT | 48400

pytorch-meta-optimizer

A PyTorch implementation of Learning to learn by gradient descent by gradient descent

Language: Python | MIT | 30900

Chinese-LLaMA-Alpaca-2

Phase two of the Chinese LLaMA-2 & Alpaca-2 LLM project, plus 64K long-context models

Language: Python | Apache-2.0 | 706700

baby-llama2-chinese

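A repository for pretraining from scratch and then applying SFT to a small-parameter Chinese LLaMa2; it runs on a single 24G GPU and yields a chat-llama2 with basic Chinese question-answering ability.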

Language: Python | MIT | 247100

Open-Sora

Open-Sora: Democratizing Efficient Video Production for All

Language: Python | Apache-2.0 | 2176000

hearthstone-ai

A Hearthstone AI based on Monte Carlo tree search and neural nets written in modern C++.

Language: C++ | 29700

Multi-Agent-Reinforcement-Learning-papers

Multi-Agent Reinforcement Learning (MARL) papers

19800

Reinforcement-Learning-Papers

📚 List of Top-tier Conference Papers on Reinforcement Learning (RL), including: NeurIPS, ICML, AAAI, IJCAI, AAMAS, ICLR, ICRA, etc.

MIT | 29300

Deep-reinforcement-learning-with-pytorch

PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and ....

Language: Python | MIT | 388000

reinforcement-learning-an-introduction

Python Implementation of Reinforcement Learning: An Introduction

Language: Python | MIT | 1351300

LLaMA-Factory

Efficiently Fine-Tune 100+ LLMs in WebUI (ACL 2024)

Language: Python | Apache-2.0 | 3183400

goat

a Fine-tuned LLaMA that is Good at Arithmetic Tasks

Language: Jupyter Notebook | 17400

dspy

DSPy: The framework for programming—not prompting—foundation models

Language: Python | MIT | 1745100

Spec-Bench

Spec-Bench: A Comprehensive Benchmark and Unified Evaluation Platform for Speculative Decoding (ACL 2024 Findings)

Language: Python | Apache-2.0 | 16600

luminal

Deep learning at the speed of light.

Language: Rust | Apache-2.0 | 145500

Deep-Reinforcement-Learning-on-Atari-Games

Language: Python | 1300

VPN

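Quickly set up a personal VPN / circumventing internet restrictions / bypassing the firewall / tutorials / ssr / ss / bbr / building your own proxy / self-hosted proxy service / unrestricted internet access / proxy services / VPN / latest 2023 tutorial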

Language: Shell | 95000

MARL-Papers

Paper list of multi-agent reinforcement learning (MARL)

398900