powergiant's starred repositories

LLM-Dojo

欢迎来到 LLM-Dojo,这里是一个开源大模型学习场所,使用简洁且易阅读的代码构建模型训练框架(支持各种主流模型如Qwen、Llama、GLM等等)、RLHF框架(DPO/CPO/KTO/PPO)等各种功能。👩‍🎓👨‍🎓

Language:PythonStargazers:269Issues:0Issues:0

LLamaTuner

Easy and Efficient Finetuning LLMs. (Supported LLama, LLama2, LLama3, Qwen, Baichuan, GLM , Falcon) 大模型高效量化训练+部署.

Language:PythonLicense:Apache-2.0Stargazers:569Issues:0Issues:0

LangChain-Chinese-Getting-Started-Guide

LangChain 的中文入门教程

Stargazers:7359Issues:0Issues:0

alignment-handbook

Robust recipes to align language models with human and AI preferences

Language:PythonLicense:Apache-2.0Stargazers:4534Issues:0Issues:0
Language:PythonLicense:NOASSERTIONStargazers:3841Issues:0Issues:0

AnyText

Official implementation code of the paper <AnyText: Multilingual Visual Text Generation And Editing>

Language:PythonLicense:Apache-2.0Stargazers:4245Issues:0Issues:0

Qwen-VL

The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.

Language:PythonLicense:NOASSERTIONStargazers:4883Issues:0Issues:0

LLaVA

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Language:PythonLicense:Apache-2.0Stargazers:19585Issues:0Issues:0

GPT-SoVITS

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Language:PythonLicense:MITStargazers:33463Issues:0Issues:0

ragflow

RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.

Language:PythonLicense:Apache-2.0Stargazers:18466Issues:0Issues:0

TinyDL

基于Eigen运算库的深度学习框架(支持CUDA加速)

Language:C++Stargazers:16Issues:0Issues:0

grain

autograd mir and CUDA library for dynamic neural networks in D.

Language:DLicense:BSL-1.0Stargazers:66Issues:0Issues:0

rust-autograd

Tensors and differentiable operations (like TensorFlow) in Rust

Language:RustLicense:MITStargazers:484Issues:0Issues:0

pytorch-meta-optimizer

A PyTorch implementation of Learning to learn by gradient descent by gradient descent

Language:PythonLicense:MITStargazers:309Issues:0Issues:0

Chinese-LLaMA-Alpaca-2

中文LLaMA-2 & Alpaca-2大模型二期项目 + 64K超长上下文模型 (Chinese LLaMA-2 & Alpaca-2 LLMs with 64K long context models)

Language:PythonLicense:Apache-2.0Stargazers:7067Issues:0Issues:0

baby-llama2-chinese

用于从头预训练+SFT一个小参数量的中文LLaMa2的仓库;24G单卡即可运行得到一个具备简单中文问答能力的chat-llama2.

Language:PythonLicense:MITStargazers:2471Issues:0Issues:0

Open-Sora

Open-Sora: Democratizing Efficient Video Production for All

Language:PythonLicense:Apache-2.0Stargazers:21760Issues:0Issues:0

hearthstone-ai

A Hearthstone AI based on Monte Carlo tree search and neural nets written in modern C++.

Language:C++Stargazers:297Issues:0Issues:0

Multi-Agent-Reinforcement-Learning-papers

Multi-Agent Reinforcement Learning (MARL) papers

Stargazers:198Issues:0Issues:0

Reinforcement-Learning-Papers

📚 List of Top-tier Conference Papers on Reinforcement Learning (RL),including: NeurIPS, ICML, AAAI, IJCAI, AAMAS, ICLR, ICRA, etc.

License:MITStargazers:293Issues:0Issues:0

Deep-reinforcement-learning-with-pytorch

PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and ....

Language:PythonLicense:MITStargazers:3880Issues:0Issues:0

reinforcement-learning-an-introduction

Python Implementation of Reinforcement Learning: An Introduction

Language:PythonLicense:MITStargazers:13513Issues:0Issues:0

LLaMA-Factory

Efficiently Fine-Tune 100+ LLMs in WebUI (ACL 2024)

Language:PythonLicense:Apache-2.0Stargazers:31834Issues:0Issues:0

goat

a Fine-tuned LLaMA that is Good at Arithmetic Tasks

Language:Jupyter NotebookStargazers:174Issues:0Issues:0

dspy

DSPy: The framework for programming—not prompting—foundation models

Language:PythonLicense:MITStargazers:17451Issues:0Issues:0

Spec-Bench

Spec-Bench: A Comprehensive Benchmark and Unified Evaluation Platform for Speculative Decoding (ACL 2024 Findings)

Language:PythonLicense:Apache-2.0Stargazers:166Issues:0Issues:0

luminal

Deep learning at the speed of light.

Language:RustLicense:Apache-2.0Stargazers:1455Issues:0Issues:0

VPN

快速搭建个人VPN/科学上网/翻墙/教程/ssr/ss/bbr/梯子搭建/自建机场/自由上网/代理服务/VPN/2023最新教程

Language:ShellStargazers:950Issues:0Issues:0

MARL-Papers

Paper list of multi-agent reinforcement learning (MARL)

Stargazers:3989Issues:0Issues:0