puyuan1996

China

Shenzhen

蒲源's repositories

MARL

Implementation for mSAC methods in PyTorch

Language: Python | 35 1 1

LC-SAC

Implementation of LC-SAC method in PyTorch.

Language: Python | 700

ZeroPal

ZeroPal: A concise RAG example for LightZero QA.

Language: Python | License: Apache-2.0 | 3 20

argsloader

Configuration Parsing and Management Based on ChainLoader

Language: Python | License: Apache-2.0 | 100

DI-engine

OpenDILab Decision AI Engine

Language: Python | License: Apache-2.0 | 100

DI-engine-docs

DI-engine docs (Chinese and English)

Language: Python | License: Apache-2.0 | 100

gomoku_server_ui

An integrated front-end and back-end example for a Gomoku game

Language: JavaScript | 1 10

study

Explore a collection of code examples for learning C++ and Python.

Language: C++ | License: Apache-2.0 | 1 10

genius-invokation-gym

Simulation environment for Genius Invokation TCG, the card game in Genshin Impact

Language: Python | License: MIT | 000

LightZero

LightZero: A lightweight and efficient MCTS/AlphaZero/MuZero algorithm toolkit.

Language: Python | License: Apache-2.0 | 000

NDRL-benchmark

000

PPOxFamily

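PPO x Family DRL Tutorial Course (an introductory open course on decision intelligence: 8 lessons that help you sort out the algorithm theory, untangle the code logic, and master practical decision AI applications)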

Language: Python | License: Apache-2.0 | 000

puyuan1996.github.io

Language: HTML | 000

ray

An open source framework that provides a simple, universal API for building distributed applications. Ray is packaged with RLlib, a scalable reinforcement learning library, and Tune, a scalable hyperparameter tuning library.

Language: Python | License: Apache-2.0 | 000

ROMA

Code accompanying the paper "ROMA: Multi-Agent Reinforcement Learning with Emergent Roles" (ICML 2020, https://arxiv.org/abs/2003.08039)

Language: Python | License: Apache-2.0 | 000

SOTA-RL-Algorithms

PyTorch implementation of Soft Actor-Critic (SAC), Twin Delayed DDPG (TD3), Actor-Critic (AC/A2C), Proximal Policy Optimization (PPO), QT-Opt, PointNet..

Language: Python | License: Apache-2.0 | 000

awesome-visual-rl

A curated list of reinforcement learning with vision (Visual RL) resources

000

DI-card

Language: Python | License: Apache-2.0 | 000

gobang

JavaScript gobang (Gomoku) AI: source code and tutorial, based on the Alpha-Beta pruning algorithm (not a neural network)

Language: JavaScript | 000

iris

Transformers are Sample-Efficient World Models. ICLR 2023, notable top 5%.

License: GPL-3.0 | 000

KataGo

GTP engine and self-play learning in Go

Language: C++ | License: NOASSERTION | 000

katrain

Improve your Baduk skills by training with KataGo!

Language: Python | License: NOASSERTION | 000

LibMTL

A PyTorch Library for Multi-Task Learning

Language: Python | License: MIT | 000

llm-reasoners

A library for advanced large language model reasoning

License: Apache-2.0 | 000

LLM_Tree_Search

The official implementation of the paper "Alphazero-like Tree-Search can guide large language model decoding and training"

Language: Python | 000

make-a-video-pytorch

Implementation of Make-A-Video, a new SOTA text-to-video generator from Meta AI, in PyTorch

License: MIT | 000

pinecone-vercel-starter

Pinecone + Vercel AI SDK Starter

000

quiz-system

A concise quiz-system example using Vue.js

Language: Vue | 000

tianshou

An elegant PyTorch deep reinforcement learning library.

Language: Python | License: MIT | 000