蒲源 (puyuan1996)

puyuan1996

Geek Repo

Company:China

Location:Shenzhen

Github PK Tool:Github PK Tool

蒲源's repositories

MARL

Implementation for mSAC methods in PyTorch

LC-SAC

Implementation of LC-SAC method in PyTorch.

Language:PythonStargazers:7Issues:0Issues:0

ZeroPal

ZeroPal: A concise RAG example for LightZero QA.

Language:PythonLicense:Apache-2.0Stargazers:3Issues:2Issues:0

argsloader

Configuration Parsing and Management Based on ChainLoader

Language:PythonLicense:Apache-2.0Stargazers:1Issues:0Issues:0

DI-engine

OpenDILab Decision AI Engine

Language:PythonLicense:Apache-2.0Stargazers:1Issues:0Issues:0

DI-engine-docs

DI-engine docs(Chinese and English)

Language:PythonLicense:Apache-2.0Stargazers:1Issues:0Issues:0

gomoku_server_ui

An integrated example of front-end and back-end for a Gomoku game 五子棋前后端集成示例

Language:JavaScriptStargazers:1Issues:1Issues:0

study

Explore a collection of code examples for learning C++ and Python.

Language:C++License:Apache-2.0Stargazers:1Issues:1Issues:0

genius-invokation-gym

原神七圣召唤模拟环境 Simulator of Genius Invocation

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

LightZero

LightZero: A lightweight and efficient MCTS/AlphaZero/MuZero algorithm toolkit.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0

PPOxFamily

PPO x Family DRL Tutorial Course(决策智能入门级公开课:8节课帮你盘清算法理论,理顺代码逻辑,玩转决策AI应用实践 )

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0
Language:HTMLStargazers:0Issues:0Issues:0

ray

An open source framework that provides a simple, universal API for building distributed applications. Ray is packaged with RLlib, a scalable reinforcement learning library, and Tune, a scalable hyperparameter tuning library.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

ROMA

Codes accompanying the paper "ROMA: Multi-Agent Reinforcement Learning with Emergent Roles" (ICML 2020 https://arxiv.org/abs/2003.08039)

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

SOTA-RL-Algorithms

PyTorch implementation of Soft Actor-Critic (SAC), Twin Delayed DDPG (TD3), Actor-Critic (AC/A2C), Proximal Policy Optimization (PPO), QT-Opt, PointNet..

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

awesome-visual-rl

A curated list of reinforcement learning with vision (Visual RL) resources

Stargazers:0Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

gobang

javascript gobang AI,JS五子棋AI,源码+教程,基于Alpha-Beta剪枝算法(不是神经网络)

Language:JavaScriptStargazers:0Issues:0Issues:0

iris

Transformers are Sample-Efficient World Models. ICLR 2023, notable top 5%.

License:GPL-3.0Stargazers:0Issues:0Issues:0

KataGo

GTP engine and self-play learning in Go

Language:C++License:NOASSERTIONStargazers:0Issues:0Issues:0

katrain

Improve your Baduk skills by training with KataGo!

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

LibMTL

A PyTorch Library for Multi-Task Learning

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

llm-reasoners

A library for advanced large language model reasoning

License:Apache-2.0Stargazers:0Issues:0Issues:0

LLM_Tree_Search

The official implementation of paper: Alphazero-like Tree-Search can guide large language model decoding and training

Language:PythonStargazers:0Issues:0Issues:0

make-a-video-pytorch

Implementation of Make-A-Video, new SOTA text to video generator from Meta AI, in Pytorch

License:MITStargazers:0Issues:0Issues:0

pinecone-vercel-starter

Pinecone + Vercel AI SDK Starter

Stargazers:0Issues:0Issues:0

quiz-system

A concise quiz-system example using Vue.js

Language:VueStargazers:0Issues:0Issues:0

tianshou

An elegant PyTorch deep reinforcement learning library.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0