李毓瑞's starred repositories
awesome-mixture-of-experts
A collection of AWESOME things about mixture-of-experts
PyTorch-VAE
A Collection of Variational Autoencoders (VAE) in PyTorch.
learning_research
My research experience
Pytorch-Memory-Utils
PyTorch memory tracking code
IsaacGymEnvs
Isaac Gym Reinforcement Learning Environments
MARL-Algorithms
Implementations of IQL, QMIX, VDN, COMA, QTRAN, MAVEN, CommNet, DyMA-CL, and G2ANet on SMAC, the decentralised micromanagement scenario of StarCraft II
pytorch-a2c-ppo-acktr-gail
PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).
stable-baselines3
PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.
PettingZoo
An API standard for multi-agent reinforcement learning environments, with popular reference environments and related utilities
open_spiel
OpenSpiel is a collection of environments and algorithms for research in general reinforcement learning and search/planning in games.
scikit-opt
Genetic Algorithm, Particle Swarm Optimization, Simulated Annealing, Ant Colony Optimization Algorithm, Immune Algorithm, Artificial Fish Swarm Algorithm, Differential Evolution, and TSP (Traveling Salesman)
reinforcement-learning-an-introduction-chinese
Chinese translation of "Reinforcement Learning: An Introduction" (second edition)