xiaohuojianchendiwen

xiaohuojianchendiwen's repositories

annotated_deep_learning_paper_implementations

🧑‍🏫 50! Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, ...), gans (cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠

License: MIT, Stargazers: 0, Issues: 0

federated

A framework for implementing federated learning

License: Apache-2.0, Stargazers: 0, Issues: 0

Mava

A library of multi-agent reinforcement learning components and systems

License: Apache-2.0, Stargazers: 0, Issues: 0

FATE

An Industrial Grade Federated Learning Framework

License: Apache-2.0, Stargazers: 0, Issues: 0

tianshou

An elegant PyTorch deep reinforcement learning library.

License: MIT, Stargazers: 0, Issues: 0

google-research

Google Research

License: Apache-2.0, Stargazers: 0, Issues: 0

DRLib

DRLib: A concise deep reinforcement learning library, integrating HER and PER for almost all off-policy RL algorithms.

License: MIT, Stargazers: 0, Issues: 0

pymarl

Python Multi-Agent Reinforcement Learning framework

License: Apache-2.0, Stargazers: 0, Issues: 0

on-policy

This is the official implementation of Multi-Agent PPO (MAPPO).

License: MIT, Stargazers: 0, Issues: 0

P-DQN

Parametrized Deep Q-Networks Learning: Reinforcement Learning with Discrete-Continuous Hybrid Action Space

License: MIT, Stargazers: 0, Issues: 0

RLsilde

Some notes about reinforcement learning.

Stargazers: 0, Issues: 0

RL_Tutorial

Tutorial for Reinforcement Learning

License: MIT, Stargazers: 0, Issues: 0

T-GCN

Temporal Graph Convolutional Network for Urban Traffic Flow Prediction Method

Stargazers: 0, Issues: 0

R2D2

An Implementation of Recurrent Experience Replay in Distributed Reinforcement Learning (Kapturowski et al. 2019) in PyTorch and Ray

Stargazers: 0, Issues: 0

Keras-GAN

Keras implementations of Generative Adversarial Networks.

License: MIT, Stargazers: 0, Issues: 0

DeepRL

Modularized Implementation of Deep RL Algorithms in PyTorch

License: MIT, Stargazers: 0, Issues: 0

smac

SMAC: The StarCraft Multi-Agent Challenge

License: MIT, Stargazers: 0, Issues: 0

UAV-AHU

Knowledge distillation, drone delivery system, street photography, identification and matching

Stargazers: 0, Issues: 0

Dueling_DQN

Dueling DQN in PyTorch

License: MIT, Stargazers: 0, Issues: 0

machinelearning

My blogs and code for machine learning. http://cnblogs.com/pinard

License: MIT, Stargazers: 0, Issues: 0

docs

TensorFlow documentation

License: Apache-2.0, Stargazers: 0, Issues: 0

DRL_algorithm_library

This is a reinforcement learning algorithm library. The code takes into account both performance and simplicity, with little dependence.

Stargazers: 0, Issues: 0

pderl

Code for "Proximal Distilled Evolutionary Reinforcement Learning", accepted at AAAI 2020

License: MIT, Stargazers: 0, Issues: 0

light_mappo

Lightweight version of MAPPO to help you quickly migrate to your local environment.

Stargazers: 0, Issues: 0

StarCraft

Implementations of IQL, QMIX, VDN, COMA, QTRAN, MAVEN, CommNet, DyMA-CL, and G2ANet on SMAC, the decentralised micromanagement scenario of StarCraft II

Stargazers: 0, Issues: 0

maddpg

Code for the MADDPG algorithm from the paper "Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments"

License: MIT, Stargazers: 0, Issues: 0

malib_deprecated

A Multi-agent Learning Framework

License: MIT, Stargazers: 0, Issues: 0