DennisWangCW

LeakyCauldron's repositories

FlowTune

000

nascell-automl

MIT | 000

leetcode

LeetCode Solutions: A Record of My Problem Solving Journey.

NOASSERTION | 000

abc_py

Simple Python interface for ABC

MIT | 000

DRiLLS

DRiLLS: Deep Reinforcement Learning for Logic Synthesis Optimization

BSD-3-Clause | 000

basic_reinforcement_learning

An introductory series to Reinforcement Learning (RL) with comprehensive step-by-step tutorials.

GPL-3.0 | 000

nas-dmrl

Learning to reinforcement learn for Neural Architecture Search

MIT | 000

learn2018-autodown

Automatic course-file download script for Tsinghua University's new Web Learning platform / A Python script to clone all files from learn.tsinghua.edu.cn

MIT | 000

population-based-training-of-NNs

Applying PBT optimization technique to different domains

000

LwH

Learning with Helper

Language: Python | 1800

MARL-Papers

Paper list of multi-agent reinforcement learning (MARL)

100

compile

CompILE: Compositional Imitation Learning and Execution (ICML 2019)

MIT | 000

tf_unet

Generic U-Net Tensorflow implementation for image segmentation

Language: Python | GPL-3.0 | 000

pytorch-a2c-ppo-acktr

PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO) and Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR).

Language: Python | MIT | 000

Reinforcement-learning-with-tensorflow

Simple Reinforcement learning tutorials

Language: Python | MIT | 000

THUnet

Automatic reconnection to the Tsinghua campus network after a dropped connection

Language: Python | 000

mesa

Mesa OpenGL library. This is where @anholt hosts some development branches, but the current usable code for vc4/v3d is *always* at https://gitlab.freedesktop.org/mesa/mesa

Language: C | 000

Codes-for-RL-PER

A novel DDPG method with prioritized experience replay (IEEE SMC 2017)

Language: Python | MIT | 000

pytorch-a3c-1

PyTorch implementation of Asynchronous Advantage Actor Critic (A3C) from "Asynchronous Methods for Deep Reinforcement Learning".

Language: Python | MIT | 000

population-based-training

Reproducing results from DeepMind's paper on Population Based Training of Neural Networks.

000

gotunet

Go tool for the Tsinghua University campus network: checks the connection in a loop and logs in

Language: Go | GPL-3.0 | 000

prioritized-experience-replay

Implementation of prioritized experience replay

Language: Python | MIT | 000

pytorch-A3C

Simple A3C implementation with pytorch + multiprocessing

Language: Python | MIT | 000

pytorch-ddpg

Implementation of the Deep Deterministic Policy Gradient (DDPG) using PyTorch

Language: Python | Apache-2.0 | 200

GA3C

Hybrid CPU/GPU implementation of the A3C algorithm for deep reinforcement learning.

Language: Python | BSD-3-Clause | 000

ACER

Actor-critic with experience replay

Language: Python | MIT | 000

pix2pix-tensorflow

TensorFlow implementation of "Image-to-Image Translation Using Conditional Adversarial Networks".

Language: Python | MIT | 000

a3c_continuous

A continuous action space version of A3C LSTM in pytorch plus A3G design

Language: Python | Apache-2.0 | 000

triplet-loss-mnist

Triplet loss function

Language: Python | 000

keras-nas-pgrl

Neural Architecture Search (NAS) using policy gradient Reinforcement Learning (RL)

000