LeakyCauldron's repositories
leetcode
LeetCode Solutions: A Record of My Problem Solving Journey.( leetcode题解,记录自己的leetcode解题之路。)
abc_py
Simple Python interface for ABC
DRiLLS
DRiLLS: Deep Reinforcement Learning for Logic Synthesis Optimization
basic_reinforcement_learning
An introductory series to Reinforcement Learning (RL) with comprehensive step-by-step tutorials.
nas-dmrl
Learning to reinforcement learn for Neural Architecture Search
learn2018-autodown
清华大学新版网络学堂课程自动下载脚本 / A python script to clone all files from learn.tsinghua.edu.cn
population-based-training-of-NNs
Applying PBT optimization technique to different domains
MARL-Papers
Paper list of multi-agent reinforcement learning (MARL)
compile
CompILE: Compositional Imitation Learning and Execution (ICML 2019)
tf_unet
Generic U-Net Tensorflow implementation for image segmentation
pytorch-a2c-ppo-acktr
PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO) and Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR).
Reinforcement-learning-with-tensorflow
Simple Reinforcement learning tutorials
THUnet
清华校园网掉线自动重连
mesa
Mesa OpenGL library. This is where @anholt hosts some development branches, but the current usable code for vc4/v3d is *always* at https://gitlab.freedesktop.org/mesa/mesa
Codes-for-RL-PER
A novel DDPG method with prioritized experience replay (IEEE SMC 2017)
pytorch-a3c-1
PyTorch implementation of Asynchronous Advantage Actor Critic (A3C) from "Asynchronous Methods for Deep Reinforcement Learning".
population-based-training
Reproducing results from DeepMind's paper on Population Based Training of Neural Networks.
gotunet
golang for TsinghuaUniversityNetwork 清华大学校园网循环检测登录
prioritized-experience-replay
implement of prioritized experience replay
pytorch-A3C
Simple A3C implementation with pytorch + multiprocessing
pytorch-ddpg
Implementation of the Deep Deterministic Policy Gradient (DDPG) using PyTorch
GA3C
Hybrid CPU/GPU implementation of the A3C algorithm for deep reinforcement learning.
ACER
Actor-critic with experience replay
pix2pix-tensorflow
TensorFlow implementation of "Image-to-Image Translation Using Conditional Adversarial Networks".
a3c_continuous
A continuous action space version of A3C LSTM in pytorch plus A3G design
triplet-loss-mnist
Triplet Loss 损失函数
keras-nas-pgrl
Neural Architecture Search (NAS) using policy gradient Reinforcement Learning (RL)