seven8827's repositories
atari-representation-learning
Code for "Unsupervised State Representation Learning in Atari"
gym-sokoban
Sokoban environment for OpenAI Gym
MountainCar-v0_DeepRL
OpenAI MountainCar-v0 DeepRL-based solutions (DQN, DuelingDQN, D3QN)
SGI
Official code for "Pretraining Representations For Data-Efficient Reinforcement Learning" (NeurIPS 2021)
Super-mario-bros-PPO-pytorch
Proximal Policy Optimization (PPO) algorithm for Super Mario Bros
autonomous_exploration_development_environment
Leveraging system development and robot deployment for ground-based autonomous navigation and exploration.
snn-binary-sample-main
Initial version
RL-Adventure-2
PyTorch0.4 implementation of: actor critic / proximal policy optimization / acer / ddpg / twin dueling ddpg / soft actor critic / generative adversarial imitation learning / hindsight experience replay
FrameRecorder
Imagine you are drawing pictures or writing a program on your computer. Wouldn't you like to shoot small clips of your work while doing this? That's when Frame Recorder comes to your aid. It will save it for you! See hours of process in just a few minutes!
tinyrl
Animated interactive visualization of Value-Iteration and Q-Learning in a Stochastic GridWorld environment.
reinforcement-learning-an-introduction
Python Implementation of Reinforcement Learning: An Introduction
CLsurvey
Continual Hyperparameter Selection Framework. Compares 11 state-of-the-art Lifelong Learning methods and 4 baselines. Official Codebase of "A continual learning survey: Defying forgetting in classification tasks." in IEEE TPAMI.
rl_openai
RL with OpenAI Gym
MaplessNavigation
reinforcement learning algorithm for mapless navigation
spinning-up-basic
Basic versions of agents from Spinning Up in Deep RL written in PyTorch
normalization_correlation
Estudo da normalização para o cálculo da correlação (pearson, spearman)
Save-my-Cat
Small game with Python Tkinter
leetcode_101
LeetCode 101:和你一起你轻松刷题(C++)
rad_openaigym
RAD: Reinforcement Learning with Augmented Data (code for state augmentation)
rad
RAD: Reinforcement Learning with Augmented Data
3DObjectTracking
Official Code: A Sparse Gaussian Approach to Region-Based 6DoF Object Tracking
resume
个人中文简历 Latex 源码 https://hijiangtao.github.io/
continuous-transition
ICRA 2021
smarties
Lightweight and scalable framework for Reinforcement Learning
PyTorch-RL
PyTorch implementation of Deep Reinforcement Learning: Policy Gradient methods (TRPO, PPO, A2C) and Generative Adversarial Imitation Learning (GAIL). Fast Fisher vector product TRPO.
Rainbow
Rainbow: Combining Improvements in Deep Reinforcement Learning
ivideo
一个可以观看国内主流视频平台所有视频的客户端(Mac、Windows、Linux) A client that can watch video of domestic(China) mainstream video platform
OpenAIGym
Solving OpenAI Gym problems.
robogym
Robotics Gym Environments