seven8827 (liuqi8827)

liuqi8827


Company: Harbin Institute of Technology

Location: Shenzhen, China


seven8827's repositories

atari-representation-learning

Code for "Unsupervised State Representation Learning in Atari"

License: MIT | Stargazers: 0 | Issues: 0

gym-sokoban

Sokoban environment for OpenAI Gym

License: MIT | Stargazers: 0 | Issues: 0

MountainCar-v0_DeepRL

OpenAI MountainCar-v0 DeepRL-based solutions (DQN, DuelingDQN, D3QN)

License: MIT | Stargazers: 0 | Issues: 0

SGI

Official code for "Pretraining Representations For Data-Efficient Reinforcement Learning" (NeurIPS 2021)

License: MIT | Stargazers: 0 | Issues: 0

Super-mario-bros-PPO-pytorch

Proximal Policy Optimization (PPO) algorithm for Super Mario Bros

License: MIT | Stargazers: 0 | Issues: 0

autonomous_exploration_development_environment

Leveraging system development and robot deployment for ground-based autonomous navigation and exploration.

Stargazers: 0 | Issues: 0

snn-binary-sample-main

Initial version

Stargazers: 0 | Issues: 0

RL-Adventure-2

PyTorch 0.4 implementation of: actor-critic / proximal policy optimization / ACER / DDPG / twin delayed DDPG / soft actor-critic / generative adversarial imitation learning / hindsight experience replay

Stargazers: 0 | Issues: 0

FrameRecorder

Imagine you are drawing pictures or writing a program on your computer. Wouldn't you like to shoot small clips of your work while doing this? That's when Frame Recorder comes to your aid. It will save it for you! See hours of process in just a few minutes!

License: MIT | Stargazers: 0 | Issues: 0

tinyrl

Animated interactive visualization of Value-Iteration and Q-Learning in a Stochastic GridWorld environment.

Stargazers: 0 | Issues: 0

reinforcement-learning-an-introduction

Python Implementation of Reinforcement Learning: An Introduction

License: MIT | Stargazers: 0 | Issues: 0

CLsurvey

Continual Hyperparameter Selection Framework. Compares 11 state-of-the-art Lifelong Learning methods and 4 baselines. Official codebase of "A continual learning survey: Defying forgetting in classification tasks" (IEEE TPAMI).

License: NOASSERTION | Stargazers: 0 | Issues: 0

rl_openai

RL with OpenAI Gym

License: MIT | Stargazers: 0 | Issues: 0

MaplessNavigation

Reinforcement learning algorithm for mapless navigation

Stargazers: 0 | Issues: 0

spinning-up-basic

Basic versions of agents from Spinning Up in Deep RL written in PyTorch

License: MIT | Stargazers: 0 | Issues: 0

normalization_correlation

Study of normalization for computing correlation (Pearson, Spearman)

Stargazers: 0 | Issues: 0

Save-my-Cat

Small game with Python Tkinter

Stargazers: 0 | Issues: 0

leetcode_101

LeetCode 101: Work through LeetCode problems with ease, together with you (C++)

Stargazers: 0 | Issues: 0

rad_openaigym

RAD: Reinforcement Learning with Augmented Data (code for state augmentation)

Stargazers: 0 | Issues: 0

rad

RAD: Reinforcement Learning with Augmented Data

Stargazers: 0 | Issues: 0

3DObjectTracking

Official Code: A Sparse Gaussian Approach to Region-Based 6DoF Object Tracking

License: MIT | Stargazers: 0 | Issues: 0

resume

LaTeX source for a personal Chinese-language résumé https://hijiangtao.github.io/

License: MIT | Stargazers: 0 | Issues: 0

smarties

Lightweight and scalable framework for Reinforcement Learning

License: MIT | Stargazers: 0 | Issues: 0

PyTorch-RL

PyTorch implementation of Deep Reinforcement Learning: Policy Gradient methods (TRPO, PPO, A2C) and Generative Adversarial Imitation Learning (GAIL). Fast Fisher vector product TRPO.

License: MIT | Stargazers: 0 | Issues: 0

Rainbow

Rainbow: Combining Improvements in Deep Reinforcement Learning

License: MIT | Stargazers: 0 | Issues: 0

ivideo

A client for watching all videos from mainstream video platforms in mainland China (Mac, Windows, Linux)

License: MIT | Stargazers: 0 | Issues: 0

OpenAIGym

Solving OpenAI Gym problems.

Stargazers: 0 | Issues: 0

robogym

Robotics Gym Environments

License: MIT | Stargazers: 0 | Issues: 0