Yunkun Xu (KunBB)

KunBB

Geek Repo

Company:Zhejiang University

Location:Hangzhou Zhejiang China

Github PK Tool:Github PK Tool

Yunkun Xu's repositories

homework

Assignments for CS294-112.

Language:PythonLicense:MITStargazers:1Issues:1Issues:0

RL-Simple-implementation-of-AWAC-algorithm

Reinforcement Learning : This project aims at implementing an advanced algorithm called AWAC, Advantage weighted Actor Critic algorithm , which is discussed in \cite{nair2020accelerating} This project will attempt to understand the advantages and explore disadvantages (if any) of this algorithm. The main goal of this algorithm is to accelerate online reinforcement learning using offline datasets, which makes it a very useful tool for using reinforcement learning more efficient. But this task is also an extremely complicated and difficult task. The approach discussed in the paper aims to navigate through this task by efficiently handling its challenges with accumulation of error while bootstrapping, stemming from data inefficiency and excessive conservative on line learning.

Language:Jupyter NotebookStargazers:1Issues:1Issues:0

baselines

OpenAI Baselines: high-quality implementations of reinforcement learning algorithms

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

bullet3

Bullet Physics SDK: real-time collision detection and multi-physics simulation for VR, games, visual effects, robotics, machine learning etc.

Language:C++License:NOASSERTIONStargazers:0Issues:1Issues:0

CS-Notes

:books: Tech Interview Guide 技术面试必备基础知识、Leetcode 题解、Java、C++、Python、后端面试、操作系统、计算机网络、系统设计

Language:JavaStargazers:0Issues:1Issues:0

Deep-Learning-Papers-Reading-Roadmap

Deep Learning papers reading roadmap for anyone who are eager to learn this amazing tech!

Language:PythonStargazers:0Issues:1Issues:0

garage

A toolkit for reproducible reinforcement learning research

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

google-access-helper

谷歌访问助手破解版

Stargazers:0Issues:0Issues:0

gym-gazebo2

gym-gazebo2 is a toolkit for developing and comparing reinforcement learning algorithms using ROS 2 and Gazebo

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0

Imagination-Augmented-Agents

Building Agents with Imagination: pytorch step-by-step implementation

Language:Jupyter NotebookStargazers:0Issues:1Issues:0

imagination-augmented-agents-tf

Imagination Augmented Agents TensorFlow

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

Machine-Learning-is-ALL-You-Need

🔥🌤🪐《Machine Learning 格物志》: ML + DL + RL basic codes and notes by sklearn, pytorch, tensorflow, keras & the most important, from scratch!💪 This repository is ALL You Need!

Language:PythonStargazers:0Issues:1Issues:0

MARA

MARA, world's first modular industrial robot arm, official repository

License:GPL-3.0Stargazers:0Issues:0Issues:0

MarkdownPhotos

Photos for blog

Stargazers:0Issues:2Issues:0

MB-MPO-trajectory-buffer

A modification of MB-MPO with trajectory-buffer

Language:PythonLicense:NOASSERTIONStargazers:0Issues:1Issues:0

mbpo

Code for the paper "When to Trust Your Model: Model-Based Policy Optimization"

Language:PythonStargazers:0Issues:1Issues:0

ml_implementation

Implementation of Machine Learning Algorithms

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

models

Models and examples built with TensorFlow

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0

numpy-stl

Simple library to make working with STL files (and 3D objects in general) fast and easy.

License:BSD-3-ClauseStargazers:0Issues:0Issues:0

PARL

PARL A high-performance distributed training framework for Reinforcement Learning (『飞桨』强化学习库 )

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0

pcl

Point Cloud Library (PCL)

Language:C++License:NOASSERTIONStargazers:0Issues:1Issues:0

python-pcl

Python bindings to the pointcloud library (pcl)

License:NOASSERTIONStargazers:0Issues:0Issues:0

pytorch-handbook

pytorch handbook是一本开源的书籍,目标是帮助那些希望和使用PyTorch进行深度学习开发和研究的朋友快速入门,其中包含的Pytorch教程全部通过测试保证可以成功运行

Language:Jupyter NotebookStargazers:0Issues:0Issues:0

reinforcement-learning

Implementation of Reinforcement Learning Algorithms. Python, OpenAI Gym, Tensorflow. Exercises and Solutions to accompany Sutton's Book and David Silver's course.

License:MITStargazers:0Issues:0Issues:0

rlkit

Collection of reinforcement learning algorithms

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

slbo

Algorithmic Framework for Model-based Deep Reinforcement Learning with Theoretical Guarantees

License:NOASSERTIONStargazers:0Issues:0Issues:0

softlearning

Softlearning is a reinforcement learning framework for training maximum entropy policies in continuous domains. Includes the official implementation of the Soft Actor-Critic algorithm.

Language:PythonLicense:NOASSERTIONStargazers:0Issues:1Issues:0

stac

Statistical Tests for Algorithms Comparison (STAC) is a new platform for statistical analysis to verify the results obtained from computational intelligence algorithms.

Language:JavaScriptLicense:BSD-2-ClauseStargazers:0Issues:1Issues:0

TensorFlow-2.x-Tutorials

TensorFlow 2.x version's Tutorials and Examples, including CNN, RNN, GAN, Auto-Encoders, FasterRCNN, GPT, BERT examples, etc. TF 2.0版入门实例代码,实战教程。

Language:Jupyter NotebookStargazers:0Issues:1Issues:0

zjuthesis

Zhejiang University Graduation Thesis/Design LaTeX template.

Language:TeXLicense:MITStargazers:0Issues:2Issues:0