KunBB

followers

following

stars

Zhejiang University

Hangzhou Zhejiang China

Yunkun Xu's repositories

homework

Assignments for CS294-112.

Language:PythonMIT1 10

RL-Simple-implementation-of-AWAC-algorithm

Reinforcement Learning : This project aims at implementing an advanced algorithm called AWAC, Advantage weighted Actor Critic algorithm , which is discussed in \cite{nair2020accelerating} This project will attempt to understand the advantages and explore disadvantages (if any) of this algorithm. The main goal of this algorithm is to accelerate online reinforcement learning using offline datasets, which makes it a very useful tool for using reinforcement learning more efficient. But this task is also an extremely complicated and difficult task. The approach discussed in the paper aims to navigate through this task by efficiently handling its challenges with accumulation of error while bootstrapping, stemming from data inefficiency and excessive conservative on line learning.

Language:Jupyter Notebook1 10

baselines

OpenAI Baselines: high-quality implementations of reinforcement learning algorithms

Language:PythonMIT010

bullet3

Bullet Physics SDK: real-time collision detection and multi-physics simulation for VR, games, visual effects, robotics, machine learning etc.

Language:C++NOASSERTION010

CS-Notes

:books: Tech Interview Guide 技术面试必备基础知识、Leetcode 题解、Java、C++、Python、后端面试、操作系统、计算机网络、系统设计

Language:Java010

Deep-Learning-Papers-Reading-Roadmap

Deep Learning papers reading roadmap for anyone who are eager to learn this amazing tech!

Language:Python010

garage

A toolkit for reproducible reinforcement learning research

Language:PythonMIT010

google-access-helper

谷歌访问助手破解版

000

gym-gazebo2

gym-gazebo2 is a toolkit for developing and comparing reinforcement learning algorithms using ROS 2 and Gazebo

Language:PythonApache-2.0010

Imagination-Augmented-Agents

Building Agents with Imagination: pytorch step-by-step implementation

Language:Jupyter Notebook010

imagination-augmented-agents-tf

Imagination Augmented Agents TensorFlow

Language:PythonMIT010

Machine-Learning-is-ALL-You-Need

🔥🌤🪐《Machine Learning 格物志》: ML + DL + RL basic codes and notes by sklearn, pytorch, tensorflow, keras & the most important, from scratch!💪 This repository is ALL You Need!

Language:Python010

MARA

MARA, world's first modular industrial robot arm, official repository

GPL-3.0000

MarkdownPhotos

Photos for blog

020

MB-MPO-trajectory-buffer

A modification of MB-MPO with trajectory-buffer

Language:PythonNOASSERTION010

mbpo

Code for the paper "When to Trust Your Model: Model-Based Policy Optimization"

Language:Python010

ml_implementation

Implementation of Machine Learning Algorithms

Language:PythonMIT010

models

Models and examples built with TensorFlow

Language:PythonApache-2.0010

numpy-stl

Simple library to make working with STL files (and 3D objects in general) fast and easy.

BSD-3-Clause000

PARL

PARL A high-performance distributed training framework for Reinforcement Learning （『飞桨』强化学习库）

Language:PythonApache-2.0010

pcl

Point Cloud Library (PCL)

Language:C++NOASSERTION010

python-pcl

Python bindings to the pointcloud library (pcl)

NOASSERTION000

pytorch-handbook

pytorch handbook是一本开源的书籍，目标是帮助那些希望和使用PyTorch进行深度学习开发和研究的朋友快速入门，其中包含的Pytorch教程全部通过测试保证可以成功运行

Language:Jupyter Notebook000

reinforcement-learning

Implementation of Reinforcement Learning Algorithms. Python, OpenAI Gym, Tensorflow. Exercises and Solutions to accompany Sutton's Book and David Silver's course.

MIT000

rlkit

Collection of reinforcement learning algorithms

Language:PythonMIT000

slbo

Algorithmic Framework for Model-based Deep Reinforcement Learning with Theoretical Guarantees

NOASSERTION000

softlearning

Softlearning is a reinforcement learning framework for training maximum entropy policies in continuous domains. Includes the official implementation of the Soft Actor-Critic algorithm.

Language:PythonNOASSERTION010

stac

Statistical Tests for Algorithms Comparison (STAC) is a new platform for statistical analysis to verify the results obtained from computational intelligence algorithms.

Language:JavaScriptBSD-2-Clause010

TensorFlow-2.x-Tutorials

TensorFlow 2.x version's Tutorials and Examples, including CNN, RNN, GAN, Auto-Encoders, FasterRCNN, GPT, BERT examples, etc. TF 2.0版入门实例代码，实战教程。

Language:Jupyter Notebook010

zjuthesis

Zhejiang University Graduation Thesis/Design LaTeX template.

Language:TeXMIT020