yangyi0318

yangyi0318's repositories

adversarial-attacks-pytorch

PyTorch implementation of adversarial attacks.

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

awesome-latex-drawing

Drawing Bayesian networks, graphical models, tensors, and technical frameworks and illustrations in LaTeX.

Language: TeX | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

Awesome-Learning-with-Label-Noise

A curated list of resources for Learning with Noisy Labels

Stargazers: 0 | Issues: 0 | Issues: 0

awesome-self-supervised-learning

A curated list of awesome self-supervised methods

Stargazers: 0 | Issues: 0 | Issues: 0

badnets-pytorch

Simple PyTorch implementations of BadNets on MNIST and CIFAR10.

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

boolean_composition

Code for the paper "A Boolean Task Algebra For Reinforcement Learning"

Language: Python | Stargazers: 0 | Issues: 1 | Issues: 0

cla_demo

Demo code for a clustering-based label-aware autoencoder

Language: Python | Stargazers: 0 | Issues: 0 | Issues: 0

composition

Code for the paper "Composing Value Functions in Reinforcement Learning"

Language: HTML | Stargazers: 0 | Issues: 0 | Issues: 0

CoNAL

Code for the AAAI 2021 long paper "Learning from Crowds by Modeling Common Confusions".

Language: Python | Stargazers: 0 | Issues: 0 | Issues: 0

cpu

"Write Your Own CPU" (《自己动手写CPU》)

Language: Verilog | Stargazers: 0 | Issues: 0 | Issues: 0

dads

Code for 'Dynamics-Aware Unsupervised Discovery of Skills' (DADS). Enables skill discovery without supervision, which can be combined with model-based control.

License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

Deep-Reinforcement-Learning-Algorithms-with-PyTorch

PyTorch implementations of deep reinforcement learning algorithms and environments

Language: Python | Stargazers: 0 | Issues: 1 | Issues: 0

dm_control

DeepMind's software stack for physics-based simulation and Reinforcement Learning environments, using MuJoCo.

License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

garage

A toolkit for reproducible reinforcement learning research.

License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

hindsight-experience-replay

PyTorch implementation of Hindsight Experience Replay (HER), with experiments on all Fetch robotic environments.

License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

Learning-Independent-SKills

Task-dependent skill transformation is challenging because the relationships between primitive skills are usually ignored. This project proposes a skill decomposition algorithm that learns independent skills, which are better suited than primitive skills to task-dependent skill transformation.

Stargazers: 0 | Issues: 0 | Issues: 0

paper-reading

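Understand engineering deployment better than algorithm researchers, and understand algorithms and models better than engineers.

License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0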

ptan

PyTorch Agent Net: a reinforcement learning toolkit for PyTorch

Language: Python | License: MIT | Stargazers: 0 | Issues: 1 | Issues: 0

PyTorch-VAE

A Collection of Variational Autoencoders (VAE) in PyTorch.

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 1 | Issues: 0

raylab

Reinforcement learning algorithms in RLlib

License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

softlearning

Softlearning is a reinforcement learning framework for training maximum entropy policies in continuous domains. Includes the official implementation of the Soft Actor-Critic algorithm.

License: NOASSERTION | Stargazers: 0 | Issues: 0 | Issues: 0

SoftQLearning

SoftQ Implementation

Stargazers: 0 | Issues: 0 | Issues: 0

spinningup-workspace

Reading notes & PyTorch experiments on OpenAI's "Spinning Up in DRL" tutorial.

Stargazers: 0 | Issues: 0 | Issues: 0

stable-baselines3

PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.

License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

Stein-Variational-Gradient-Descent

Code for the paper "Stein Variational Gradient Descent (SVGD): A General Purpose Bayesian Inference Algorithm"

License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

Tabular-RL-with-Python

Tabular Reinforcement Learning Algorithms with NumPy & Visualizations with Seaborn

Stargazers: 0 | Issues: 0 | Issues: 0