fuxianh

fuxianh

Geek Repo

Company:hfuxian@zju.edu.cn

Location:Hangzhou, China

Github PK Tool:Github PK Tool

fuxianh's repositories

Language:PythonLicense:MITStargazers:2Issues:2Issues:0

Evolutionary-Reinforcement-Learning

Codebase for Evolutionary Reinforcement Learning (ERL) from the paper "Evolution-Guided Policy Gradients in Reinforcement Learning" published at NeurIPS 2018

Language:PythonStargazers:1Issues:2Issues:0

latplan

LatPlan : A domain-independent, image-based classical planner

Language:PythonStargazers:1Issues:3Issues:0

996.TSC

996.ICU周边文化 | 创意板块(主站:996.ICU)

Stargazers:0Issues:2Issues:0

awesome-automl-papers

A curated list of automated machine learning papers, articles, tutorials, slides and projects

License:Apache-2.0Stargazers:0Issues:0Issues:0

awesome-interview-questions

:octocat: A curated awesome list of lists of interview questions. Feel free to contribute! :mortar_board:

Stargazers:0Issues:0Issues:0

bullet3

Bullet Physics SDK: real-time collision detection and multi-physics simulation for VR, games, visual effects, robotics, machine learning etc.

Language:C++License:NOASSERTIONStargazers:0Issues:2Issues:0

CIFAR-ZOO

PyTorch implementation of CNNs for CIFAR dataset (97.71% on cifar10)

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

cvpr2019

cvpr2019 papers,极市团队整理

Stargazers:0Issues:2Issues:0

deeplearning_ai_books

deeplearning.ai(吴恩达老师的深度学习课程笔记及资源)

Language:HTMLStargazers:0Issues:2Issues:0

DeepRL

Modularized Implementation of Deep RL Algorithms in PyTorch

Language:PythonLicense:MITStargazers:0Issues:3Issues:0

DIM

Deep InfoMax (DIM), or "Learning Deep Representations by Mutual Information Estimation and Maximization"

Language:PythonLicense:BSD-3-ClauseStargazers:0Issues:0Issues:0

gpytorch

A highly efficient and modular implementation of Gaussian Processes in PyTorch

License:MITStargazers:0Issues:0Issues:0

hindsight-experience-replay

This is the pytorch implementation of Hindsight Experience Replay (HER) - Experiment on all fetch robotic environments.

Language:PythonLicense:MITStargazers:0Issues:2Issues:0

learn2learn

PyTorch Meta-learning Framework for Researchers

License:MITStargazers:0Issues:0Issues:0

leedeeprl-notes

李宏毅《深度强化学习》笔记,在线阅读地址:https://datawhalechina.github.io/leedeeprl-notes/

Language:PythonLicense:NOASSERTIONStargazers:0Issues:1Issues:0

Lihang

Statistical learning methods, 统计学习方法 [李航] 值得反复读. [笔记, 代码, notebook, 参考文献, Errata]

Language:PythonStargazers:0Issues:0Issues:0

metaworld

An open source robotics benchmark for meta- and multi-task reinforcement learning

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

MNF_VBNN

Multiplicative Normalizing Flow (MNF) posteriors for variational Bayesian neural networks

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

NoveltySearchLevenshteinCode

Code to run experiments related to Novelty Search for Deep Reinforcement Learning Policy Network Weights by Action Sequence Edit Metric Distance

Stargazers:0Issues:0Issues:0

open_spiel

OpenSpiel is a collection of environments and algorithms for research in general reinforcement learning and search/planning in games.

Language:C++License:Apache-2.0Stargazers:0Issues:0Issues:0

pwc

Papers with code. Sorted by stars. Updated weekly.

Stargazers:0Issues:0Issues:0

pytorch-a2c-ppo-acktr-gail

PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).

Language:PythonLicense:MITStargazers:0Issues:3Issues:0

pytorch_geometric

Geometric Deep Learning Extension Library for PyTorch

Language:PythonLicense:MITStargazers:0Issues:3Issues:0

RandomizedValueFunctions

Randomized Value Functions via Multiplicative Normalizing Flows

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

RenZhengfei

任正非**

Stargazers:0Issues:0Issues:0

RLPaperList

Personal Repo to keep track of RL papers

Stargazers:0Issues:0Issues:0

Super-mario-bros-A3C-pytorch

Asynchronous Advantage Actor-Critic (A3C) algorithm for Super Mario Bros

License:MITStargazers:0Issues:0Issues:0

tensorpack

A Neural Net Training Interface on TensorFlow, with focus on speed + flexibility

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

tianshou

An elegant, flexible, and superfast PyTorch deep Reinforcement Learning platform.

License:MITStargazers:0Issues:0Issues:0