Yufeng Yuan (YufengYuan)

YufengYuan

Geek Repo

Github PK Tool:Github PK Tool

Yufeng Yuan's starred repositories

text2vec

text2vec, text to vector. 文本向量表征工具,把文本转化为向量矩阵,实现了Word2Vec、RankBM25、Sentence-BERT、CoSENT等文本表征、文本相似度计算模型,开箱即用。

Language:PythonLicense:Apache-2.0Stargazers:4229Issues:0Issues:0

Explorer

Explorer is a PyTorch reinforcement learning framework for exploring new ideas.

Language:PythonLicense:MITStargazers:87Issues:0Issues:0

D4RL

A collection of reference environments for offline reinforcement learning

Language:PythonLicense:Apache-2.0Stargazers:1244Issues:0Issues:0

BCQ

Author's PyTorch implementation of BCQ for continuous and discrete actions

Language:PythonLicense:MITStargazers:580Issues:0Issues:0

MuJoCo_RL_UR5

A MuJoCo/Gym environment for robot control using Reinforcement Learning. The task of agents in this environment is pixel-wise prediction of grasp success chances.

Language:PythonLicense:MITStargazers:365Issues:0Issues:0

surface-aggregator-module

Linux ACPI and Platform Drivers for Surface Devices using the Surface Aggregator Module over Surface Serial Hub (Surface Book 2, Surface Pro 2017, Surface Laptop, and Newer)

Language:CLicense:GPL-2.0Stargazers:93Issues:0Issues:0

linux-surface

Linux Kernel for Surface Devices

Language:ShellStargazers:4726Issues:0Issues:0

dm_control

Google DeepMind's software stack for physics-based simulation and Reinforcement Learning environments, using MuJoCo.

Language:PythonLicense:Apache-2.0Stargazers:3630Issues:0Issues:0

ai-deadlines

:alarm_clock: AI conference deadline countdowns

Language:JavaScriptLicense:MITStargazers:5475Issues:0Issues:0

Policy-Gradient-Methods

Implementation of Algorithms from the Policy Gradient Family. Currently includes: A2C, A3C, DDPG, TD3, SAC

Language:Jupyter NotebookStargazers:88Issues:0Issues:0

pytorch

Tensors and Dynamic neural networks in Python with strong GPU acceleration

Language:PythonLicense:NOASSERTIONStargazers:79941Issues:0Issues:0

Conditional_Density_Estimation

Package implementing various parametric and nonparametric methods for conditional density estimation

Language:PythonLicense:MITStargazers:183Issues:0Issues:0

bullet3

Bullet Physics SDK: real-time collision detection and multi-physics simulation for VR, games, visual effects, robotics, machine learning etc.

Language:C++License:NOASSERTIONStargazers:12161Issues:0Issues:0

ohmyzsh

🙃 A delightful community-driven (with 2,300+ contributors) framework for managing your zsh configuration. Includes 300+ optional plugins (rails, git, macOS, hub, docker, homebrew, node, php, python, etc), 140+ themes to spice up your morning, and an auto-update tool so that makes it easy to keep up with the latest updates from the community.

Language:ShellLicense:MITStargazers:170509Issues:0Issues:0

pytorch-a2c-ppo-acktr-gail

PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).

Language:PythonLicense:MITStargazers:3517Issues:0Issues:0

TD3

Author's PyTorch implementation of TD3 for OpenAI gym tasks

Language:PythonLicense:MITStargazers:1640Issues:0Issues:0

spinningup

An educational resource to help anyone learn deep reinforcement learning.

Language:PythonLicense:MITStargazers:9800Issues:0Issues:0

gym

A toolkit for developing and comparing reinforcement learning algorithms.

Language:PythonLicense:NOASSERTIONStargazers:34174Issues:0Issues:0

SenseAct

SenseAct: A computational framework for developing real-world robot learning tasks

Language:PythonLicense:BSD-3-ClauseStargazers:211Issues:0Issues:0

playground

PlayGround: AI Research into Multi-Agent Learning.

Language:PythonLicense:Apache-2.0Stargazers:763Issues:0Issues:0

996.ICU

Repo for counting stars and contributing. Press F to pay respect to glorious developers.

License:NOASSERTIONStargazers:269396Issues:0Issues:0

baselines

OpenAI Baselines: high-quality implementations of reinforcement learning algorithms

Language:PythonLicense:MITStargazers:15491Issues:0Issues:0

documentation

Issue tracker for Plotly's open-source documentation.

Stargazers:421Issues:0Issues:0

mml-book.github.io

Companion webpage to the book "Mathematics For Machine Learning"

Language:Jupyter NotebookStargazers:12642Issues:0Issues:0

interpy-zh

📘《Python进阶》(Intermediate Python - Chinese Version)

Language:CSSLicense:Apache-2.0Stargazers:6437Issues:0Issues:0

AlphaZero_Gomoku

An implementation of the AlphaZero algorithm for Gomoku (also called Gobang or Five in a Row)

Language:PythonLicense:MITStargazers:3224Issues:0Issues:0

EEDS-keras

End-to-End Image Super-Resolution via Deep and Shallow Convolutional Networks

Language:PythonStargazers:26Issues:0Issues:0