idanshen

idanshen's repositories

Language: Python | License: MIT | Stargazers: 14 | Issues: 0 | Issues: 0

alpaca_farm

A Simulation Framework for RLHF and alternatives

Language: Python | License: Apache-2.0 | Stargazers: 3 | Issues: 2 | Issues: 0
Language: Jupyter Notebook | License: BSD-3-Clause | Stargazers: 2 | Issues: 1 | Issues: 0

cleanrl

High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)

Language: Python | License: NOASSERTION | Stargazers: 1 | Issues: 0 | Issues: 0

AppriximateConvolutionalSparseCoding

An implementation of approximate convolutional sparse coding (CSC) based on paper: https://arxiv.org/abs/1711.00328

Language: Python | Stargazers: 0 | Issues: 0 | Issues: 0
Stargazers: 0 | Issues: 1 | Issues: 0

CQL

Code for conservative Q-learning

Language: Python | Stargazers: 0 | Issues: 0 | Issues: 0

DiffHand

[RSS 2021] An End-to-End Differentiable Framework for Contact-Aware Robot Design

Language: C++ | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

EAAC

PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

easyrl

A collection of reinforcement learning algorithms.

Language: Python | Stargazers: 0 | Issues: 0 | Issues: 0
Language: JavaScript | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0
Language: HTML | Stargazers: 0 | Issues: 1 | Issues: 0
Language: Python | License: MIT | Stargazers: 0 | Issues: 1 | Issues: 0
Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0
Language: Python | Stargazers: 0 | Issues: 0 | Issues: 0

trl

Train transformer language models with reinforcement learning.

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

trlx

A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

v202

Proceedings of ICML 2023

Language: TeX | Stargazers: 0 | Issues: 0 | Issues: 0