sweetice

Johnny He's repositories

Deep-reinforcement-learning-with-pytorch

PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and ....

Language:PythonMIT3753 34 33

PEER-CVPR23

Authors' implementation of PEER

Language:PythonMIT7 10

The present anonymous repository serves as a guide for reproducing the results of the "BEER" method proposed in our ICLR submission "Adaptive Regularization of Representation Rank as an Implicit Constraint of Bellman Equation".

Language:Python1 10

ERC-ECML-23

Anonymous code for ICML submission 45

Language:Python1 20

sweetice.github.io_old

Language:HTMLMIT1 20

ColossalAI

Making large AI models cheaper, faster and more accessible

Language:PythonApache-2.0000

dalai_llama

The simplest way to run LLaMA on your local machine

Language:CSS000

deep-successor-features-for-transfer

A reusable framework for successor features for transfer in deep reinforcement learning using keras.

Language:PythonNOASSERTION000

dice_rl

Language:PythonApache-2.0010

drqv2

DrQ-v2: Improved Data-Augmented Reinforcement Learning

Language:PythonMIT010

ffn_geyang

Public Repo for the paper "Overcoming The Spectral-Bias of Neural Value Approximation"

Language:Python000

learned-fourier-features

Code for the paper "Functional Regularization for Reinforcement Learning via Learned Fourier Features"

Language:Python000

LibMTL

A PyTorch Library for Multi-Task Learning

Language:PythonMIT000

llama

Inference code for LLaMA models

Language:PythonGPL-3.0000

LLM4Arxiv

Language:PythonNOASSERTION000

MEPE

Official implementation of MEPE

Language:Python010

mpo

PyTorch Implementation of the Maximum a Posteriori Policy Optimisation

Language:PythonGPL-3.0010

neural-approx-ss-lfi

Codes for ICLR 21 paper: Neural Approximate Sufficient Statistics for Implicit Models

Language:Jupyter Notebook010

Online-RLHF

A recipe for online RLHF.

000

pderl

Code for "Proximal Distilled Evolutionary Reinforcement Learning", accepted at AAAI 2020

Language:Python010

reward-surfaces