Aviral Kumar (aviralkumar2907)

Aviral Kumar's starred repositories

models

Models and examples built with TensorFlow

Language: Python | License: NOASSERTION | Stargazers: 77000 | Issues: 2724 | Issues: 7276

rllab

rllab is a framework for developing and evaluating reinforcement learning algorithms, fully compatible with OpenAI Gym.

Language: Python | License: NOASSERTION | Stargazers: 2898 | Issues: 162 | Issues: 183

rlkit

Collection of reinforcement learning algorithms

Language: Python | License: MIT | Stargazers: 2479 | Issues: 61 | Issues: 131

EasyLM

Large language models (LLMs) made easy. EasyLM is a one-stop solution for pre-training, fine-tuning, evaluating, and serving LLMs in JAX/Flax.

Language: Python | License: Apache-2.0 | Stargazers: 2383 | Issues: 43 | Issues: 88

distributional-dqn

Implementation of 'A Distributional Perspective on Reinforcement Learning' and 'Distributional Reinforcement Learning with Quantile Regression' based on OpenAI DQN baselines.

Language: Python | License: MIT | Stargazers: 131 | Issues: 4 | Issues: 4

CQL

Conservative Q-Learning on top of SAC

Language: Python | License: MIT | Stargazers: 118 | Issues: 5 | Issues: 7

Cal-QL

Official implementation for our paper, Cal-QL: Calibrated Offline RL Pre-Training for Efficient Online Fine-Tuning

doodad

A job launching library for docker, EC2, GCP, etc.

Language: Python | License: GPL-3.0 | Stargazers: 57 | Issues: 4 | Issues: 5

JaxCQL

Conservative Q-Learning in JAX

Language: Python | License: MIT | Stargazers: 50 | Issues: 3 | Issues: 4

PTR

This repository contains the implementation of the PTR algorithm described in the paper: Pre-Training for Robots: Leveraging Diverse Multitask Data via Offline Reinforcement Learning.

Language: Python | License: NOASSERTION | Stargazers: 29 | Issues: 3 | Issues: 1

diagnosing_qlearning

Code for Diagnosing Bottlenecks in Deep Q-learning. Contains implementations of tabular environments plus solvers.

Language: Python | License: MIT | Stargazers: 19 | Issues: 7 | Issues: 0

SimpleSAC

A simple and easy to use implementation of the soft actor-critic algorithm.

Language: Python | License: MIT | Stargazers: 15 | Issues: 3 | Issues: 0

parameterized_model

TensorFlow parameterized model library

Language: Python | License: Apache-2.0 | Stargazers: 4 | Issues: 3 | Issues: 0
Language: Python | License: Apache-2.0 | Stargazers: 3 | Issues: 3 | Issues: 0