agarwl

Rishabh Agarwal's repositories

neural_additive_models

Repo for open sourcing the NAMs.

25 50

pse

Website for "Contrastive Behavioral Similarity Embeddings for Generalization in Reinforcement Learning"

Language:SCSS5 10

agarwl.github.io

Language:HTMLMIT400

Awesome-LLM-Long-Context-Modeling

📰 Must-read papers and blogs on LLM based Long Context Modeling 🔥

MIT300

neural-symbolic-machines

Neural Symbolic Machines is a framework to integrate neural networks and symbolic representations using reinforcement learning, with applications in program synthesis and semantic parsing.

Language:PythonApache-2.03 10

neural_additive_models-1

stand alone Neural Additive Models, forked from google-reasearch for easy import to colab

Language:Python300

off_policy_mujoco

PyTorch implementation of BCQ for "Off-Policy Deep Reinforcement Learning without Exploration"

Language:Python300

iup

Webpage for Implicit Under-Parameterization Inhibits Data-Efficient Deep Reinforcement Learning

Language:SCSS2 10

rl_benchmark_scores

Raw scores for all papers for various benchmarks for reliable evaluation.

2 10

google-research

Google Research

Language:Jupyter NotebookApache-2.0100

reincarnating_rl

Language:SCSS1 20

rliable

Language:SCSS1 10

scala-open-letter.github.io

Language:Ruby100

spr

Code for "Data-Efficient Reinforcement Learning with Self-Predictive Representations"

Language:PythonMIT100

auto-drac

Automatic Data-Regularized Actor-Critic (Auto-DrAC)

Language:PythonMIT000

dopamine

Dopamine is a research framework for fast prototyping of reinforcement learning algorithms.

Language:Jupyter NotebookApache-2.0000

github-buttons

Showcase the success of any GitHub repo or user with these simple, static buttons with dynamic counts.

Language:JavaScriptApache-2.0000

gym-minigrid

Minimalistic gridworld package for OpenAI Gym

Language:PythonApache-2.0000

handful-of-trials

Experiment code for "Deep Reinforcement Learning in a Handful of Trials using Probabilistic Dynamics Models"

Language:Python010

interpret

Fit interpretable models. Explain blackbox machine learning.

Language:C++MIT000

merl

Meta Reward Learning

Language:SCSS000

netrand

Network Randomization: A Simple Technique for Generalization in Deep Reinforcement Learning / ICLR 2020

Language:PythonMIT000

neural-additive-models.github.io

Language:SCSS000

offline-rl.github.io

Language:SCSS000

planet

Deep Planning Network: Control from pixels by latent planning with learned dynamics

Language:PythonApache-2.0000

procgen

Procgen Benchmark: Procedurally-Generated Game-Like Gym-Environments

Language:C++MIT000

python-bloom-filter

Bloom filter for Python

Language:Python000

reincarnating_rl_tmp

Open source code for reusing prior computational work in RL.

Language:PythonApache-2.0000

shields

Concise, consistent, and legible badges in SVG and raster format

CC0-1.0000

tensorflow-value-iteration-networks

TensorFlow implementation of the Value Iteration Networks (NIPS '16) paper

Language:Python010