Rishabh Agarwal (agarwl)

Company: @google

Location: Montreal

Home Page: agarwl.github.io

Twitter: @agarwl_

Rishabh Agarwal's repositories

neural_additive_models

Repo for open sourcing the NAMs.

pse

Website for "Contrastive Behavioral Similarity Embeddings for Generalization in Reinforcement Learning"

Language: SCSS | Stargazers: 5 | Issues: 1 | Issues: 0
Language: HTML | License: MIT | Stargazers: 4 | Issues: 0 | Issues: 0

Awesome-LLM-Long-Context-Modeling

📰 Must-read papers and blogs on LLM based Long Context Modeling 🔥

License: MIT | Stargazers: 3 | Issues: 0 | Issues: 0

neural-symbolic-machines

Neural Symbolic Machines is a framework to integrate neural networks and symbolic representations using reinforcement learning, with applications in program synthesis and semantic parsing.

Language: Python | License: Apache-2.0 | Stargazers: 3 | Issues: 1 | Issues: 0

neural_additive_models-1

Standalone Neural Additive Models, forked from google-research for easy import into Colab

Language: Python | Stargazers: 3 | Issues: 0 | Issues: 0

off_policy_mujoco

PyTorch implementation of BCQ for "Off-Policy Deep Reinforcement Learning without Exploration"

Language: Python | Stargazers: 3 | Issues: 0 | Issues: 0

iup

Webpage for Implicit Under-Parameterization Inhibits Data-Efficient Deep Reinforcement Learning

Language: SCSS | Stargazers: 2 | Issues: 1 | Issues: 0

rl_benchmark_scores

Raw scores for all papers for various benchmarks for reliable evaluation.

google-research

Google Research

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 1 | Issues: 0 | Issues: 0
Language: SCSS | Stargazers: 1 | Issues: 1 | Issues: 0
Language: Ruby | Stargazers: 1 | Issues: 0 | Issues: 0

spr

Code for "Data-Efficient Reinforcement Learning with Self-Predictive Representations"

Language: Python | License: MIT | Stargazers: 1 | Issues: 0 | Issues: 0

auto-drac

Automatic Data-Regularized Actor-Critic (Auto-DrAC)

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

dopamine

Dopamine is a research framework for fast prototyping of reinforcement learning algorithms.

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

github-buttons

Showcase the success of any GitHub repo or user with these simple, static buttons with dynamic counts.

Language: JavaScript | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

gym-minigrid

Minimalistic gridworld package for OpenAI Gym

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

handful-of-trials

Experiment code for "Deep Reinforcement Learning in a Handful of Trials using Probabilistic Dynamics Models"

Language: Python | Stargazers: 0 | Issues: 1 | Issues: 0

interpret

Fit interpretable models. Explain blackbox machine learning.

Language: C++ | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

merl

Meta Reward Learning

Language: SCSS | Stargazers: 0 | Issues: 0 | Issues: 0

netrand

Network Randomization: A Simple Technique for Generalization in Deep Reinforcement Learning / ICLR 2020

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0
Language: SCSS | Stargazers: 0 | Issues: 0 | Issues: 0
Language: SCSS | Stargazers: 0 | Issues: 0 | Issues: 0

planet

Deep Planning Network: Control from pixels by latent planning with learned dynamics

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

procgen

Procgen Benchmark: Procedurally-Generated Game-Like Gym-Environments

Language: C++ | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

python-bloom-filter

Bloom filter for Python

Language: Python | Stargazers: 0 | Issues: 0 | Issues: 0

reincarnating_rl_tmp

Open source code for reusing prior computational work in RL.

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

shields

Concise, consistent, and legible badges in SVG and raster format

License: CC0-1.0 | Stargazers: 0 | Issues: 0 | Issues: 0

tensorflow-value-iteration-networks

TensorFlow implementation of the Value Iteration Networks (NIPS '16) paper

Language: Python | Stargazers: 0 | Issues: 1 | Issues: 0