Jason Ma (JasonMa2016)

JasonMa2016

Geek Repo

Location:Philadelphia, PA

Home Page:jasonma2016.github.io

Twitter:@JasonMa2020

Github PK Tool:Github PK Tool

Jason Ma's repositories

GoFAR

Official repository for Paper "Offline Goal-Conditioned Reinforcement Learning via f-Advantage Regression" (NeurIPS 2022)

Language:PythonLicense:MITStargazers:34Issues:4Issues:2

SMODICE

Official repository for paper "Versatile Offline Imitation from Observations and Examples via Regularized State-Occupancy Matching" (ICML 2022)

Language:PythonStargazers:25Issues:2Issues:0

LDS

Official repository for paper "Likelihood-Based Diverse Sampling for Trajectory Forecasting" (ICCV 2021)

Language:Jupyter NotebookStargazers:20Issues:2Issues:1

CODAC

Official repository for paper "Conservative Offline Distributional Reinforcement Learning" (NeurIPS 2021)

2021

Website for the Offline RL Workshop at NeurIPS 2020.

Language:HTMLStargazers:0Issues:1Issues:0

BCQ

PyTorch implementation of BCQ for "Off-Policy Deep Reinforcement Learning without Exploration"

Language:Jupyter NotebookStargazers:0Issues:1Issues:0

conv-social-pooling

Code for model proposed in: Nachiket Deo and Mohan M. Trivedi,"Convolutional Social Pooling for Vehicle Trajectory Prediction." CVPRW, 2018

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

D-eck

A deck of Cards, written in D

Language:DStargazers:0Issues:1Issues:0

dqn-pytorch

DQN to play Atari Pong

Language:PythonStargazers:0Issues:1Issues:0

embodied-clip

Official codebase for EmbCLIP

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

fairseq

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Language:PythonLicense:MITStargazers:0Issues:1Issues:0
Language:Jupyter NotebookStargazers:0Issues:1Issues:0

human_aware_rl

Code for "On the Utility of Learning about Humans for Human-AI Coordination"

Language:PythonStargazers:0Issues:1Issues:0

learn2learn

PyTorch Meta-learning Framework for Researchers

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

meta-rl-bandits

A simple RNN meta-learner

Language:PythonStargazers:0Issues:1Issues:0

mj_envs

A collection of MuJoCo based environments.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0

mjrl

Reinforcement learning algorithms for MuJoCo tasks

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0

mushroom-rl

Python library for Reinforcement Learning experiments.

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

oyster

Implementation of Efficient Off-policy Meta-learning via Probabilistic Context Variables (PEARL)

Language:PythonLicense:MITStargazers:0Issues:1Issues:0
Language:PythonStargazers:0Issues:1Issues:0

Reinforcement-learning

Modular implementations of reinforcement learning algorithms with PyTorch

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

rl

A modular, primitive-first, python-first PyTorch library for Reinforcement Learning.

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

rlkit

Collection of reinforcement learning algorithms

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

robomimic

robomimic: A Modular Framework for Robot Learning from Demonstration

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

spinningup

An educational resource to help anyone learn deep reinforcement learning.

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

tensorflow

Computation using data flow graphs for scalable machine learning

Language:C++License:Apache-2.0Stargazers:0Issues:1Issues:0
Language:HTMLStargazers:0Issues:1Issues:0