Stanford Intelligent and Interactive Autonomous Systems Group's repositories
PantheonRL
PantheonRL is a package for training and testing agents in multi-agent reinforcement learning environments. PantheonRL supports cross-play, fine-tuning, ad-hoc coordination, and more.
active-preference-based-gpr
Companion code for RSS 2020 paper: "Active Preference-Based Gaussian Process Regression for Reward Learning"
explore-eqa
Public release for "Explore until Confident: Efficient Exploration for Embodied Question Answering"
RL_Routing
Companion code to TRC paper: Daniel A. Lazar, Erdem Bıyık, Dorsa Sadigh, Ramtin Pedarsani. "Learning how to Dynamically Route Autonomous Vehicles on Shared Roads". Transportation Research Part C: Emerging Technologies, vol. 130, pp. 103258, 2021; doi: 10.1016/j.trc.2021.103258.
DPP-Batch-Active-Learning
Companion code to the preprint: E Bıyık, K Wang, N Anari, D Sadigh, "Batch Active Learning using Determinantal Point Processes". arXiv preprint arXiv:1906.07975, Dec. 2019.
TREX-pytorch
A PyTorch implementation for the paper 'Extrapolating Beyond Suboptimal Demonstrations via Inverse Reinforcement Learning from Observations'.
Diverse-Conventions
Exploring techniques to generate diverse conventions in multi-agent settings
multimodal-rewards-from-rankings
Companion code to CoRL 2021 paper "Learning Multimodal Rewards from Rankings"
Learn-Imperfect-Varying-Dynamics
Code for the paper 'Learning from Imperfect Demonstrations from Agents with Varying Dynamics'
emergent-prosociality-through-gifting
Companion code for IJCAI 2021 paper "Emergent Prosociality in Multi-Agent Games Through Gifting"
reward-learning-scale-feedback
Companion code for the CoRL 2021 paper "Learning Reward Functions from Scale Feedback"
Learning-Feasibility-Different-Dynamics
Code for 'Learning Feasibility to Imitate Demonstrators with Different Dynamics'
plato_sandbox
Code for PLATO, Play-LMP, and Play-GCBC
partner-aware-ucb
Companion code to the AI-HRI 2021 paper "Partner-Aware Algorithms in Decentralized Cooperative Bandit Teams"
xembodiment-databuilder-hydra
RLDS dataset builder for converting the HYDRA dataset to the X-embodiment format