Stanford-ILIAD / multimodal-rewards-from-rankings

Companion code to CoRL 2021 paper "Learning Multimodal Rewards from Rankings"

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Companion code to CoRL 2021 paper:
Vivek Myers, Erdem Bıyık, Nima Anari, Dorsa Sadigh. "Learning Multimodal Rewards from Rankings". 5th Conference on Robot Learning (CoRL), London, UK, Nov. 2021.

This code actively learns multimodal reward functions from rankings in various tasks with respect to an information gain acquisition function and compares it to random querying.

The codes for the interface of the user studies are excluded, but the environments can still be simulated with the given trajectory datasets.

Dependencies

You need to have the following libraries with Python3:

Running

You simply run:

	python run.py [task_name]

where [task_name] is either of the following: lunar, fetch, synthetic. The output is a PNG file in the main directory that compares the two querying methods.

About

Companion code to CoRL 2021 paper "Learning Multimodal Rewards from Rankings"

License:MIT License


Languages

Language:Python 100.0%