There are 0 repository under preference-learning topic.
RewardBench: the first evaluation tool for reward models.
Free and open source code of the https://tournesol.app platform. Meet the community on Discord https://discord.gg/WvcSG55Bf3
This repository contains the source code for our paper: "NaviSTAR: Socially Aware Robot Navigation with Hybrid Spatio-Temporal Graph Transformer and Preference Learning". For more details, please refer to our project website at https://sites.google.com/view/san-navistar.
Python-based GUI to collect Feedback of Chemist in Molecules
A comprehensive survey on Internal Consistency and Self-Feedback in Large Language Models.
PyTorch implementations for Offline Preference-Based RL (PbRL) algorithms
Data and models for the paper "Configurable Safety Tuning of Language Models with Synthetic Preference Data"
Preference Learning with Gaussian Processes and Bayesian Optimization
This repository contains the source code for our paper: "Feedback-efficient Active Preference Learning for Socially Aware Robot Navigation", accepted to IROS-2022. For more details, please refer to our project website at https://sites.google.com/view/san-fapl.
A paper under AAAI-20 review
Java framework for Preference Learning
Code for the project: "Analysis of Recommendation-systems based on User Preferences".
Code for the paper "Reward Design for Justifiable Sequential Decision-Making"; ICLR 2024
Constructive Preference Elicitation for Social Choice With Setwise max-margin Learning.
APReL: Active preference-based reward learning for human-robot interaction. Utilizing "Mountain Car" environment, learn from human preferences to reach the goal state. Applications in robotics and adaptability to other learning methods.
Survey of preference alignment algorithms
Project about experiments of the use of ILASP as a post-hoc method over black-box models, in which we also study and approach technical issues like exponential time execution.
(AISTATS 2024) "Looping in the Human: Collaborative and Explainable Bayesian Optimization"
Bayesian Spatial Bradley--Terry
An analysis of preference comparisons based on the Bayes factor
Project on preference learning - ENSAE ParisTech