Hao Sun's repositories
Prompt-OIRL
code for paper Query-Dependent Prompt Evaluation and Optimization with Offline Inverse Reinforcement Learning
RewardShifting
Code for NeurIPS 2022 paper Exploiting Reward Shifting in Value-Based Deep RL
PCHID_code
Code for [NeurIPS'2019 Spotlight] Policy Continuation with Hindsight Inverse Dynamics
Accountable-Offline-RL
Code for NeurIPS 2023 paper Accountability in Offline Reinforcement Learning: Explaining Decisions with a Corpus of Examples
BenchmarkPromptsWithResponses
Every prompt engineering paper should provide not only on-average performance of the prompting strategy, but should also release the responses to facilitate future research and avoid repeatedly calling the LLMs for the same queries+prompts.
LeetCodeSolution
logs for my leetcoding fall 2023
Prompt-Engineering-Guide
🐙 Guides, papers, lecture, notebooks and resources for prompt engineering
cuhkrlcourse.github.io
CUHK Reinforcement Learning Course
GPTChatAPI
Usage Example of GPT's API in chat bot applications.
hindsight-experience-replay
This is the pytorch implementation of Hindsight Experience Replay (HER) - Experiment on all fetch robotic environments.
holarissun.github.io
Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakes
paperreading
slides
Prompt4ReasoningPapers
Repository for the ACL2023 paper "Reasoning with Language Model Prompting: A Survey".
TD3
PyTorch implementation of TD3 and DDPG for OpenAI gym tasks
tianshou
An elegant PyTorch deep reinforcement learning library.