holarissun

Hao Sun's repositories

PanelGPT

We introduce new zero-shot prompting magic words that improves the reasoning ability of language models: panel discussion!

Language:Python110 3 2

Prompt-OIRL

code for paper Query-Dependent Prompt Evaluation and Optimization with Offline Inverse Reinforcement Learning

Language:PythonMIT26 2 4

RewardShifting

Code for NeurIPS 2022 paper Exploiting Reward Shifting in Value-Based Deep RL

Language:Python25 30

PCHID_code

Code for [NeurIPS'2019 Spotlight] Policy Continuation with Hindsight Inverse Dynamics

Language:Jupyter Notebook15 20

Accountable-Offline-RL

Code for NeurIPS 2023 paper Accountability in Offline Reinforcement Learning: Explaining Decisions with a Corpus of Examples

Language:Python4 20

Action-Refined-Temporal-Difference

Language:Jupyter Notebook2 10

DAUC

Code for Latent Density Models for Uncertainty Categorization

Language:Python2 10

NPSCO

Code for Novel Policy Seeking with Constrained Optimization

Language:Python200

Every prompt engineering paper should provide not only on-average performance of the prompting strategy, but should also release the responses to facilitate future research and avoid repeatedly calling the LLMs for the same queries+prompts.

100

LeetCodeSolution

logs for my leetcoding fall 2023

1 10

Policy-Continuation-with-Hindsight-Inverse-Dynamics

Language:Jupyter Notebook100

Prompt-Engineering-Guide

🐙 Guides, papers, lecture, notebooks and resources for prompt engineering

Language:MDXMIT100

2Groza.github.io

Language:HTML000

Causal-RL

Language:Jupyter Notebook000

cuhkrlcourse.github.io

CUHK Reinforcement Learning Course

000

decisionforce.github.io

Language:HTML010

GPTChatAPI

Usage Example of GPT's API in chat bot applications.

Language:Python010

hindsight-experience-replay

This is the pytorch implementation of Hindsight Experience Replay (HER) - Experiment on all fetch robotic environments.

Language:Jupyter NotebookMIT000

holarissun.github.io

Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakes

Language:JavaScriptMIT000

HPoker

Texas Hold'EM Poker Game

Language:Python010

images

images in markdown files.

010

MPhil_Thesis

000

paperreading

slides

Language:Jupyter Notebook000

Prompt4ReasoningPapers

Repository for the ACL2023 paper "Reasoning with Language Model Prompting: A Survey".

MIT000

Slides

slides for group meeting

Language:TeX020

TD3

PyTorch implementation of TD3 and DDPG for OpenAI gym tasks