Hao Sun (holarissun)

holarissun

Geek Repo

Company:University of Cambridge

Home Page:https://holarissun.github.io/

Twitter:@HolarisSun

Github PK Tool:Github PK Tool

Hao Sun's repositories

PanelGPT

We introduce new zero-shot prompting magic words that improves the reasoning ability of language models: panel discussion!

Prompt-OIRL

code for paper Query-Dependent Prompt Evaluation and Optimization with Offline Inverse Reinforcement Learning

Language:PythonLicense:MITStargazers:25Issues:2Issues:4

RewardShifting

Code for NeurIPS 2022 paper Exploiting Reward Shifting in Value-Based Deep RL

Language:PythonStargazers:23Issues:3Issues:0

PCHID_code

Code for [NeurIPS'2019 Spotlight] Policy Continuation with Hindsight Inverse Dynamics

Language:Jupyter NotebookStargazers:15Issues:2Issues:0

Accountable-Offline-RL

Code for NeurIPS 2023 paper Accountability in Offline Reinforcement Learning: Explaining Decisions with a Corpus of Examples

Language:PythonStargazers:4Issues:2Issues:0
Language:PythonLicense:MITStargazers:4Issues:2Issues:1

DAUC

Code for Latent Density Models for Uncertainty Categorization

Language:PythonStargazers:2Issues:0Issues:0
Language:PythonStargazers:2Issues:0Issues:0
Language:Jupyter NotebookStargazers:2Issues:0Issues:0

NPSCO

Code for Novel Policy Seeking with Constrained Optimization

Language:PythonStargazers:2Issues:0Issues:0

BenchmarkPromptsWithResponses

Every prompt engineering paper should provide not only on-average performance of the prompting strategy, but should also release the responses to facilitate future research and avoid repeatedly calling the LLMs for the same queries+prompts.

Stargazers:1Issues:0Issues:0

LeetCodeSolution

logs for my leetcoding fall 2023

Prompt-Engineering-Guide

🐙 Guides, papers, lecture, notebooks and resources for prompt engineering

Language:MDXLicense:MITStargazers:1Issues:0Issues:0
Language:HTMLStargazers:0Issues:0Issues:0
Language:Jupyter NotebookStargazers:0Issues:0Issues:0

cuhkrlcourse.github.io

CUHK Reinforcement Learning Course

Stargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0

GPTChatAPI

Usage Example of GPT's API in chat bot applications.

Language:PythonStargazers:0Issues:0Issues:0

hindsight-experience-replay

This is the pytorch implementation of Hindsight Experience Replay (HER) - Experiment on all fetch robotic environments.

Language:Jupyter NotebookLicense:MITStargazers:0Issues:0Issues:0

holarissun.github.io

Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakes

Language:JavaScriptLicense:MITStargazers:0Issues:0Issues:0

HPoker

Texas Hold'EM Poker Game

Language:PythonStargazers:0Issues:1Issues:0

images

images in markdown files.

Stargazers:0Issues:1Issues:0
Stargazers:0Issues:0Issues:0
Language:Jupyter NotebookStargazers:0Issues:0Issues:0

Prompt4ReasoningPapers

Repository for the ACL2023 paper "Reasoning with Language Model Prompting: A Survey".

License:MITStargazers:0Issues:0Issues:0

Slides

slides for group meeting

Language:TeXStargazers:0Issues:2Issues:0

TD3

PyTorch implementation of TD3 and DDPG for OpenAI gym tasks

License:MITStargazers:0Issues:0Issues:0

tianshou

An elegant PyTorch deep reinforcement learning library.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0