Haotian Sun's repositories
AdaPlanner
AdaPlanner: Language Models for Decision Making via Adaptive Planning from Feedback
BBox-Adapter
Lightweight Adapting for Black-Box Large Language Models
absa_poc_pipeline
A GPT-3-based proof-of-concept Aspect-Based Sentiment Analysis pipeline
alf
Agent Learning Framework https://alf.readthedocs.io
alpaca_farm
A simulation framework for RLHF and alternatives. Develop your RLHF method without collecting human data.
Qetch_Plus
CS8803MDS
Awesome-LLM-Uncertainty-Reliability-Robustness
Awesome-LLM-Robustness: a curated list of Uncertainty, Reliability and Robustness in Large Language Models
cleanrl
High-quality single-file implementations of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
CORL
High-quality single-file implementations of SOTA Offline RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC
CSE6140-Fall-2022-Project-Minimum-Vertex-Cover
CSE6140 Fall 2022 Project: Minimum Vertex Cover
d2l-zh
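Dive into Deep Learning (《动手学深度学习》): written for Chinese readers, runnable, and open to discussion. The Chinese and English editions are used for teaching at 400 universities in 60 countries.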
DIG
A library for graph deep learning research
ElegantRL
Cloud-native Deep Reinforcement Learning. 🔥
homework_fall2022
Assignments for Berkeley CS 285: Deep Reinforcement Learning (Fall 2022)
lihang-code
Code implementations for the book Statistical Learning Methods (《统计学习方法》)
lora
Using Low-rank adaptation to quickly fine-tune diffusion models.
rci-agent
A codebase for "Language Models can Solve Computer Tasks"
ReAct
[ICLR 2023] ReAct: Synergizing Reasoning and Acting in Language Models
ReAgent
A platform for Reasoning systems (Reinforcement Learning, Contextual Bandits, etc.)
reflexion
Reflexion: an autonomous agent with dynamic memory and self-reflection
repeat_motion_segmentation
Segmenting a time series with repeating patterns using DTW matching
score_sde_pytorch
PyTorch implementation for Score-Based Generative Modeling through Stochastic Differential Equations (ICLR 2021, Oral)
self-refine
LLMs can generate feedback on their work, use it to improve the output, and repeat this process iteratively.
tianshou
An elegant PyTorch deep reinforcement learning library.