Haotian Sun (haotiansun14)

Company: Georgia Institute of Technology

Location: Atlanta, GA

Home Page: https://haotiansun.tech

Haotian Sun's repositories

AdaPlanner

AdaPlanner: Language Models for Decision Making via Adaptive Planning from Feedback

Language: HTML | License: MIT | Stargazers: 74 | Issues: 2 | Issues: 2

BBox-Adapter

Lightweight Adapting for Black-Box Large Language Models

rl-rep

Representation Learning (RepL) Methods in Reinforcement Learning and Causal Inference

Language: Python | Stargazers: 3 | Issues: 0 | Issues: 0

absa_poc_pipeline

A GPT-3-based proof-of-concept Aspect-Based Sentiment Analysis pipeline

Language: Jupyter Notebook | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

alf

Agent Learning Framework https://alf.readthedocs.io

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

alpaca_farm

A simulation framework for RLHF and alternatives. Develop your RLHF method without collecting human data.

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

Qetch_Plus

CS8803MDS

Language: Jupyter Notebook | License: MIT | Stargazers: 0 | Issues: 1 | Issues: 0
Stargazers: 0 | Issues: 0 | Issues: 0

Awesome-LLM-Uncertainty-Reliability-Robustness

Awesome-LLM-Robustness: a curated list of Uncertainty, Reliability and Robustness in Large Language Models

License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0
Language: JavaScript | Stargazers: 0 | Issues: 1 | Issues: 0

cleanrl

High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)

Language: Python | License: NOASSERTION | Stargazers: 0 | Issues: 0 | Issues: 0

CORL

High-quality single-file implementations of SOTA Offline RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

CSE6140-Fall-2022-Project-Minimum-Vertex-Cover

CSE6140 Fall 2022 Project: Minimum Vertex Cover

Language: Jupyter Notebook | Stargazers: 0 | Issues: 1 | Issues: 0

d2l-zh

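Dive into Deep Learning (《动手学深度学习》): written for Chinese readers, runnable, and open to discussion. The Chinese and English editions are used for teaching at 400 universities in 60 countries.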

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

DIG

A library for graph deep learning research

Language: Python | License: GPL-3.0 | Stargazers: 0 | Issues: 0 | Issues: 0

ElegantRL

Cloud-native Deep Reinforcement Learning. 🔥

License: NOASSERTION | Stargazers: 0 | Issues: 0 | Issues: 0

homework_fall2022

Assignments for Berkeley CS 285: Deep Reinforcement Learning (Fall 2022)

Language: Jupyter Notebook | Stargazers: 0 | Issues: 0 | Issues: 0

lihang-code

Code implementations for the book Statistical Learning Methods (《统计学习方法》)

Stargazers: 0 | Issues: 0 | Issues: 0

lora

Using Low-rank adaptation to quickly fine-tune diffusion models.

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0
Language: Python | Stargazers: 0 | Issues: 0 | Issues: 0

rci-agent

A codebase for "Language Models can Solve Computer Tasks"

License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

ReAct

[ICLR 2023] ReAct: Synergizing Reasoning and Acting in Language Models

Language: Jupyter Notebook | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

ReAgent

A platform for Reasoning systems (Reinforcement Learning, Contextual Bandits, etc.)

Language: Python | License: BSD-3-Clause | Stargazers: 0 | Issues: 0 | Issues: 0

reflexion

Reflexion: an autonomous agent with dynamic memory and self-reflection

License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

repeat_motion_segmentation

Segmenting a time series with repeating patterns using DTW matching

Language: Python | Stargazers: 0 | Issues: 0 | Issues: 0
Language: JavaScript | Stargazers: 0 | Issues: 1 | Issues: 0
License: BSD-3-Clause | Stargazers: 0 | Issues: 0 | Issues: 0

score_sde_pytorch

PyTorch implementation for Score-Based Generative Modeling through Stochastic Differential Equations (ICLR 2021, Oral)

License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

self-refine

LLMs can generate feedback on their work, use it to improve the output, and repeat this process iteratively.

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

tianshou

An elegant PyTorch deep reinforcement learning library.

License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0