zhiyuanhubj

Zhiyuan Hu's repositories

UoT

Uncertainty of Thoughts: Uncertainty-Aware Planning Enhances Information Seeking in Large Language Models

Language:Python54 2 1

ProToD

600

longLLM-Extrapolation-Papers

400

LLM-Agent-Paper-List

The paper list of the 86-page paper "The Rise and Potential of Large Language Model Based Agents: A Survey" by Zhiheng Xi et al.

200

Long_form_VideoQA

200

PaLM-rlhf-pytorch

Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM

Language:PythonMIT2 10

PPO-PyTorch

Minimal implementation of clipped objective Proximal Policy Optimization (PPO) in PyTorch

Language:PythonMIT1 10

awesome-llm-powered-agent

Awesome things about LLM-powered agents. Papers / Repos / Blogs / ...

MIT000

BIG-bench

Beyond the Imitation Game collaborative benchmark for measuring and extrapolating the capabilities of language models

Language:PythonApache-2.0000

coding-interview-university

A complete computer science study plan to become a software engineer.

CC-BY-SA-4.0010

Data-Processing-for-Satisfaction-Prediction

Language:Python000

DiffuSeq

Official Codebase for DiffuSeq

Language:Python010

DPAC-DialogueGAN

This repo implements GAN-based models for Dialogue Generation (DP-GAN, SeqGAN, and our own proposed DPAC-GAN)

Language:Python000

code_switch

Language:Python010

cs_assigment

Language:Jupyter Notebook000

EvalAI-Starters

How to create a challenge on EvalAI?

000

GPT2-Chinese

Chinese version of GPT2 training code, using BERT tokenizer.

Language:PythonMIT010

gpt4free

decentralising the Ai Industry, just some language model api's...

Language:PythonGPL-3.0000

human_evaluation

010

LLaMA-Factory

Easy-to-use LLM fine-tuning framework (LLaMA, BLOOM, Mistral, Baichuan, Qwen, ChatGLM)

Language:PythonApache-2.0000

longLLM-Extrapolation-Paper

010

MAgIC

This is the official implementation for the paper: Use Your INSTINCT: INSTruction optimization usIng Neural bandits Coupled with Transformers

000

multiwoz

Source code for end-to-end dialogue model from the MultiWOZ paper (Budzianowski et al. 2018, EMNLP)

MIT000

pics

Language:Jupyter Notebook000

Planning_Under_Uncertainty

000

Tenant

Language:Python000

tutorials

PyTorch tutorials.

BSD-3-Clause000

UGRO-CIMK23

000

zhiyuan.github.io

020

zhiyuanhubj.github.io

My personal homepage

Language:SCSSMIT000