Zhiyuan Hu's repositories
LLM-Agent-Paper-List
The paper list of the 86-page paper "The Rise and Potential of Large Language Model Based Agents: A Survey" by Zhiheng Xi et al.
PaLM-rlhf-pytorch
Implementation of RLHF (Reinforcement Learning from Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM.
PPO-PyTorch
Minimal implementation of clipped objective Proximal Policy Optimization (PPO) in PyTorch
awesome-llm-powered-agent
Awesome things about LLM-powered agents. Papers / Repos / Blogs / ...
BIG-bench
Beyond the Imitation Game collaborative benchmark for measuring and extrapolating the capabilities of language models
coding-interview-university
A complete computer science study plan to become a software engineer.
DPAC-DialogueGAN
This repo implements GAN-based models for Dialogue Generation (DP-GAN, SeqGAN, and our own proposed DPAC-GAN)
EvalAI-Starters
How to create a challenge on EvalAI?
GPT2-Chinese
Chinese version of GPT2 training code, using BERT tokenizer.
gpt4free
Decentralising the AI industry: just some language model APIs...
LLaMA-Factory
Easy-to-use LLM fine-tuning framework (LLaMA, BLOOM, Mistral, Baichuan, Qwen, ChatGLM)
MAgIC
This is the official implementation for the paper: Use Your INSTINCT: INSTruction optimization usIng Neural bandits Coupled with Transformers
multiwoz
Source code for end-to-end dialogue model from the MultiWOZ paper (Budzianowski et al. 2018, EMNLP)
pics
pics
tutorials
PyTorch tutorials.
zhiyuanhubj.github.io
My personal homepage