Hao Liu's repositories
ringattention
Transformers with Arbitrarily Large Context
chain-of-hindsight
Chain-of-Hindsight, A Scalable RLHF Method
language-quantized-autoencoders
Language Quantized AutoEncoders
hybrid-discriminative-generative
Hybrid Discriminative-Generative Training via Contrastive Learning
instructrl
Instruction Following Agents with Multimodal Transformers
taming-maml
Taming MAML: Efficient Unbiased Meta-Reinforcement Learning