Ruize Gao's repositories
knnmt-meta-optimizer
Implementaion of our EMNLP 2023 submission "Nearest Neighbor Machine Translation is Meta-Optimizer on Output Projection Layer"
Alpaca
Code and documentation to train Stanford's Alpaca models, and generate the data.
Language:PythonApache-2.0000
Language:Jupyter Notebook000
ruizgao.github.io
Ruize Gao's Home Page
Language:HTML000
RWKV-LM
RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best of RNN and transformer - great performance, fast inference, saves VRAM, fast training, "infinite" ctx_len, and free sentence embedding.
Language:PythonApache-2.0000
Language:Python000