hyy's repositories
alpaca-rlhf
Fine-tuning LLaMA with RLHF (Reinforcement Learning from Human Feedback), based on DeepSpeed Chat
Language: Python, License: MIT
tianshou
An elegant PyTorch deep reinforcement learning library.
License: MIT
x-transformers
A simple but complete full-attention transformer with a set of promising experimental features from various papers
Language: Python, License: MIT
Language: Python
deep-learning-project-template
PyTorch Lightning code guidelines for conferences
License: Apache-2.0