hyy's repositories

alpaca-rlhf

Finetuning LLaMA with RLHF (Reinforcement Learning with Human Feedback) based on DeepSpeed Chat

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

tianshou

An elegant PyTorch deep reinforcement learning library.

License:MITStargazers:0Issues:0Issues:0

x-transformers

A simple but complete full-attention transformer with a set of promising experimental features from various papers

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

dilp

differential ilp implemented by pytorch

Language:PythonStargazers:1Issues:0Issues:0
Language:PythonStargazers:0Issues:0Issues:0
Language:Jupyter NotebookStargazers:1Issues:0Issues:0

deep-learning-project-template

Pytorch Lightning code guideline for conferences

License:Apache-2.0Stargazers:0Issues:0Issues:0