Liu-Zhenya's starred repositories
transformers
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
llama2_chat_templater
Wrapper to easily generate the chat template for Llama2
llama-recipes
Scripts for fine-tuning Meta Llama with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supports a number of candidate inference solutions, such as HF TGI and vLLM, for local or cloud deployment. Includes demo apps that showcase Meta Llama for WhatsApp & Messenger.
zotero-night
Night theme for Zotero UI and PDF
Self-Rewarding-Language-Models
This is work done by the Oxen.ai Community, trying to reproduce the Self-Rewarding Language Model paper from MetaAI.
ChatGPT.nvim
ChatGPT Neovim Plugin: Effortless Natural Language Generation with OpenAI's ChatGPT API
lualine.nvim
A blazing-fast, easy-to-configure Neovim statusline plugin written in pure Lua.
transparent.nvim
Remove all background colors to make nvim transparent
coc-pyright
Pyright extension for coc.nvim
Vundle.vim
Vundle, the plug-in manager for Vim
reward-bench
RewardBench: the first evaluation tool for reward models.
awesome-llm-human-preference-datasets
A curated list of Human Preference Datasets for LLM fine-tuning, RLHF, and eval.
x-transformers
A concise but complete full-attention transformer with a set of promising experimental features from various papers
self-rewarding-lm-pytorch
Implementation of the training framework proposed in Self-Rewarding Language Model, from MetaAI
awesome-RLHF
A curated list of reinforcement learning from human feedback (RLHF) resources (continually updated)
MedicalGPT
MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training Pipeline. Trains medical large language models, implementing continued pretraining (PT), supervised fine-tuning (SFT), RLHF, DPO, and ORPO.
direct-preference-optimization
Reference implementation for DPO (Direct Preference Optimization)
ml-engineering
Machine Learning Engineering Open Book
MLQuestions
Machine Learning and Computer Vision Engineer - Technical Interview Questions