Liu-Zhenya

Liu-Zhenya

Geek Repo

Github PK Tool:Github PK Tool

Liu-Zhenya's starred repositories

DI-engine

OpenDILab Decision AI Engine. The Most Comprehensive Reinforcement Learning Framework B.P.

Language:PythonLicense:Apache-2.0Stargazers:3002Issues:0Issues:0

localsend

An open-source cross-platform alternative to AirDrop

Language:DartLicense:Apache-2.0Stargazers:47377Issues:0Issues:0

transformers

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Language:PythonLicense:Apache-2.0Stargazers:132931Issues:0Issues:0

vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Language:PythonLicense:Apache-2.0Stargazers:27841Issues:0Issues:0

llama3

The official Meta Llama 3 GitHub site

Language:PythonLicense:NOASSERTIONStargazers:26467Issues:0Issues:0

llama2_chat_templater

Wrapper to easily generate the chat template for Llama2

Language:PythonLicense:Apache-2.0Stargazers:62Issues:0Issues:0

FastChat

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Language:PythonLicense:Apache-2.0Stargazers:36585Issues:0Issues:0

llama-recipes

Scripts for fine-tuning Meta Llama with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supporting a number of candid inference solutions such as HF TGI, VLLM for local or cloud deployment. Demo apps to showcase Meta Llama for WhatsApp & Messenger.

Language:Jupyter NotebookStargazers:12008Issues:0Issues:0

axolotl

Go ahead and axolotl questions

Language:PythonLicense:Apache-2.0Stargazers:7666Issues:0Issues:0

zotero-night

Night theme for Zotero UI and PDF

Language:SCSSLicense:GPL-3.0Stargazers:2387Issues:0Issues:0

Self-Rewarding-Language-Models

This is work done by the Oxen.ai Community, trying to reproduce the Self-Rewarding Language Model paper from MetaAI.

Language:PythonStargazers:102Issues:0Issues:0

ChatGPT.nvim

ChatGPT Neovim Plugin: Effortless Natural Language Generation with OpenAI's ChatGPT API

Language:LuaLicense:Apache-2.0Stargazers:3712Issues:0Issues:0

lualine.nvim

A blazing fast and easy to configure neovim statusline plugin written in pure lua.

Language:LuaLicense:MITStargazers:6025Issues:0Issues:0

transparent.nvim

Remove all background colors to make nvim transparent

Language:LuaStargazers:841Issues:0Issues:0

coc-pyright

Pyright extension for coc.nvim

Language:TypeScriptLicense:MITStargazers:1279Issues:0Issues:0

AstroNvim

AstroNvim is an aesthetic and feature-rich neovim config that is extensible and easy to use with a great set of plugins

Language:LuaLicense:GPL-3.0Stargazers:12620Issues:0Issues:0

neovim

Vim-fork focused on extensibility and usability

Language:Vim ScriptLicense:NOASSERTIONStargazers:82236Issues:0Issues:0

vim-plug

:hibiscus: Minimalist Vim Plugin Manager

Language:Vim ScriptLicense:MITStargazers:34011Issues:0Issues:0

Vundle.vim

Vundle, the plug-in manager for Vim

Language:Vim ScriptLicense:MITStargazers:23909Issues:0Issues:0

reward-bench

RewardBench: the first evaluation tool for reward models.

Language:PythonLicense:Apache-2.0Stargazers:375Issues:0Issues:0

awesome-llm-human-preference-datasets

A curated list of Human Preference Datasets for LLM fine-tuning, RLHF, and eval.

License:MITStargazers:304Issues:0Issues:0

x-transformers

A concise but complete full-attention transformer with a set of promising experimental features from various papers

Language:PythonLicense:MITStargazers:4625Issues:0Issues:0

self-rewarding-lm-pytorch

Implementation of the training framework proposed in Self-Rewarding Language Model, from MetaAI

Language:PythonLicense:MITStargazers:1318Issues:0Issues:0

awesome-RLHF

A curated list of reinforcement learning with human feedback resources (continually updated)

License:Apache-2.0Stargazers:3294Issues:0Issues:0

MedicalGPT

MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training Pipeline. 训练医疗大模型,实现了包括增量预训练(PT)、有监督微调(SFT)、RLHF、DPO、ORPO。

Language:PythonLicense:Apache-2.0Stargazers:3243Issues:0Issues:0

direct-preference-optimization

Reference implementation for DPO (Direct Preference Optimization)

Language:PythonLicense:Apache-2.0Stargazers:2061Issues:0Issues:0

trlx

A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)

Language:PythonLicense:MITStargazers:4465Issues:0Issues:0

ml-engineering

Machine Learning Engineering Open Book

Language:PythonLicense:CC-BY-SA-4.0Stargazers:11097Issues:0Issues:0

trl

Train transformer language models with reinforcement learning.

Language:PythonLicense:Apache-2.0Stargazers:9603Issues:0Issues:0

MLQuestions

Machine Learning and Computer Vision Engineer - Technical Interview Questions

Stargazers:2936Issues:0Issues:0