Morizeyao

Shopee

Shanghai, China

Zeyao Du's starred repositories

text-generation-webui

A Gradio web UI for Large Language Models.

Language: Python | License: AGPL-3.0 | 39692 326 3607

LLaMA-Factory

Efficiently Fine-Tune 100+ LLMs in WebUI (ACL 2024)

Language: Python | License: Apache-2.0 | 31245 199 4849

clash-verge

A Clash GUI based on tauri. Supports Windows, macOS and Linux.

Language: TypeScript | License: GPL-3.0 | 21127 123 795

LLaVA

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Language: Python | License: Apache-2.0 | 19415 160 1488

triton

Development repository for the Triton language and compiler

Language: C++ | License: MIT | 12818 192 1416

MiniCPM-V

MiniCPM-V 2.6: A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone

Language: Python | License: Apache-2.0 | 12017 101 520

micrograd

A tiny scalar-valued autograd engine and a neural net library on top of it with PyTorch-like API

Language: Jupyter Notebook | License: MIT | 10066 149 30

open_clip

An open source implementation of CLIP.

Language: Python | License: NOASSERTION | 9855 77 470

minbpe

Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.

Language: Python | License: MIT | 9052 83 36

Qwen2

Qwen2 is the large language model series developed by Qwen team, Alibaba Cloud.

Language: Shell | 7491 42 772

DiT

Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"

Language: Python | License: NOASSERTION | 6039 45 80

gpt-fast

Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.

Language: Python | License: BSD-3-Clause | 5536 63 98

promptbase

All things prompt engineering

Language: Python | License: MIT | 5357 59 13

mergekit

Tools for merging pretrained large language models.

Language: Python | License: LGPL-3.0 | 4530 50 295

OLMo

Modeling, training, eval, and inference code for OLMo

Language: Python | License: Apache-2.0 | 4416 45 189

torchtune

A Native-PyTorch Library for LLM Fine-tuning

Language: Python | License: BSD-3-Clause | 3995 45 528

LapisCV

📃 An out-of-the-box Markdown résumé, with support for VSCode / Obsidian / Typora

Language: CSS | License: MIT | 2659 34 13

DeBERTa

The implementation of DeBERTa

Language: Python | License: MIT | 1967 42 123

datatrove

Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.

Language: Python | License: Apache-2.0 | 1949 44 120

instructor-embedding

[ACL 2023] One Embedder, Any Task: Instruction-Finetuned Text Embeddings

Language: Python | License: Apache-2.0 | 1842 17 110

Telechat

Language: Python | 1754 23 60

open-instruct

Language: Python | License: Apache-2.0 | 1204 14 112

cookbook

Language: Jupyter Notebook | License: MIT | 1177 33 10

VisCPM

[ICLR'24 spotlight] Chinese and English Multimodal Large Model Series (Chat and Paint), based on the CPM foundation models

Language: Python | 1074 15 41

EasyContext

Memory optimization and training recipes to extrapolate language models' context length to 1 million tokens, with minimal hardware.

Language: Python | License: Apache-2.0 | 610 9 43

YuLan-Chat

YuLan: An Open-Source Large Language Model

Language: Python | License: MIT | 541 5 12

Aurora

🐳 Aurora is a Chinese-language MoE model. It builds on Mixtral-8x7B and activates the model's open-domain chat capability in Chinese.

Language: Python | License: Apache-2.0 | 257 8 17

fm-cheatsheet

Website for hosting the Open Foundation Models Cheat Sheet.

Language: JavaScript | 256 13 21

MachineLearningFAQ

Machine Learning FAQ

hf-trim

Reduce the size of pretrained Hugging Face models via vocabulary trimming.

Language: Python | License: MPL-2.0 | 39 2 5