Zeyao Du (Morizeyao)

Morizeyao

Geek Repo

Company:Shopee

Location:Shanghai, China

Github PK Tool:Github PK Tool

Zeyao Du's starred repositories

text-generation-webui

A Gradio web UI for Large Language Models.

Language:PythonLicense:AGPL-3.0Stargazers:39692Issues:326Issues:3607

LLaMA-Factory

Efficiently Fine-Tune 100+ LLMs in WebUI (ACL 2024)

Language:PythonLicense:Apache-2.0Stargazers:31245Issues:199Issues:4849

clash-verge

A Clash GUI based on tauri. Supports Windows, macOS and Linux.

Language:TypeScriptLicense:GPL-3.0Stargazers:21127Issues:123Issues:795

LLaVA

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Language:PythonLicense:Apache-2.0Stargazers:19415Issues:160Issues:1488

triton

Development repository for the Triton language and compiler

MiniCPM-V

MiniCPM-V 2.6: A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone

Language:PythonLicense:Apache-2.0Stargazers:12017Issues:101Issues:520

micrograd

A tiny scalar-valued autograd engine and a neural net library on top of it with PyTorch-like API

Language:Jupyter NotebookLicense:MITStargazers:10066Issues:149Issues:30

open_clip

An open source implementation of CLIP.

Language:PythonLicense:NOASSERTIONStargazers:9855Issues:77Issues:470

minbpe

Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.

Language:PythonLicense:MITStargazers:9052Issues:83Issues:36

Qwen2

Qwen2 is the large language model series developed by Qwen team, Alibaba Cloud.

DiT

Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"

Language:PythonLicense:NOASSERTIONStargazers:6039Issues:45Issues:80

gpt-fast

Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.

Language:PythonLicense:BSD-3-ClauseStargazers:5536Issues:63Issues:98

promptbase

All things prompt engineering

Language:PythonLicense:MITStargazers:5357Issues:59Issues:13

mergekit

Tools for merging pretrained large language models.

Language:PythonLicense:LGPL-3.0Stargazers:4530Issues:50Issues:295

OLMo

Modeling, training, eval, and inference code for OLMo

Language:PythonLicense:Apache-2.0Stargazers:4416Issues:45Issues:189

torchtune

A Native-PyTorch Library for LLM Fine-tuning

Language:PythonLicense:BSD-3-ClauseStargazers:3995Issues:45Issues:528

LapisCV

📃 开箱即用的 Markdown 简历,支持 VSCode / Obsidian / Typora

Language:CSSLicense:MITStargazers:2659Issues:34Issues:13

DeBERTa

The implementation of DeBERTa

Language:PythonLicense:MITStargazers:1967Issues:42Issues:123

datatrove

Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.

Language:PythonLicense:Apache-2.0Stargazers:1949Issues:44Issues:120

instructor-embedding

[ACL 2023] One Embedder, Any Task: Instruction-Finetuned Text Embeddings

Language:PythonLicense:Apache-2.0Stargazers:1842Issues:17Issues:110
Language:PythonLicense:Apache-2.0Stargazers:1204Issues:14Issues:112
Language:Jupyter NotebookLicense:MITStargazers:1177Issues:33Issues:10

VisCPM

[ICLR'24 spotlight] Chinese and English Multimodal Large Model Series (Chat and Paint) | 基于CPM基础模型的中英双语多模态大模型系列

EasyContext

Memory optimization and training recipes to extrapolate language models' context length to 1 million tokens, with minimal hardware.

Language:PythonLicense:Apache-2.0Stargazers:610Issues:9Issues:43

YuLan-Chat

YuLan: An Open-Source Large Language Model

Language:PythonLicense:MITStargazers:541Issues:5Issues:12

Aurora

🐳 Aurora is a [Chinese Version] MoE model. Aurora is a further work based on Mixtral-8x7B, which activates the chat capability of the model's Chinese open domain.

Language:PythonLicense:Apache-2.0Stargazers:257Issues:8Issues:17

fm-cheatsheet

Website for hosting the Open Foundation Models Cheat Sheet.

MachineLearningFAQ

Machine Learning FAQ

hf-trim

Reduce the size of pretrained Hugging Face models via vocabulary trimming.

Language:PythonLicense:MPL-2.0Stargazers:39Issues:2Issues:5