dblate

followers

following

stars

Baidu

Beijing, China

yuhui's starred repositories

AutoGPT

AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.

Language:PythonMIT162889 1558 2224

DeepSpeed

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Language:PythonApache-2.033308 337 2585

dify

Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting you quickly go from prototype to production.

Language:TypeScriptNOASSERTION33116 270 2151

stanford_alpaca

Code and documentation to train Stanford's Alpaca models, and generate the data.

Language:PythonApache-2.029018 341 267

autogen

A programming framework for agentic AI. Discord: https://aka.ms/autogen-dc. Roadmap: https://aka.ms/autogen-roadmap

Language:Jupyter NotebookCC-BY-4.027041 358 1324

LLaMA-Factory

Unify Efficient Fine-Tuning of 100+ LLMs

Language:PythonApache-2.023979 164 3796

vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Language:PythonApache-2.020751 195 2962

LLaVA

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Language:PythonApache-2.017351 156 1345

peft

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Language:PythonApache-2.014549 109 925

Qwen

The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.

Language:PythonApache-2.011964 96 1018

DALLE2-pytorch

Implementation of DALL-E 2, OpenAI's updated text-to-image synthesis neural network, in Pytorch

Language:PythonMIT10918 122 207

tokenizers

💥 Fast State-of-the-Art Tokenizers optimized for Research and Production

Language:RustApache-2.08589 121 944

LWM

Language:PythonApache-2.06948 67 64

corenet

CoreNet: A library for training deep neural networks

Language:PythonNOASSERTION6643 61 15

interpy-zh

📘《Python进阶》（Intermediate Python - Chinese Version）

Language:CSSApache-2.06431 316 32

lm-evaluation-harness

A framework for few-shot evaluation of language models.

Language:PythonMIT5488 36 870

DiT

Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"

Language:PythonNOASSERTION5417 46 73

intermediatePython

Language:Python3822 167 50

leptonai

A Pythonic framework to simplify AI service building

Language:PythonApache-2.02499 21 53

webdataset

A high-performance Python-based I/O system for large (and small) deep learning problems, with strong support for PyTorch.

Language:PythonBSD-3-Clause2034 21 295

course

The Hugging Face course on Transformers

Language:MDXApache-2.02000 48 132

datatrove

Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.

Language:PythonApache-2.01636 41 71

ceval

Official github repo for C-Eval, a Chinese evaluation suite for foundation models [NeurIPS 2023]

Language:PythonMIT1521 13 74

vae

a simple vae and cvae from keras

Language:Python1223 23 14

cc_net

Tools to download and cleanup Common Crawl data

Language:PythonMIT922 24 44

CMMLU

CMMLU: Measuring massive multitask language understanding in Chinese

Language:Python605 12 31

bagel

A bagel, with everything.

Language:Python298 11 11

MMMU

This repo contains evaluation code for the paper "MMMU: A Massive Multi-discipline Multimodal Understanding and Reasoning Benchmark for Expert AGI"

Language:PythonApache-2.0278 4 25

doremi

Pytorch implementation of DoReMi, a method for optimizing the data mixture weights in language modeling datasets

Language:HTMLMIT256 5 27

NeMo-Skills

A pipeline to improve skills of large language models

Language:PythonApache-2.0115 5 5