yuhui (dblate)

dblate

Geek Repo

Company:Baidu

Location:Beijing, China

Github PK Tool:Github PK Tool

yuhui's starred repositories

AutoGPT

AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.

Language:PythonLicense:MITStargazers:162889Issues:1558Issues:2224

DeepSpeed

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Language:PythonLicense:Apache-2.0Stargazers:33308Issues:337Issues:2585

dify

Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting you quickly go from prototype to production.

Language:TypeScriptLicense:NOASSERTIONStargazers:33116Issues:270Issues:2151

stanford_alpaca

Code and documentation to train Stanford's Alpaca models, and generate the data.

Language:PythonLicense:Apache-2.0Stargazers:29018Issues:341Issues:267

autogen

A programming framework for agentic AI. Discord: https://aka.ms/autogen-dc. Roadmap: https://aka.ms/autogen-roadmap

Language:Jupyter NotebookLicense:CC-BY-4.0Stargazers:27041Issues:358Issues:1324

LLaMA-Factory

Unify Efficient Fine-Tuning of 100+ LLMs

Language:PythonLicense:Apache-2.0Stargazers:23979Issues:164Issues:3796

vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Language:PythonLicense:Apache-2.0Stargazers:20751Issues:195Issues:2962

LLaVA

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Language:PythonLicense:Apache-2.0Stargazers:17351Issues:156Issues:1345

peft

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Language:PythonLicense:Apache-2.0Stargazers:14549Issues:109Issues:925

Qwen

The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.

Language:PythonLicense:Apache-2.0Stargazers:11964Issues:96Issues:1018

DALLE2-pytorch

Implementation of DALL-E 2, OpenAI's updated text-to-image synthesis neural network, in Pytorch

Language:PythonLicense:MITStargazers:10918Issues:122Issues:207

tokenizers

💥 Fast State-of-the-Art Tokenizers optimized for Research and Production

Language:RustLicense:Apache-2.0Stargazers:8589Issues:121Issues:944
Language:PythonLicense:Apache-2.0Stargazers:6948Issues:67Issues:64

corenet

CoreNet: A library for training deep neural networks

Language:PythonLicense:NOASSERTIONStargazers:6643Issues:61Issues:15

interpy-zh

📘《Python进阶》(Intermediate Python - Chinese Version)

Language:CSSLicense:Apache-2.0Stargazers:6431Issues:316Issues:32

lm-evaluation-harness

A framework for few-shot evaluation of language models.

Language:PythonLicense:MITStargazers:5488Issues:36Issues:870

DiT

Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"

Language:PythonLicense:NOASSERTIONStargazers:5417Issues:46Issues:73

leptonai

A Pythonic framework to simplify AI service building

Language:PythonLicense:Apache-2.0Stargazers:2499Issues:21Issues:53

webdataset

A high-performance Python-based I/O system for large (and small) deep learning problems, with strong support for PyTorch.

Language:PythonLicense:BSD-3-ClauseStargazers:2034Issues:21Issues:295

course

The Hugging Face course on Transformers

Language:MDXLicense:Apache-2.0Stargazers:2000Issues:48Issues:132

datatrove

Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.

Language:PythonLicense:Apache-2.0Stargazers:1636Issues:41Issues:71

ceval

Official github repo for C-Eval, a Chinese evaluation suite for foundation models [NeurIPS 2023]

Language:PythonLicense:MITStargazers:1521Issues:13Issues:74

vae

a simple vae and cvae from keras

cc_net

Tools to download and cleanup Common Crawl data

Language:PythonLicense:MITStargazers:922Issues:24Issues:44

CMMLU

CMMLU: Measuring massive multitask language understanding in Chinese

bagel

A bagel, with everything.

MMMU

This repo contains evaluation code for the paper "MMMU: A Massive Multi-discipline Multimodal Understanding and Reasoning Benchmark for Expert AGI"

Language:PythonLicense:Apache-2.0Stargazers:278Issues:4Issues:25

doremi

Pytorch implementation of DoReMi, a method for optimizing the data mixture weights in language modeling datasets

Language:HTMLLicense:MITStargazers:256Issues:5Issues:27

NeMo-Skills

A pipeline to improve skills of large language models

Language:PythonLicense:Apache-2.0Stargazers:115Issues:5Issues:5