pengchengu's starred repositories

openai-cookbook

Examples and guides for using the OpenAI API

grok-1

Grok open release

Language: Python | License: Apache-2.0 | Stargazers: 49208 | Issues: 561 | Issues: 202

Open-Sora

Open-Sora: Democratizing Efficient Video Production for All

Language: Python | License: Apache-2.0 | Stargazers: 20914 | Issues: 179 | Issues: 414

latent-diffusion

High-Resolution Image Synthesis with Latent Diffusion Models

Language: Jupyter Notebook | License: MIT | Stargazers: 11196 | Issues: 96 | Issues: 337

minbpe

Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.

Language: Python | License: MIT | Stargazers: 8809 | Issues: 82 | Issues: 36

mistral-src

Reference implementation of Mistral AI 7B v0.1 model.

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 8772 | Issues: 116 | Issues: 115

accelerate

🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (including fp8), and easy-to-configure FSDP and DeepSpeed support

Language: Python | License: Apache-2.0 | Stargazers: 7436 | Issues: 97 | Issues: 1491

mmagic

OpenMMLab Multimodal Advanced, Generative, and Intelligent Creation Toolbox. Unlock the magic 🪄: Generative-AI (AIGC), easy-to-use APIs, awesome model zoo, diffusion models, for text-to-image generation, image/video restoration/enhancement, etc.

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 6769 | Issues: 98 | Issues: 706

DiT

Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"

Language: Python | License: NOASSERTION | Stargazers: 5772 | Issues: 46 | Issues: 75

Bard-API

The unofficial Python package that returns responses from Google Bard through a cookie value.

Language: Python | License: MIT | Stargazers: 5351 | Issues: 47 | Issues: 203

composer

Supercharge Your Model Training

Language: Python | License: Apache-2.0 | Stargazers: 5077 | Issues: 51 | Issues: 536

mergekit

Tools for merging pretrained large language models.

Language: Python | License: LGPL-3.0 | Stargazers: 4183 | Issues: 47 | Issues: 266

Otter

🦦 Otter, a multi-modal model based on OpenFlamingo (open-sourced version of DeepMind's Flamingo), trained on MIMIC-IT and showcasing improved instruction-following and in-context learning ability.

Language: Python | License: MIT | Stargazers: 3526 | Issues: 100 | Issues: 160

evaluate

🤗 Evaluate: A library for easily evaluating machine learning models and datasets.

Language: Python | License: Apache-2.0 | Stargazers: 1909 | Issues: 48 | Issues: 276

TransformerEngine

A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper and Ada GPUs, to provide better performance with lower memory utilization in both training and inference.

Language: Python | License: Apache-2.0 | Stargazers: 1682 | Issues: 37 | Issues: 270

multimodal

TorchMultimodal is a PyTorch library for training state-of-the-art multimodal multi-task models at scale.

Language: Python | License: BSD-3-Clause | Stargazers: 1383 | Issues: 22 | Issues: 38

Language: Python | License: Apache-2.0 | Stargazers: 1134 | Issues: 19 | Issues: 50

torchdynamo

A Python-level JIT compiler designed to make unmodified PyTorch programs faster.

Language: Python | License: BSD-3-Clause | Stargazers: 980 | Issues: 46 | Issues: 567

RepViT

RepViT: Revisiting Mobile CNN From ViT Perspective [CVPR 2024] and RepViT-SAM: Towards Real-Time Segmenting Anything

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 681 | Issues: 10 | Issues: 70

Chinese-Mixtral-8x7B

Chinese Mixtral-8x7B (Chinese-Mixtral-8x7B)

Language: Python | License: Apache-2.0 | Stargazers: 635 | Issues: 15 | Issues: 28

OmniQuant

[ICLR2024 spotlight] OmniQuant is a simple and powerful quantization technique for LLMs.

Language: Python | License: MIT | Stargazers: 629 | Issues: 15 | Issues: 73

SiT

Official PyTorch Implementation of "SiT: Exploring Flow and Diffusion-based Generative Models with Scalable Interpolant Transformers"

Language: Python | License: MIT | Stargazers: 561 | Issues: 10 | Issues: 18

VBench

[CVPR2024 Highlight] VBench - We Evaluate Video Generation

Language: Python | License: Apache-2.0 | Stargazers: 415 | Issues: 11 | Issues: 45

Vlogger

[CVPR2024] Make Your Dream A Vlog

Language: Python | License: Apache-2.0 | Stargazers: 392 | Issues: 10 | Issues: 15

GEMMA

Genome-wide Efficient Mixed Model Association

Language: C++ | License: GPL-3.0 | Stargazers: 318 | Issues: 19 | Issues: 244

plip

Pathology Language and Image Pre-Training (PLIP) is the first vision and language foundation model for Pathology AI (Nature Medicine). PLIP is a large-scale pre-trained model that can be used to extract visual and language features from pathology images and text descriptions. The model is a fine-tuned version of the original CLIP model.

Video-Motion-Customization

VMC: Video Motion Customization using Temporal Attention Adaption for Text-to-Video Diffusion Models (CVPR 2024)

Language: Python | License: Apache-2.0 | Stargazers: 151 | Issues: 12 | Issues: 2

Gemini-API

The unofficial Python package that returns responses from Google Gemini through cookie values.

Language: Python | License: MIT | Stargazers: 149 | Issues: 7 | Issues: 31

fastserve-ai

Machine Learning Serving focused on GenAI with simplicity as the top priority.

Language: Python | License: Apache-2.0 | Stargazers: 55 | Issues: 3 | Issues: 11