Beast code in Giters

🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (including fp8), and easy-to-configure FSDP and DeepSpeed support

Language:PythonApache-2.07436 97 1491

OpenMMLab Multimodal Advanced, Generative, and Intelligent Creation Toolbox. Unlock the magic 🪄: Generative-AI (AIGC), easy-to-use APIs, awsome model zoo, diffusion models, for text-to-image generation, image/video restoration/enhancement, etc.

Language:Jupyter NotebookApache-2.06769 98 706

DiT

Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"

Language:PythonNOASSERTION5772 46 75

Bard-API

The unofficial python package that returns response of Google Bard through cookie value.

Language:PythonMIT5351 47 203

composer

Supercharge Your Model Training

Language:PythonApache-2.05077 51 536

mergekit

Tools for merging pretrained large language models.

Language:PythonLGPL-3.04183 47 266

Otter

🦦 Otter, a multi-modal model based on OpenFlamingo (open-sourced version of DeepMind's Flamingo), trained on MIMIC-IT and showcasing improved instruction-following and in-context learning ability.

Language:PythonMIT3526 100 160

evaluate

🤗 Evaluate: A library for easily evaluating machine learning models and datasets.

Language:PythonApache-2.01909 48 276

TransformerEngine

A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper and Ada GPUs, to provide better performance with lower memory utilization in both training and inference.

Language:PythonApache-2.01682 37 270

multimodal

TorchMultimodal is a PyTorch library for training state-of-the-art multimodal multi-task models at scale.

Language:PythonBSD-3-Clause1383 22 38

megablocks

Language:PythonApache-2.01134 19 50

torchdynamo

A Python-level JIT compiler designed to make unmodified PyTorch programs faster.

Language:PythonBSD-3-Clause980 46 567

RepViT

RepViT: Revisiting Mobile CNN From ViT Perspective [CVPR 2024] and RepViT-SAM: Towards Real-Time Segmenting Anything

Language:Jupyter NotebookApache-2.0681 10 70

Chinese-Mixtral-8x7B

中文Mixtral-8x7B（Chinese-Mixtral-8x7B）

Language:PythonApache-2.0635 15 28

OmniQuant

[ICLR2024 spotlight] OmniQuant is a simple and powerful quantization technique for LLMs.

Language:PythonMIT629 15 73

SiT

Official PyTorch Implementation of "SiT: Exploring Flow and Diffusion-based Generative Models with Scalable Interpolant Transformers"

Language:PythonMIT561 10 18

VBench

[CVPR2024 Highlight] VBench - We Evaluate Video Generation

Language:PythonApache-2.0415 11 45

Vlogger

[CVPR2024] Make Your Dream A Vlog

Language:PythonApache-2.0392 10 15

GEMMA

Genome-wide Efficient Mixed Model Association

Language:C++GPL-3.0318 19 244

plip

Pathology Language and Image Pre-Training (PLIP) is the first vision and language foundation model for Pathology AI (Nature Medicine). PLIP is a large-scale pre-trained model that can be used to extract visual and language features from pathology images and text description. The model is a fine-tuned version of the original CLIP model.

Language:Python241 6 23

pengchengu

pengchengu's starred repositories

openai-cookbook

grok-1

Open-Sora

CVPR2024-Papers-with-Code

latent-diffusion

minbpe

mistral-src

accelerate

mmagic

DiT