s9xie

Saining Xie's starred repositories

segment-anything

The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Language:Jupyter NotebookApache-2.045036 298 648

nanoGPT

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Language:PythonMIT33874 354 298

ControlNet

Let us control diffusion models!

Language:PythonApache-2.028658 214 523

Grounded-Segment-Anything

Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything

Language:Jupyter NotebookApache-2.013927 114 368

Awesome-Multimodal-Large-Language-Models

:sparkles::sparkles:Latest Advances on Multimodal Large Language Models

10008 233 98

ConvNeXt

Code release for ConvNeXt model

Language:PythonMIT5602 33 130

DiT

Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"

Language:PythonNOASSERTION5485 46 73

img2dataset

Easily turn large sets of image urls to an image dataset. Can download, resize and package 100M urls in 20h on one machine.

Language:PythonMIT3381 30 249

scenic

Scenic: A Jax Library for Computer Vision Research and Beyond

Language:PythonApache-2.03089 40 238

zero123

Zero-1-to-3: Zero-shot One Image to 3D Object (ICCV 2023)

Language:PythonMIT2548 43 120

Painter

Painter & SegGPT Series: Vision Foundation Models from BAAI

Language:PythonMIT2455 36 66

PixArt-alpha

PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis

Language:PythonAGPL-3.02439 410

SuGaR

[CVPR 2024] Official PyTorch implementation of SuGaR: Surface-Aligned Gaussian Splatting for Efficient 3D Mesh Reconstruction and High-Quality Mesh Rendering

Language:C++NOASSERTION1828 59 195