yingqichao

Qichao Ying's starred repositories

segment-anything

The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Language:Jupyter NotebookApache-2.047055 305 663

Grounded-Segment-Anything

Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything

Language:Jupyter NotebookApache-2.014912 113 386

Awesome-Multimodal-Large-Language-Models

:sparkles::sparkles:Latest Advances on Multimodal Large Language Models

12099 271 111

pulse

PULSE: Self-Supervised Photo Upsampling via Latent Space Exploration of Generative Models

Language:Python7891 228 85

FastSAM

Fast Segment Anything

Language:PythonAGPL-3.07407 56 204

Segment-Everything-Everywhere-All-At-Once

[NeurIPS 2023] Official implementation of the paper "Segment Everything Everywhere All at Once"

Language:PythonApache-2.04336 59 147

sam-hq

Segment Anything in High Quality [NeurIPS 2023]

Language:PythonApache-2.03662 77 138

EditAnything

Edit anything in images powered by segment-anything, ControlNet, StableDiffusion, etc. (ACM MM)

Language:PythonApache-2.03295 39 57

SparseConvNet

Submanifold sparse convolutional networks

Language:C++NOASSERTION2031 44 224

bolei_awesome_posters

CVPR and NeurIPS poster examples and templates. May we have in-person poster session soon!

1444 100

[ICLR'23 Spotlight🔥] The first successful BERT/MAE-style pretraining on any convolutional network; Pytorch impl. of "Designing BERT for Convolutional Networks: Sparse and Hierarchical Masked Modeling"

Language:PythonMIT1429 26 83

Video-ChatGPT

[ACL 2024 🔥] Video-ChatGPT is a video conversation model capable of generating meaningful conversation about videos. It combines the capabilities of LLMs with a pretrained visual encoder adapted for spatiotemporal video representation. We also introduce a rigorous 'Quantitative Evaluation Benchmarking' for video-based conversational models.

Language:PythonCC-BY-4.01176 14 119