Beast code in Giters

Yuzhong Zhao's starred repositories

segment-anything

The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Language:Jupyter NotebookApache-2.046077 304 658

LLaVA

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Language:PythonApache-2.018578 159 1431

Awesome-Diffusion-Models

A collection of resources and papers on Diffusion Models

Language:HTMLMIT10561 265 45

LoRA

Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"

Language:PythonMIT10012 65 105

fiftyone

The open-source tool for building high-quality datasets and computer vision models

Language:PythonApache-2.07971 56 1487

AppAgent

AppAgent: Multimodal Agents as Smartphone Users, an LLM-based multimodal agent framework designed to operate smartphone apps.

Language:PythonMIT4754 60 79

recognize-anything

Open-source and strong foundation image recognition models.

Language:Jupyter NotebookApache-2.02659 26 154

InternImage

[CVPR 2023 Highlight] InternImage: Exploring Large-Scale Vision Foundation Models with Deformable Convolutions

Language:PythonMIT2439 34 260

Semantic-SAM

[ECCV 2024] Official implementation of the paper "Semantic-SAM: Segment and Recognize Anything at Any Granularity"

Language:Python2182 23 91

densecap

Dense image captioning in Torch

Language:Jupyter NotebookMIT1575 68 89

DDNM

[ICLR 2023 Oral] Zero-Shot Image Restoration Using Denoising Diffusion Null-Space Model

Language:PythonMIT1099 27 72

ICCV 2023 Papers: Discover cutting-edge research from ICCV 2023, the leading computer vision conference. Stay updated on the latest in computer vision and deep learning, with code included. ⭐ support visual intelligence development!

Language:PythonMIT906 13 10

AlphaCLIP

[CVPR 2024] Alpha-CLIP: A CLIP Model Focusing on Wherever You Want

Language:Jupyter NotebookApache-2.0611 11 50

Awesome-Referring-Image-Segmentation

:books: A collection of papers about Referring Image Segmentation.

589 15 8

SEED

Official implementation of SEED-LLaMA (ICLR 2024).

Language:PythonNOASSERTION541 14 48

DatasetDM

[NeurIPS2023] DatasetDM:Synthesizing Data with Perception Annotations Using Diffusion Models

Language:Python292 16 34

VLDet

[ICLR 2023] PyTorch implementation of VLDet （https://arxiv.org/abs/2211.14843）

Language:PythonNOASSERTION177 5 17

Prompt-Can-Anything

You can do anything by sota AI with prompt ,auto AI tools , VL larger model fine and project

Language:Jupyter NotebookGPL-3.0175 7 1

Weakly-Supervised-Object-Localization

Weakly Supervised Object Localization Papers

166 11 2

ptp

[CVPR2023] The code for 《Position-guided Text Prompt for Vision-Language Pre-training》

Language:PythonApache-2.0147 7 10

PartImageNet

Introduction and scripts for the paper "PartImageNet: A Large, High-Quality Dataset of Parts" (Ju He, Shuo Yang, Shaokang Yang, Adam Kortylewski, Xiaoding Yuan, Jie-Neng Chen, Shuai Liu, Cheng Yang, Alan Yuille).

114 5 16