XuDong Frank Wang (frank-xwang)

frank-xwang

Geek Repo

Company:UC Berkeley

Location:San Francisco Bay Area

Home Page:http://people.eecs.berkeley.edu/~xdwang/

Twitter:@XDWang101

Github PK Tool:Github PK Tool

XuDong Frank Wang's starred repositories

segment-anything

The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:45140Issues:299Issues:650

ControlNet

Let us control diffusion models!

Language:PythonLicense:Apache-2.0Stargazers:28736Issues:214Issues:525

generative-models

Generative Models by Stability AI

Language:PythonLicense:MITStargazers:23077Issues:251Issues:273

LLaVA

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Language:PythonLicense:Apache-2.0Stargazers:17619Issues:156Issues:1362

pytorch3d

PyTorch3D is FAIR's library of reusable components for deep learning with 3D data

Language:PythonLicense:NOASSERTIONStargazers:8448Issues:148Issues:1525

dinov2

PyTorch code and models for the DINOv2 self-supervised learning method.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:8251Issues:94Issues:360

ImageBind

ImageBind One Embedding Space to Bind Them All

Language:PythonLicense:NOASSERTIONStargazers:8024Issues:100Issues:83

CogVLM

a state-of-the-art-level open visual language model | 多模态预训练模型

Language:PythonLicense:Apache-2.0Stargazers:5523Issues:65Issues:395

CenterTrack

Simultaneous object detection and tracking using center points.

Language:PythonLicense:MITStargazers:2338Issues:52Issues:277

consistencydecoder

Consistency Distilled Diff VAE

Language:PythonLicense:MITStargazers:2089Issues:23Issues:19

ResNeXt

Implementation of a classification framework from the paper Aggregated Residual Transformations for Deep Neural Networks

Language:LuaLicense:NOASSERTIONStargazers:1897Issues:75Issues:12

GLIGEN

Open-Set Grounded Text-to-Image Generation

Language:PythonLicense:MITStargazers:1880Issues:38Issues:75

LISA

Project Page for "LISA: Reasoning Segmentation via Large Language Model"

Language:PythonLicense:Apache-2.0Stargazers:1576Issues:10Issues:126

composer

Official implementation of "Composer: Creative and Controllable Image Synthesis with Composable Conditions"

CRATE

Code for CRATE (Coding RAte reduction TransformEr).

Language:PythonLicense:MITStargazers:1098Issues:20Issues:17

CutLER

Code release for "Cut and Learn for Unsupervised Object Detection and Instance Segmentation" and "VideoCutLER: Surprisingly Simple Unsupervised Video Instance Segmentation"

Language:PythonLicense:NOASSERTIONStargazers:885Issues:16Issues:57

ODISE

Official PyTorch implementation of ODISE: Open-Vocabulary Panoptic Segmentation with Text-to-Image Diffusion Models [CVPR 2023 Highlight]

Language:PythonLicense:NOASSERTIONStargazers:820Issues:40Issues:42

PointNeXt

[NeurIPS'22] PointNeXt: Revisiting PointNet++ with Improved Training and Scaling Strategies

Language:ShellLicense:MITStargazers:722Issues:14Issues:149

LaViLa

Code release for "Learning Video Representations from Large Language Models"

Language:PythonLicense:MITStargazers:456Issues:9Issues:32

BoxInstSeg

A toolbox for box-supervised instance segmentation.

Language:PythonLicense:Apache-2.0Stargazers:401Issues:13Issues:19

GTR

Global Tracking Transformers, CVPR 2022

dropout

Code release for "Dropout Reduces Underfitting"

Language:PythonLicense:NOASSERTIONStargazers:307Issues:6Issues:3

DatasetDM

[NeurIPS2023] DatasetDM:Synthesizing Data with Perception Annotations Using Diffusion Models

HIPIE

[NeurIPS2023] Code release for "Hierarchical Open-vocabulary Universal Image Segmentation"

Language:Jupyter NotebookLicense:MITStargazers:246Issues:7Issues:17

AbSViT

Official code for "Top-Down Visual Attention from Analysis by Synthesis" (CVPR 2023 highlight)

Language:Jupyter NotebookStargazers:157Issues:2Issues:7

T2I-CompBench

[Neurips 2023] T2I-CompBench: A Comprehensive Benchmark for Open-world Compositional Text-to-image Generation

Language:PythonLicense:MITStargazers:157Issues:2Issues:18

simpool

This repo contains the official implementation of ICCV 2023 paper "Keep It SimPool: Who Said Supervised Transformers Suffer from Attention Deficit?"

Language:PythonLicense:Apache-2.0Stargazers:92Issues:2Issues:1

single-img-extrapolating

Repo for the paper "Extrapolating from a Single Image to a Thousand Classes using Distillation"

OCLR_model

[NeurIPS 2022] Segmenting Moving Objects via an Object-Centric Representation. Junyu Xie, Weidi Xie, Andrew Zisserman.

Language:PythonLicense:MITStargazers:24Issues:0Issues:0

sesame

🔥 [CVPR 2024] Official implementation of "See, Say, and Segment: Teaching LMMs to Overcome False Premises (SESAME)"

Language:PythonLicense:MITStargazers:17Issues:0Issues:0