Guangrun Wang (王广润)'s repositories

Grounded-Segment-Anything

Grounded-SAM: Marrying Grounding-DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:2Issues:0Issues:0

4D-Humans

4DHumans: Reconstructing and Tracking Humans with Transformers

Language:PythonLicense:MITStargazers:1Issues:0Issues:0

Depth-Anything

Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data. Foundation Model for Monocular Depth Estimation

License:Apache-2.0Stargazers:1Issues:0Issues:0
License:Apache-2.0Stargazers:1Issues:0Issues:0

AnyDoor

Official implementations for paper: Anydoor: zero-shot object-level image customization

License:MITStargazers:0Issues:0Issues:0

autogen

A programming framework for agentic AI. Discord: https://aka.ms/autogen-dc. Roadmap: https://aka.ms/autogen-roadmap

License:CC-BY-4.0Stargazers:0Issues:0Issues:0

CLIP

CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image

Language:Jupyter NotebookLicense:MITStargazers:0Issues:0Issues:0

dinov2

PyTorch code and models for the DINOv2 self-supervised learning method.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:0Issues:0Issues:0

DIS

This is the repo for our new project Highly Accurate Dichotomous Image Segmentation

License:Apache-2.0Stargazers:0Issues:0Issues:0

FLatten-Transformer

Official repository of FLatten Transformer (ICCV2023)

Language:PythonStargazers:0Issues:0Issues:0

GPT-4V-Act

AI agent using GPT-4V(ision) capable of using a mouse/keyboard to interact with web UI

Language:JavaScriptStargazers:0Issues:0Issues:0

humannerf

HumanNeRF turns a monocular video of moving people into a 360 free-viewpoint video.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

IDM-VTON

IDM-VTON : Improving Diffusion Models for Authentic Virtual Try-on in the Wild

Stargazers:0Issues:0Issues:0

im-server

即时通讯(IM)系统

Language:JavaLicense:NOASSERTIONStargazers:0Issues:0Issues:0

inpaint-anything

Inpaint Anything performs stable diffusion inpainting on a browser UI using masks from Segment Anything.

License:Apache-2.0Stargazers:0Issues:0Issues:0

IP-Adapter

The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with image prompt.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:0Issues:0Issues:0

ladi-vton

This is the official repository for the paper "LaDI-VTON: Latent Diffusion Textual-Inversion Enhanced Virtual Try-On".

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0

llama

Inference code for LLaMA models

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

LLaMA2-Accessory

An Open-source Toolkit for LLM Development

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

OOTDiffusion

Official implementation of OOTDiffusion: Outfitting Fusion based Latent Diffusion for Controllable Virtual Try-on

License:NOASSERTIONStargazers:0Issues:0Issues:0

PeRF

[Technical Report 2023] PERF: Panoramic Neural Radiance Field from a Single Panorama

Language:PythonStargazers:0Issues:0Issues:0

pyllama

LLaMA: Open and Efficient Foundation Language Models

Language:PythonLicense:GPL-3.0Stargazers:0Issues:0Issues:0

pytorch-image-models-v2

PyTorch image models, scripts, pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (ViT), MobileNet-V3/V2, RegNet, DPN, CSPNet, Swin Transformer, MaxViT, CoAtNet, ConvNeXt, and more

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

rcg

PyTorch implementation of RCG https://arxiv.org/abs/2312.03701

Language:PythonLicense:MITStargazers:0Issues:0Issues:0
License:NOASSERTIONStargazers:0Issues:0Issues:0

torch-ngp

A pytorch CUDA extension implementation of instant-ngp (sdf and nerf), with a GUI.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

VAR

[GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction"

License:MITStargazers:0Issues:0Issues:0

ViT-Adapter

[ICLR 2023 Spotlight] Vision Transformer Adapter for Dense Predictions

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0