yuanpengtu

Penalty_kl's starred repositories

PASD

[ECCV2024] Pixel-Aware Stable Diffusion for Realistic Image Super-Resolution and Personalized Stylization

Language: Python | Apache-2.0 | 83000

dream-ood

source code for NeurIPS'23 paper "Dream the Impossible: Outlier Imagination with Diffusion Models"

Language: Python | NOASSERTION | 5700

StreamDiffusion

StreamDiffusion: A Pipeline-Level Solution for Real-Time Interactive Generation

Language: Python | Apache-2.0 | 934500

awesome_OpenSetRecognition_list

A curated list of papers & resources linked to open set recognition, out-of-distribution, open set domain adaptation and open world recognition

103700

T3Bench

T3Bench: Benchmarking Current Progress in Text-to-3D Generation

Language: Python | 107600

RAVE

RAVE: Randomized Noise Shuffling for Fast and Consistent Video Editing with Diffusion Models - CVPR 2024 - Official Repo

Language: Python | MIT | 24400

AnyDoor

Official implementations for paper: Anydoor: zero-shot object-level image customization

Language: Python | MIT | 385000

CCEdit

CCEdit: Creative and Controllable Video Editing via Diffusion Models

Language: Python | NOASSERTION | 8000

vid2vid-zero

Zero-Shot Video Editing Using Off-The-Shelf Image Diffusion Models

Language: Python | 32900

vidtome-diffusion.github.io

Project webpage of paper "VidToMe: Video Token Merging for Zero-Shot Video Editing".

Language: JavaScript | 100

Ground-A-Video

Ground-A-Video: Zero-shot Grounded Video Editing using Text-to-image Diffusion Models (ICLR 2024)

Language: Python | 12400

Show-1

Show-1: Marrying Pixel and Latent Diffusion Models for Text-to-Video Generation

Language: Python | NOASSERTION | 107800

2021-TIP-SIAMH

Salience-Guided Iterative Asymmetric Mutual Hashing for Fast Person Re-Identification (IEEE TIP 2021)

Language: Python | 400

StableVideo

[ICCV 2023] StableVideo: Text-driven Consistency-aware Diffusion Video Editing

Language: Python | Apache-2.0 | 136300

VideoSwap

Code for [CVPR 2024] VideoSwap: Customized Video Subject Swapping with Interactive Semantic Point Correspondence

32800

DreamComposer

[CVPR 2024] DreamComposer: Controllable 3D Object Generation via Multi-View Conditions

Language: Python | MIT | 11200

Awesome-Realistic-Semi-Supervised-Learning

An awesome paper list of Semi-Supervised Learning under realistic settings.

Language: Shell | 8600

LL3DA

[CVPR 2024] "LL3DA: Visual Interactive Instruction Tuning for Omni-3D Understanding, Reasoning, and Planning"; an interactive Large Language 3D Assistant.

Language: Python | MIT | 20900

Awesome-MIM

[Survey] Masked Modeling for Self-supervised Representation Learning on Vision and Beyond (https://arxiv.org/abs/2401.00897)

Language: Python | Apache-2.0 | 27500

Upscale-A-Video

Upscale-A-Video: Temporal-Consistent Diffusion Model for Real-World Video Super-Resolution

88300

Qwen

The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.

Language: Python | Apache-2.0 | 1283700

InstantID

InstantID : Zero-shot Identity-Preserving Generation in Seconds 🔥

Language: Python | Apache-2.0 | 1061000

EdgeSAM

Official PyTorch implementation of "EdgeSAM: Prompt-In-the-Loop Distillation for On-Device Deployment of SAM"

Language: Jupyter Notebook | NOASSERTION | 80400

SLEEG

Source code for Self-supervised Likelihood Estimation with Energy Guidance for Anomaly Segmentation in Urban Scenes (AAAI 2024)

Language: Python | 300

MotionDirector

MotionDirector: Motion Customization of Text-to-Video Diffusion Models.

Language: Python | Apache-2.0 | 76600

semivl

Official Implementation of SemiVL: Semi-Supervised Semantic Segmentation with Vision-Language Guidance

Language: Python | Apache-2.0 | 8300

This is the official PyTorch Implementation of "SoTTA: Robust Test-Time Adaptation on Noisy Data Streams (NeurIPS '23)" by Taesik Gong*, Yewon Kim*, Taeckyung Lee*, Sorn Chottananurak, and Sung-Ju Lee (* Equal contribution).

Language: Python | MIT | 1600

SeCo

Semantic Connectivity-Driven Pseudo-labeling for Cross-domain Segmentation

Language: Python | 900

VGen

Official repo for VGen: a holistic video generation ecosystem for video generation building on diffusion models

Language: Python | 280600