yiyunchen

Yiyun Chen's starred repositories

MIGC

[CVPR 2024 Highlight] "MIGC: Multi-Instance Generation Controller for Text-to-Image Synthesis" (Official Implementation)

Language:PythonNOASSERTION51700

diffusers

🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.

Language:PythonApache-2.02529500

HI-Diff

PyTorch code for our NeurIPS 2023 paper "Hierarchical Integration Diffusion Model for Realistic Image Deblurring"

Language:PythonApache-2.015900

nerf

Code release for NeRF (Neural Radiance Fields)

Language:Jupyter NotebookMIT982200

MGLD-VSR

Code for ECCV 2024 Paper "Motion-Guided Latent Diffusion for Temporally Consistent Real-world Video Super-resolution"

Language:PythonNOASSERTION8600

SCEdit

Official repo: SCEdit: Efficient and Controllable Image Diffusion Generation via Skip Connection Editing

4600

BSSTNet

Implementation of "Blur-aware Spatio-temporal Sparse Transformer for Video Deblurring". (Zhang et al., CVPR 2024)

Language:Python2100

U-DiT

[NeurIPS 2024] The official code of "U-DiTs: Downsample Tokens in U-Shaped Diffusion Transformers"

Language:PythonNOASSERTION6900

PySceneDetect

:movie_camera: Python and OpenCV-based scene cut/transition detection program & library.

Language:PythonBSD-3-Clause314700

Open-Sora

Open-Sora: Democratizing Efficient Video Production for All

Language:PythonApache-2.02172000

EasyAnimate

📺 An End-to-End Solution for High-Resolution and Long Video Generation Based on Transformer Diffusion

Language:PythonApache-2.0119200

Awesome-Deblurring

A curated list of resources for Image and Video Deblurring

239700

CFDVSR

Collaborative Feedback Discriminative Propagation for Video Super-Resolution

3900

VFRxBenchmark

[NTIRE2024] official code for "Towards Real-world Video Face Restoration: A New Benchmark"

Language:PythonNOASSERTION1800

Edit-Your-Motion

The code of Edit-Your-Motion

Apache-2.01100

Make-An-Audio

PyTorch Implementation of Make-An-Audio (ICML'23) with a Text-to-Audio Generative Model

Language:PythonMIT73900

demucs

Code for the paper Hybrid Spectrogram and Waveform Source Separation

Language:PythonMIT817600

asteroid

The PyTorch-based audio source separation toolkit for researchers

Language:PythonMIT222900

music_source_separation

Language:PythonNOASSERTION125700

pydct

Short-Time Discrete Cosine Transform (DCT) for Python. SciPy, TensorFlow and PyTorch implementations.

Language:Jupyter NotebookISC2700

FRCRN

12700

MP-SENet

Explicit Estimation of Magnitude and Phase Spectra in Parallel for High-Quality Speech Enhancement

Language:PythonMIT29000

ReLoBlur

Language:Python5200

PGTFormer

[IJCAI'24] Beyond Alignment: Blind Video Face Restoration via Parsing-Guided Temporal-Coherent Transformer

Language:PythonNOASSERTION17000

SceneSegmentation-SCRL

Code for CVPR 2022 paper "Scene Consistency Representation Learning for Video Scene Segmentation"

Language:PythonNOASSERTION8700

chorus-detection

Language:Python1500

DeepChorus

An end-to-end chorus detection model DeepChorus.

Language:Python3000

chorus-from-music-structure

chorus detection for pop music

Language:Python3800

pop-music-highlighter

"Pop Music Highlighter: Marking the Emotion Keypoints", TISMIR vol. 1, no. 1

Language:PythonGPL-3.010600

TimeChat

[CVPR 2024] TimeChat: A Time-sensitive Multimodal Large Language Model for Long Video Understanding

Language:PythonBSD-3-Clause27300