Yiyun Chen (yiyunchen)

yiyunchen

Geek Repo

Location:Shengzhen, China

Github PK Tool:Github PK Tool

Yiyun Chen's starred repositories

MIGC

[CVPR 2024 Highlight] "MIGC: Multi-Instance Generation Controller for Text-to-Image Synthesis" (Official Implementation)

Language:PythonLicense:NOASSERTIONStargazers:517Issues:0Issues:0

diffusers

🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.

Language:PythonLicense:Apache-2.0Stargazers:25295Issues:0Issues:0

HI-Diff

PyTorch code for our NeurIPS 2023 paper "Hierarchical Integration Diffusion Model for Realistic Image Deblurring"

Language:PythonLicense:Apache-2.0Stargazers:159Issues:0Issues:0

nerf

Code release for NeRF (Neural Radiance Fields)

Language:Jupyter NotebookLicense:MITStargazers:9822Issues:0Issues:0

MGLD-VSR

Code for ECCV 2024 Paper "Motion-Guided Latent Diffusion for Temporally Consistent Real-world Video Super-resolution"

Language:PythonLicense:NOASSERTIONStargazers:86Issues:0Issues:0

SCEdit

Official repo: SCEdit: Efficient and Controllable Image Diffusion Generation via Skip Connection Editing

Stargazers:46Issues:0Issues:0

BSSTNet

Implementation of "Blur-aware Spatio-temporal Sparse Transformer for Video Deblurring". (Zhang et al., CVPR 2024)

Language:PythonStargazers:21Issues:0Issues:0

U-DiT

[NeurIPS 2024] The official code of "U-DiTs: Downsample Tokens in U-Shaped Diffusion Transformers"

Language:PythonLicense:NOASSERTIONStargazers:69Issues:0Issues:0

PySceneDetect

:movie_camera: Python and OpenCV-based scene cut/transition detection program & library.

Language:PythonLicense:BSD-3-ClauseStargazers:3147Issues:0Issues:0

Open-Sora

Open-Sora: Democratizing Efficient Video Production for All

Language:PythonLicense:Apache-2.0Stargazers:21720Issues:0Issues:0

EasyAnimate

📺 An End-to-End Solution for High-Resolution and Long Video Generation Based on Transformer Diffusion

Language:PythonLicense:Apache-2.0Stargazers:1192Issues:0Issues:0

Awesome-Deblurring

A curated list of resources for Image and Video Deblurring

Stargazers:2397Issues:0Issues:0

CFDVSR

Collaborative Feedback Discriminative Propagation for Video Super-Resolution

Stargazers:39Issues:0Issues:0

VFRxBenchmark

[NTIRE2024] official code for "Towards Real-world Video Face Restoration: A New Benchmark"

Language:PythonLicense:NOASSERTIONStargazers:18Issues:0Issues:0

Edit-Your-Motion

The code of Edit-Your-Motion

License:Apache-2.0Stargazers:11Issues:0Issues:0

Make-An-Audio

PyTorch Implementation of Make-An-Audio (ICML'23) with a Text-to-Audio Generative Model

Language:PythonLicense:MITStargazers:739Issues:0Issues:0

demucs

Code for the paper Hybrid Spectrogram and Waveform Source Separation

Language:PythonLicense:MITStargazers:8176Issues:0Issues:0

asteroid

The PyTorch-based audio source separation toolkit for researchers

Language:PythonLicense:MITStargazers:2229Issues:0Issues:0
Language:PythonLicense:NOASSERTIONStargazers:1257Issues:0Issues:0

pydct

Short-Time Discrete Cosine Transform (DCT) for Python. SciPy, TensorFlow and PyTorch implementations.

Language:Jupyter NotebookLicense:ISCStargazers:27Issues:0Issues:0
Stargazers:127Issues:0Issues:0

MP-SENet

Explicit Estimation of Magnitude and Phase Spectra in Parallel for High-Quality Speech Enhancement

Language:PythonLicense:MITStargazers:290Issues:0Issues:0
Language:PythonStargazers:52Issues:0Issues:0

PGTFormer

[IJCAI'24] Beyond Alignment: Blind Video Face Restoration via Parsing-Guided Temporal-Coherent Transformer

Language:PythonLicense:NOASSERTIONStargazers:170Issues:0Issues:0

SceneSegmentation-SCRL

Code for CVPR 2022 paper "Scene Consistency Representation Learning for Video Scene Segmentation"

Language:PythonLicense:NOASSERTIONStargazers:87Issues:0Issues:0
Language:PythonStargazers:15Issues:0Issues:0

DeepChorus

An end-to-end chorus detection model DeepChorus.

Language:PythonStargazers:30Issues:0Issues:0

chorus-from-music-structure

chorus detection for pop music

Language:PythonStargazers:38Issues:0Issues:0

pop-music-highlighter

"Pop Music Highlighter: Marking the Emotion Keypoints", TISMIR vol. 1, no. 1

Language:PythonLicense:GPL-3.0Stargazers:106Issues:0Issues:0

TimeChat

[CVPR 2024] TimeChat: A Time-sensitive Multimodal Large Language Model for Long Video Understanding

Language:PythonLicense:BSD-3-ClauseStargazers:273Issues:0Issues:0