Yuechen (JulianJuaner)

JulianJuaner

Geek Repo

Company:CUHK, SmartMore

Location:Hong Kong SAR

Home Page:julianjuaner.github.io

Github PK Tool:Github PK Tool

Yuechen's starred repositories

llama

Inference code for LLaMA models

Language:PythonLicense:NOASSERTIONStargazers:50895Issues:499Issues:872

whisperX

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

Language:PythonLicense:BSD-4-ClauseStargazers:10030Issues:124Issues:643

StableCascade

Official Code for Stable Cascade

Language:Jupyter NotebookLicense:MITStargazers:6446Issues:61Issues:121

DiT

Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"

Language:PythonLicense:NOASSERTIONStargazers:5659Issues:46Issues:73

VGen

Official repo for VGen: a holistic video generation ecosystem for video generation building on diffusion models

PixArt-alpha

PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis

Language:PythonLicense:AGPL-3.0Stargazers:2541Issues:46Issues:0

co-tracker

CoTracker is a model for tracking any point (pixel) on a video.

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:2531Issues:25Issues:75

consistencydecoder

Consistency Distilled Diff VAE

Language:PythonLicense:MITStargazers:2106Issues:22Issues:19

LayerDiffuse

Transparent Image Layer Diffusion using Latent Transparency

watermark-removal

a machine learning image inpainting task that instinctively removes watermarks from image indistinguishable from the ground truth image

Latte

Latte: Latent Diffusion Transformer for Video Generation.

Language:PythonLicense:Apache-2.0Stargazers:1455Issues:28Issues:82

Hotshot-XL

✨ Hotshot-XL: State-of-the-art AI text-to-GIF model trained to work alongside Stable Diffusion XL

Language:PythonLicense:Apache-2.0Stargazers:987Issues:14Issues:43

PIA

[CVPR 2024] PIA, your Personalized Image Animator. Animate your images by text prompt, combing with Dreambooth, achieving stunning videos. PIA,你的个性化图像动画生成器,利用文本提示将图像变为奇妙的动画

Language:PythonLicense:Apache-2.0Stargazers:799Issues:19Issues:38

LaVie

LaVie: High-Quality Video Generation with Cascaded Latent Diffusion Models

Language:PythonLicense:Apache-2.0Stargazers:779Issues:26Issues:20

X-Adapter

[CVPR 2024] X-Adapter: Adding Universal Compatibility of Plugins for Upgraded Diffusion Model

Language:PythonLicense:Apache-2.0Stargazers:699Issues:44Issues:28

video2dataset

Easily create large video dataset from video urls

Language:PythonLicense:MITStargazers:498Issues:9Issues:154

distrifuser

[CVPR 2024 Highlight] DistriFusion: Distributed Parallel Inference for High-Resolution Diffusion Models

Language:PythonLicense:MITStargazers:497Issues:8Issues:16

Magic-Me

Codes for ID-Specific Video Customized Diffusion

Language:PythonLicense:Apache-2.0Stargazers:443Issues:14Issues:13

Panda-70M

[CVPR 2024] Panda-70M: Captioning 70M Videos with Multiple Cross-Modality Teachers

FreeNoise

[ICLR 2024] Code for FreeNoise based on VideoCrafter

Language:PythonLicense:Apache-2.0Stargazers:346Issues:6Issues:13

AnimateZero

Official PyTorch implementation for the paper "AnimateZero: Video Diffusion Models are Zero-Shot Image Animators"

FiT

[ICML 2024 Spotlight] FiT: Flexible Vision Transformer for Diffusion Model

watermark-removal

通过水印减除方法去掉视频中的水印,快速但不完美

VIRL

(ECCV 2024) Code for V-IRL: Grounding Virtual Intelligence in Real Life

oft

Official implementation of "Controlling Text-to-Image Diffusion by Orthogonal Finetuning".

Language:PythonLicense:MITStargazers:273Issues:17Issues:20

FreeNoise-AnimateDiff

[ICLR 2024] Code for FreeNoise based on AnimateDiff

Language:PythonLicense:Apache-2.0Stargazers:100Issues:4Issues:1

DDSM

Denoising Diffusion Step-aware Models (ICLR2024)

Language:PythonLicense:MITStargazers:48Issues:2Issues:0