IceClear

Jianyi Wang's starred repositories

Deep-Live-Cam

real time face swap and one-click video deepfake with only a single image

Language:PythonAGPL-3.037562 224 474

QtScrcpy

Android real-time display control software

Language:C++Apache-2.018936 194 838

flux

Official inference repo for FLUX.1 models

Language:PythonApache-2.014403 130 136

StoryDiffusion

Accepted as [NeurIPS 2024] Spotlight Presentation Paper

Language:Jupyter NotebookApache-2.05836 85 143

llama-models

Utilities intended for use with Llama models.

Language:PythonNOASSERTION4294 57 84

HunyuanDiT

Hunyuan-DiT : A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding

Language:PythonNOASSERTION3325 40 169

Marigold

[CVPR 2024 - Oral, Best Paper Award Candidate] Marigold: Repurposing Diffusion-Based Image Generators for Monocular Depth Estimation

Language:PythonApache-2.02255 41 95

minimind

【大模型】3小时完全从0训练一个仅有26M的小参数GPT，最低仅需2G显卡即可推理训练！

Language:PythonApache-2.02101 23 36

ControlNetPlus

ControlNet++: All-in-one ControlNet for image generations and editing!

Language:PythonApache-2.01679 19 58

LlamaGen

Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation

Language:PythonMIT1217 21 55

Efficient-Computing

Efficient computing methods developed by Huawei Noah's Ark Lab

Language:Jupyter Notebook1185 24 127

DepthCrafter

DepthCrafter: Generating Consistent Long Depth Sequences for Open-world Videos

Language:PythonNOASSERTION654 47 22

PLLaVA

Official repository for the paper PLLaVA

Language:Python568 13 76

UltraPixel

Implementation of UltraPixel: Advancing Ultra-High-Resolution Image Synthesis to New Peaks

Language:PythonAGPL-3.0538 6 21

MambaIR

[ECCV2024] An official pytorch implement of the paper "MambaIR: A simple baseline for image restoration with state-space model".

Language:PythonApache-2.0422 5 63

FIFO-Diffusion_public

Official implementation of FIFO-Diffusion: Generating Infinite Videos from Text without Training (NeurIPS 2024)

Language:Python350 12 28

Phased-Consistency-Model

[NeurIPS 2024] Boosting the performance of consistency models with PCM!

Language:PythonApache-2.0342 20 19

Awesome-LLMs-meet-Multimodal-Generation

🔥🔥🔥 A curated list of papers on LLMs-based multimodal generation (image, video, 3D and audio).

Language:HTML312 11 4

NVS_Solver

Source code of paper "NVS-Solver: Video Diffusion Model as Zero-Shot Novel View Synthesizer"

Language:PythonApache-2.0244 14 26

mmdit

Implementation of a single layer of the MMDiT, proposed in Stable Diffusion 3, in Pytorch

Language:PythonMIT241 3 1

CV-VAE

[NeurIPS 24] CV-VAE: A Compatible Video VAE for Latent Generative Video Models

Language:Jupyter Notebook213 14 13

genwarp

Language:PythonMIT207 7 10

Be-Your-Outpainter

[ECCV 2024] Be-Your-Outpainter https://arxiv.org/abs/2403.13745

Language:Python204 12 12

MLLA

Official repository of MLLA (NeurIPS 2024)

Language:Python179 3 27

rope-vit

[ECCV 2024] Official PyTorch implementation of RoPE-ViT "Rotary Position Embedding for Vision Transformer"

Language:PythonNOASSERTION164 10 9

aesthetic-predictor-v2-5

SigLIP-based Aesthetic Score Predictor

Language:PythonAGPL-3.0128 1 7

DiffTSR

[CVPR2024] Diffusion-based Blind Text Image Super-Resolution (Official)

Language:Python67 5 10

VidGen

Language:Python53 2 6

LAR-IQA

Language:Python3000

MaskInversion

Language:Python17 1 2