Jianyi Wang (IceClear)



Company: Nanyang Technological University (NTU)

Location: Singapore

Home Page: https://iceclear.github.io

Twitter: @Iceclearwjy


Jianyi Wang's starred repositories

Deep-Live-Cam

real time face swap and one-click video deepfake with only a single image

Language: Python | License: AGPL-3.0 | Stargazers: 37562 | Issues: 224 | Issues: 474

QtScrcpy

Android real-time display control software

Language: C++ | License: Apache-2.0 | Stargazers: 18936 | Issues: 194 | Issues: 838

flux

Official inference repo for FLUX.1 models

Language: Python | License: Apache-2.0 | Stargazers: 14403 | Issues: 130 | Issues: 136

StoryDiffusion

Accepted as [NeurIPS 2024] Spotlight Presentation Paper

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 5836 | Issues: 85 | Issues: 143

llama-models

Utilities intended for use with Llama models.

Language: Python | License: NOASSERTION | Stargazers: 4294 | Issues: 57 | Issues: 84

HunyuanDiT

Hunyuan-DiT : A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding

Language: Python | License: NOASSERTION | Stargazers: 3325 | Issues: 40 | Issues: 169

Marigold

[CVPR 2024 - Oral, Best Paper Award Candidate] Marigold: Repurposing Diffusion-Based Image Generators for Monocular Depth Estimation

Language: Python | License: Apache-2.0 | Stargazers: 2255 | Issues: 41 | Issues: 95

minimind

[Large model] Train a small 26M-parameter GPT entirely from scratch in 3 hours; inference and training need as little as a 2 GB GPU!

Language: Python | License: Apache-2.0 | Stargazers: 2101 | Issues: 23 | Issues: 36

ControlNetPlus

ControlNet++: All-in-one ControlNet for image generations and editing!

Language: Python | License: Apache-2.0 | Stargazers: 1679 | Issues: 19 | Issues: 58

LlamaGen

Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation

Language: Python | License: MIT | Stargazers: 1217 | Issues: 21 | Issues: 55

Efficient-Computing

Efficient computing methods developed by Huawei Noah's Ark Lab

Language: Jupyter Notebook | Stargazers: 1185 | Issues: 24 | Issues: 127

DepthCrafter

DepthCrafter: Generating Consistent Long Depth Sequences for Open-world Videos

Language: Python | License: NOASSERTION | Stargazers: 654 | Issues: 47 | Issues: 22

PLLaVA

Official repository for the paper PLLaVA

UltraPixel

Implementation of UltraPixel: Advancing Ultra-High-Resolution Image Synthesis to New Peaks

Language: Python | License: AGPL-3.0 | Stargazers: 538 | Issues: 6 | Issues: 21

MambaIR

[ECCV 2024] An official PyTorch implementation of the paper "MambaIR: A simple baseline for image restoration with state-space model".

Language: Python | License: Apache-2.0 | Stargazers: 422 | Issues: 5 | Issues: 63

FIFO-Diffusion_public

Official implementation of FIFO-Diffusion: Generating Infinite Videos from Text without Training (NeurIPS 2024)

Phased-Consistency-Model

[NeurIPS 2024] Boosting the performance of consistency models with PCM!

Language: Python | License: Apache-2.0 | Stargazers: 342 | Issues: 20 | Issues: 19

Awesome-LLMs-meet-Multimodal-Generation

🔥🔥🔥 A curated list of papers on LLMs-based multimodal generation (image, video, 3D and audio).

NVS_Solver

Source code of paper "NVS-Solver: Video Diffusion Model as Zero-Shot Novel View Synthesizer"

Language: Python | License: Apache-2.0 | Stargazers: 244 | Issues: 14 | Issues: 26

mmdit

Implementation of a single layer of the MMDiT, proposed in Stable Diffusion 3, in Pytorch

Language: Python | License: MIT | Stargazers: 241 | Issues: 3 | Issues: 1

CV-VAE

[NeurIPS 24] CV-VAE: A Compatible Video VAE for Latent Generative Video Models

Language: Jupyter Notebook | Stargazers: 213 | Issues: 14 | Issues: 13
Language: Python | License: MIT | Stargazers: 207 | Issues: 7 | Issues: 10

Be-Your-Outpainter

[ECCV 2024] Be-Your-Outpainter https://arxiv.org/abs/2403.13745

MLLA

Official repository of MLLA (NeurIPS 2024)

rope-vit

[ECCV 2024] Official PyTorch implementation of RoPE-ViT "Rotary Position Embedding for Vision Transformer"

Language: Python | License: NOASSERTION | Stargazers: 164 | Issues: 10 | Issues: 9

aesthetic-predictor-v2-5

SigLIP-based Aesthetic Score Predictor

Language: Python | License: AGPL-3.0 | Stargazers: 128 | Issues: 1 | Issues: 7

DiffTSR

[CVPR 2024] Diffusion-based Blind Text Image Super-Resolution (Official)

Language: Python | Stargazers: 30 | Issues: 0 | Issues: 0