Penalty_kl (yuanpengtu)

yuanpengtu

Geek Repo

Company:The University of Hong Kong

Location:上海

Home Page:yuanpengtu.github.io

Github PK Tool:Github PK Tool

Penalty_kl's starred repositories

Open-Sora-Plan

This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.

Language:PythonLicense:Apache-2.0Stargazers:10883Issues:0Issues:0

EMO

Emote Portrait Alive: Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions

Stargazers:7178Issues:0Issues:0

Latte

Latte: Latent Diffusion Transformer for Video Generation.

Language:PythonLicense:Apache-2.0Stargazers:1447Issues:0Issues:0

Vim

[ICML 2024] Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model

Language:PythonLicense:Apache-2.0Stargazers:2566Issues:0Issues:0

DiS

Scalable Diffusion Models with State Space Backbone

Language:PythonLicense:NOASSERTIONStargazers:142Issues:0Issues:0

Neural-Network-Parameter-Diffusion

We introduce a novel approach for parameter generation, named neural network parameter diffusion (p-diff), which employs a standard latent diffusion model to synthesize a new set of parameters

Language:PythonStargazers:788Issues:0Issues:0

PixArt-alpha

PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis

Language:PythonLicense:AGPL-3.0Stargazers:2526Issues:0Issues:0

diffusers

🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.

Language:PythonLicense:Apache-2.0Stargazers:23952Issues:0Issues:0

gemma_pytorch

The official PyTorch implementation of Google's Gemma models

Language:PythonLicense:Apache-2.0Stargazers:5151Issues:0Issues:0

DiffiT

[ECCV 2024] Official Repository for DiffiT: Diffusion Vision Transformers for Image Generation

Stargazers:380Issues:0Issues:0

FiT

[ICML 2024 Spotlight] FiT: Flexible Vision Transformer for Diffusion Model

License:Apache-2.0Stargazers:341Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:6993Issues:0Issues:0

OpenSORA

A public repository for reproducing a open source sora comparable video generation model

Stargazers:9Issues:0Issues:0

DiT

Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"

Language:PythonLicense:NOASSERTIONStargazers:5629Issues:0Issues:0

Cleaned-Webvid

Use strategy to achieve clean webvid-10m dataset

Language:PythonStargazers:3Issues:0Issues:0

mergekit

Tools for merging pretrained large language models.

Language:PythonLicense:LGPL-3.0Stargazers:4042Issues:0Issues:0

lumiere-pytorch

Implementation of Lumiere, SOTA text-to-video generation from Google Deepmind, in Pytorch

Language:PythonLicense:MITStargazers:227Issues:0Issues:0

VIRL

Code for V-IRL: Grounding Virtual Intelligence in Real Life

Language:PythonStargazers:289Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:387Issues:0Issues:0

Video-Motion-Customization

VMC: Video Motion Customization using Temporal Attention Adaption for Text-to-Video Diffusion Models (CVPR 2024)

Language:PythonLicense:Apache-2.0Stargazers:142Issues:0Issues:0

4DGen

"4DGen: Grounded 4D Content Generation with Spatial-temporal Consistency", Yuyang Yin*, Dejia Xu*, Zhangyang Wang, Yao Zhao, Yunchao Wei

Language:PythonStargazers:201Issues:0Issues:0

MetaGPT

🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming

Language:PythonLicense:MITStargazers:41459Issues:0Issues:0

mamba

Mamba SSM architecture

Language:PythonLicense:Apache-2.0Stargazers:11528Issues:0Issues:0

custom-diffusion

Custom Diffusion: Multi-Concept Customization of Text-to-Image Diffusion (CVPR 2023)

Language:PythonLicense:NOASSERTIONStargazers:1810Issues:0Issues:0

llmblueprint

[ICLR 2024] Official code for the paper "LLM Blueprint: Enabling Text-to-Image Generation with Complex and Detailed Prompts"

Language:Jupyter NotebookStargazers:60Issues:0Issues:0

Awesome-4D-Generation

An organized list of academic papers focused on the topic of 4D Generation. If you have any additions or suggestions, feel free to contribute.

Stargazers:48Issues:0Issues:0

4dfy

4D-fy: Text-to-4D Generation Using Hybrid Score Distillation Sampling

Language:PythonLicense:Apache-2.0Stargazers:292Issues:0Issues:0

Free-Form-Video-Inpainting

Official Pytorch implementation of "Learnable Gated Temporal Shift Module for Deep Video Inpainting. Chang et al. BMVC 2019." and the FVI dataset in "Free-form Video Inpainting with 3D Gated Convolution and Temporal PatchGAN, Chang et al. ICCV 2019"

Language:PythonStargazers:331Issues:0Issues:0

Depth-Anything

[CVPR 2024] Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data. Foundation Model for Monocular Depth Estimation

Language:PythonLicense:Apache-2.0Stargazers:6380Issues:0Issues:0

ProPainter

[ICCV 2023] ProPainter: Improving Propagation and Transformer for Video Inpainting

Language:PythonLicense:NOASSERTIONStargazers:4907Issues:0Issues:0