Tianhao-Qi's starred repositories

Tora

Official repo for paper "Tora: Trajectory-oriented Diffusion Transformer for Video Generation"

Stargazers:15Issues:0Issues:0

Kolors

Kolors Team

Language:PythonLicense:Apache-2.0Stargazers:2852Issues:0Issues:0

I2V-Adapter-repo

I2V-Adapter: A General Image-to-Video Adapter for Video Diffusion Models

Stargazers:183Issues:0Issues:0

VBench

[CVPR2024 Highlight] VBench - We Evaluate Video Generation

Language:PythonLicense:Apache-2.0Stargazers:423Issues:0Issues:0

SEED-Story

SEED-Story: Multimodal Long Story Generation with Large Language Model

Language:PythonLicense:NOASSERTIONStargazers:617Issues:0Issues:0

VADER

Video Diffusion Alignment via Reward Gradients. We improve a variety of video diffusion models such as VideoCrafter, OpenSora, ModelScope and StableVideoDiffusion by finetuning them using various reward models such as HPS, PickScore, VideoMAE, VJEPA, YOLO, Aesthetics etc.

Language:PythonStargazers:153Issues:0Issues:0

InfEdit

[CVPR 2024] Official implementation of CVPR 2024 paper: "Inversion-Free Image Editing with Natural Language"

Language:PythonLicense:NOASSERTIONStargazers:248Issues:0Issues:0
Language:PythonLicense:MITStargazers:20Issues:0Issues:0

Portrait-Mode-Video

Video dataset dedicated to portrait-mode video recognition.

Language:PythonStargazers:31Issues:0Issues:0

ComfyUI_LayerStyle

A set of nodes for ComfyUI that can composite layer and mask to achieve Photoshop like functionality.

Language:PythonLicense:MITStargazers:820Issues:0Issues:0

gpt_academic

为GPT/GLM等LLM大语言模型提供实用化交互接口,特别优化论文阅读/润色/写作体验,模块化设计,支持自定义快捷按钮&函数插件,支持Python和C++等项目剖析&自译解功能,PDF/LaTex论文翻译&总结功能,支持并行问询多种LLM模型,支持chatglm3等本地模型。接入通义千问, deepseekcoder, 讯飞星火, 文心一言, llama2, rwkv, claude2, moss等。

Language:PythonLicense:GPL-3.0Stargazers:62868Issues:0Issues:0

CV-VAE

CV-VAE: A Compatible Video VAE for Latent Generative Video Models

Language:Jupyter NotebookStargazers:187Issues:0Issues:0
Language:PythonStargazers:1410Issues:0Issues:0

RPG-DiffusionMaster

[ICML 2024] Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (PRG)

Language:Jupyter NotebookStargazers:1616Issues:0Issues:0

Awesome-Animation-Research

Papers, datasets, and resources related to 2D cartoon video research. Contributions welcome.

License:MITStargazers:60Issues:0Issues:0

V-Express

V-Express aims to generate a talking head video under the control of a reference image, an audio, and a sequence of V-Kps images.

Language:PythonStargazers:2107Issues:0Issues:0

Omost

Your image is almost there!

Language:PythonLicense:Apache-2.0Stargazers:6994Issues:0Issues:0

DiT-Visualization

Visualization of DiT self attention features

Language:PythonStargazers:94Issues:0Issues:0

sglang

SGLang is yet another fast serving framework for large language models and vision language models.

Language:PythonLicense:Apache-2.0Stargazers:3693Issues:0Issues:0

sdxl_prompt_styler

Custom prompt styler node for SDXL in ComfyUI

Language:PythonLicense:MITStargazers:673Issues:0Issues:0

VGDiffZero

[ICASSP 2024] VGDiffZero: Text-to-image Diffusion Models Can Be Zero-shot Visual Grounders

Language:PythonStargazers:8Issues:0Issues:0

360DVD

[CVPR2024] 360DVD: Controllable Panorama Video Generation with 360-Degree Video Diffusion Model

Language:PythonStargazers:93Issues:0Issues:0

HandyFigure

HandyFigure provides the sources file (ususally PPT files) for paper figures

Language:JavaScriptLicense:MITStargazers:152Issues:0Issues:0

SmartEdit

Official code of SmartEdit [CVPR-2024 Highlight]

Language:PythonStargazers:204Issues:0Issues:0

SSM

[IJCAI-2024] The official code of Self-Supervised Pre-training with Symmetric Superimposition Modeling for Scene Text Recognition

Stargazers:8Issues:0Issues:0

HunyuanDiT

Hunyuan-DiT : A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding

Language:PythonLicense:NOASSERTIONStargazers:2965Issues:0Issues:0

FreeInit

[ECCV 2024] FreeInit: Bridging Initialization Gap in Video Diffusion Models

Language:PythonLicense:MITStargazers:462Issues:0Issues:0

StoryDiffusion

Create Magic Story!

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:5599Issues:0Issues:0

MoneyPrinterTurbo

利用AI大模型,一键生成高清短视频 Generate short videos with one click using AI LLM.

Language:PythonLicense:MITStargazers:15379Issues:0Issues:0

DisenDiff

[CVPR`2024, Oral] Attention Calibration for Disentangled Text-to-Image Personalization

Language:PythonLicense:MITStargazers:74Issues:0Issues:0