Tianhao-Qi

Tianhao-Qi's starred repositories

Pyramid-Flow

Code of Pyramidal Flow Matching for Efficient Video Generative Modeling

Language:PythonMIT31500

movie-gen

An open source community implementation of the model from the paper: "Movie Gen: A Cast of Media Foundation Models". Join our community to help implement this model!

Language:PythonMIT3900

CogVideoX-Fun

📹 A more flexible CogVideoX that can generate videos at any resolution and creates videos from images.

Language:PythonApache-2.034400

Vchitect-2.0

Vchitect-2.0: Parallel Transformer for Scaling Up Video Diffusion Models

Language:PythonApache-2.060800

Qwen2-VL

Qwen2-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Language:PythonApache-2.0253300

MoMA

MoMA: Multimodal LLM Adapter for Fast Personalized Image Generation

Language:Jupyter Notebook18600

PixArt-alpha

PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis

Language:PythonAGPL-3.0272900

CogVideo

text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)

Language:PythonApache-2.0795000

Kolors

Kolors Team

Language:PythonApache-2.0368700

I2V-Adapter-repo

I2V-Adapter: A General Image-to-Video Adapter for Video Diffusion Models

19900

VBench

[CVPR2024 Highlight] VBench - We Evaluate Video Generation

Language:PythonApache-2.051300

SEED-Story

SEED-Story: Multimodal Long Story Generation with Large Language Model

Language:PythonNOASSERTION72200

Video Diffusion Alignment via Reward Gradients. We improve a variety of video diffusion models such as VideoCrafter, OpenSora, ModelScope and StableVideoDiffusion by finetuning them using various reward models such as HPS, PickScore, VideoMAE, VJEPA, YOLO, Aesthetics etc.

Language:Python20000

InfEdit

[CVPR 2024] Official implementation of CVPR 2024 paper: "Inversion-Free Image Editing with Natural Language"

Language:PythonNOASSERTION27000

talc

Language:PythonMIT2100

Portrait-Mode-Video

Video dataset dedicated to portrait-mode video recognition.

Language:Python3500

ComfyUI_LayerStyle

A set of nodes for ComfyUI that can composite layer and mask to achieve Photoshop like functionality.

Language:PythonMIT126400

gpt_academic

为GPT/GLM等LLM大语言模型提供实用化交互接口，特别优化论文阅读/润色/写作体验，模块化设计，支持自定义快捷按钮&函数插件，支持Python和C++等项目剖析&自译解功能，PDF/LaTex论文翻译&总结功能，支持并行问询多种LLM模型，支持chatglm3等本地模型。接入通义千问, deepseekcoder, 讯飞星火, 文心一言, llama2, rwkv, claude2, moss等。

Language:PythonGPL-3.06467500