yangbinb's repositories

aphantasia

CLIP + FFT/DWT/RGB = text to image/video

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

Ask-Anything

[CVPR2024 Highlight][VideoChatGPT] ChatGPT with video understanding! And many more supported LMs such as miniGPT4, StableLM, and MOSS.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

BoxDiff

[ICCV 2023] BoxDiff: Text-to-Image Synthesis with Training-Free Box-Constrained Diffusion

Language:PythonStargazers:0Issues:0Issues:0

CLIP_prefix_caption

Simple image captioning model

Language:Jupyter NotebookLicense:MITStargazers:0Issues:0Issues:0

CogVideo

Text-to-video generation. The repo for ICLR2023 paper "CogVideo: Large-scale Pretraining for Text-to-Video Generation via Transformers"

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

CoDeF

Official PyTorch implementation of CoDeF: Content Deformation Fields for Temporally Consistent Video Processing

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0
Language:PythonLicense:MITStargazers:0Issues:0Issues:0

ComfyUI-Marigold

Marigold depth estimation in ComfyUI

Language:PythonLicense:GPL-3.0Stargazers:0Issues:0Issues:0

CVPR23_LFDM

The pytorch implementation of our CVPR 2023 paper "Conditional Image-to-Video Generation with Latent Flow Diffusion Models"

Language:PythonLicense:BSD-2-ClauseStargazers:0Issues:0Issues:0

DirectInversion

Official repo for paper "Direct Inversion: Boosting Diffusion-based Editing with 3 Lines of Code"

Language:Jupyter NotebookStargazers:0Issues:0Issues:0

Director3D

Code for "Director3D: Real-world Camera Trajectory and 3D Scene Generation from Text".

License:NOASSERTIONStargazers:0Issues:0Issues:0
Language:PythonLicense:MITStargazers:0Issues:0Issues:0

DynamiCrafter

DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

FIFO-Diffusion_public

Official implementation of FIFO-Diffusion

Language:PythonStargazers:0Issues:0Issues:0
Language:PythonLicense:MITStargazers:0Issues:0Issues:0

mistral-src

Reference implementation of Mistral AI 7B v0.1 model.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:0Issues:0Issues:0

Omost

Your image is almost there!

License:Apache-2.0Stargazers:0Issues:0Issues:0

Open-Sora

Building your own video generation model like OpenAI's Sora

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

rich-text-to-image

Rich-Text-to-Image Generation

Language:PythonLicense:MITStargazers:0Issues:0Issues:0
Language:Jupyter NotebookLicense:MITStargazers:0Issues:0Issues:0
Language:PythonStargazers:0Issues:0Issues:0

T-Rex

T-Rex2: Towards Generic Object Detection via Text-Visual Prompt Synergy

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

Text-To-Video-Finetuning

Finetune ModelScope's Text To Video model using Diffusers 🧨

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

TrackDiffusion

Official PyTorch implementation of TrackDiffusion (https://arxiv.org/abs/2312.00651)

Stargazers:0Issues:0Issues:0

TTNet-Real-time-Analysis-System-for-Table-Tennis-Pytorch

Unofficial implementation of "TTNet: Real-time temporal and spatial video analysis of table tennis" (CVPR 2020)

Language:PythonStargazers:0Issues:0Issues:0

vector-quantize-pytorch

Vector Quantization, in Pytorch

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

Video-BLIP2-Preprocessor

A simple script that reads a directory of videos, grabs a random frame, and automatically discovers a prompt for it

Language:PythonLicense:MITStargazers:0Issues:0Issues:0
Language:JavaScriptStargazers:0Issues:0Issues:0

WaveDiff

Official Pytorch Implementation of the paper: Wavelet Diffusion Models are fast and scalable Image Generators (CVPR'23)

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

xtuner

An efficient, flexible and full-featured toolkit for fine-tuning large models (InternLM, Llama, Baichuan, Qwen, ChatGLM)

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0