
yangbinb's repositories

aphantasia

CLIP + FFT/DWT/RGB = text to image/video

Language: Python | License: MIT

Ask-Anything

[CVPR2024 Highlight][VideoChatGPT] ChatGPT with video understanding! And many more supported LMs such as miniGPT4, StableLM, and MOSS.

Language: Python | License: MIT

BoxDiff

[ICCV 2023] BoxDiff: Text-to-Image Synthesis with Training-Free Box-Constrained Diffusion

Language: Python

CLIP_prefix_caption

Simple image captioning model

Language: Jupyter Notebook | License: MIT

CogVideo

Text-to-video generation. The repo for ICLR2023 paper "CogVideo: Large-scale Pretraining for Text-to-Video Generation via Transformers"

Language: Python | License: Apache-2.0

CoDeF

Official PyTorch implementation of CoDeF: Content Deformation Fields for Temporally Consistent Video Processing

Language: Python | License: NOASSERTION

ComfyUI-DragNUWA

Language: Python | License: MIT

ComfyUI-Marigold

Marigold depth estimation in ComfyUI

Language: Python | License: GPL-3.0

CVPR23_LFDM

The pytorch implementation of our CVPR 2023 paper "Conditional Image-to-Video Generation with Latent Flow Diffusion Models"

Language: Python | License: BSD-2-Clause

DirectInversion

Official repo for paper "Direct Inversion: Boosting Diffusion-based Editing with 3 Lines of Code"

Language: Jupyter Notebook

Director3D

Code for "Director3D: Real-world Camera Trajectory and 3D Scene Generation from Text".

License: NOASSERTION

DragNUWA

Language: Python | License: MIT

DynamiCrafter

DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors

Language: Python | License: Apache-2.0

FIFO-Diffusion_public

Official implementation of FIFO-Diffusion

Language: Python

lorahub

Language: Python | License: MIT

mistral-src

Reference implementation of Mistral AI 7B v0.1 model.

Language: Jupyter Notebook | License: Apache-2.0

Omost

Your image is almost there!

License: Apache-2.0

Open-Sora

Building your own video generation model like OpenAI's Sora

Language: Python | License: Apache-2.0

rich-text-to-image

Rich-Text-to-Image Generation

Language: Python | License: MIT

stable-diffusion

Language: Jupyter Notebook | License: MIT

svd-temporal-controlnet

Language: Python

T-Rex

T-Rex2: Towards Generic Object Detection via Text-Visual Prompt Synergy

Language: Python | License: NOASSERTION

Text-To-Video-Finetuning

Finetune ModelScope's Text To Video model using Diffusers 🧨

Language: Python | License: MIT

TrackDiffusion

Official PyTorch implementation of TrackDiffusion (https://arxiv.org/abs/2312.00651)


TTNet-Real-time-Analysis-System-for-Table-Tennis-Pytorch

Unofficial implementation of "TTNet: Real-time temporal and spatial video analysis of table tennis" (CVPR 2020)

Language: Python

vector-quantize-pytorch

Vector Quantization, in Pytorch

Language: Python | License: MIT

Video-BLIP2-Preprocessor

A simple script that reads a directory of videos, grabs a random frame, and automatically discovers a prompt for it

Language: Python | License: MIT

vidmaestro.github.io

Language: JavaScript

WaveDiff

Official Pytorch Implementation of the paper: Wavelet Diffusion Models are fast and scalable Image Generators (CVPR'23)

Language: Python | License: Apache-2.0

xtuner

An efficient, flexible and full-featured toolkit for fine-tuning large models (InternLM, Llama, Baichuan, Qwen, ChatGLM)

Language: Python | License: Apache-2.0