There are 119 repositories under the text-to-video topic.
Implementation of Imagen, Google's Text-to-Image Neural Network, in PyTorch
VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models
This repository contains hand-curated resources for Prompt Engineering, with a focus on Generative Pre-trained Transformer (GPT) models, ChatGPT, PaLM, etc.
Diffusion model papers, survey, and taxonomy
A curated list of recent diffusion models for video generation, editing, restoration, understanding, etc.
Stable Diffusion, SDXL, LoRA Training, DreamBooth Training, Automatic1111 Web UI, DeepFakes, TTS, Voice Cloning, Animation, Text To Video, Tutorials, Guides, Lectures, Courses, ComfyUI, Google Colab, RunPod, Notebooks, ControlNet, AI News, ML News, Tech News, Kohya LoRA, Kandinsky 2, DeepFloyd IF, Midjourney
Implementation of Make-A-Video, the new SOTA text-to-video generator from Meta AI, in PyTorch
Official Pytorch Implementation for "TokenFlow: Consistent Diffusion Features for Consistent Video Editing" presenting "TokenFlow" (ICLR 2024)
[arXiv] A Survey on Video Diffusion Models
Text To Video Synthesis Colab
MagicTime: Time-lapse Video Generation Models as Metamorphic Simulators
Implementation of Video Diffusion Models, Jonathan Ho's new paper extending DDPMs to video generation, in PyTorch
An official implementation of ShareGPT4Video: Improving Video Understanding and Generation with Better Captions
✨ Hotshot-XL: State-of-the-art AI text-to-GIF model trained to work alongside Stable Diffusion XL
MotionDirector: Motion Customization of Text-to-Video Diffusion Models.
Implementation of Phenaki Video, which uses MaskGIT to produce text-guided videos of up to 2 minutes in length, in PyTorch
Finetune ModelScope's Text To Video model using Diffusers 🧨
Implementation of NÜWA, state-of-the-art attention network for text-to-video synthesis, in PyTorch
A Survey on Text-to-Video Generation/Synthesis.
Multimodal AI Story Teller, built with Stable Diffusion, GPT, and neural text-to-speech
FreeInit: Bridging Initialization Gap in Video Diffusion Models
Text-to-video generator in brainrot style. Learn about any topic from your favorite personalities.
ICASSP 2022: "Text2Video: text-driven talking-head video synthesis with phonetic dictionary".
Papers and resources on Controllable Generation using Diffusion Models, including ControlNet, DreamBooth, T2I-Adapter, IP-Adapter.
The official implementation for "Gen-L-Video: Multi-Text to Long Video Generation via Temporal Co-Denoising".
Papers and books to look at when getting started with AGI 📚
Official PyTorch implementation of TATS: A Long Video Generation Framework with Time-Agnostic VQGAN and Time-Sensitive Transformer (ECCV 2022)
Paddle Multimodal Integration and eXploration, supporting mainstream multimodal tasks, including end-to-end large-scale multimodal pretrained models and a diffusion model toolbox. Equipped with high performance and flexibility.
Official implementation code of LAMP: Learn a Motion Pattern by Few-Shot Tuning a Text-to-Image Diffusion Model (few-shot text-to-video diffusion)
Implementation of Lumiere, SOTA text-to-video generation from Google DeepMind, in PyTorch
🎥 Create YouTube videos from a text prompt in seconds
🔥🔥🔥 A curated list of papers on LLMs-based multimodal generation (image, video, 3D and audio).
[CVPR 2022] Show Me What and Tell Me How: Video Synthesis via Multimodal Conditioning