Vineshg's repositories
AI-Shorts-Creator
AI-Video-Cropper is a Python-based tool that leverages the power of GPT-4 (OpenAI's language model) to automatically analyze videos, extract the most interesting sections, and crop them for improved viewing experience. This project combines the capabilities of GPT-4, FFmpeg, and OpenCV to automate the process of identifying highlights in videos
ai-stories
Generate video stories with AI ✨
ai-video-generator
AI agent to automatically generate and post short videos
AI_short_video_generator
Automated short video generated using Artificial intelligence tools
blended-latent-diffusion
Official implementation for "Blended Latent Diffusion" [SIGGRAPH 2023]
chat-gpt-video-maker
An AI (LLM) based generative tool for creating TikTok/Shorts/Reels and YouTube videos automatically using OpenAI tools like gpt-4o and Dall-E 3
civitai
A repository of models, textual inversions, and more
ctrlora
Codebase for "CtrLoRA: An Extensible and Efficient Framework for Controllable Image Generation"
diffusion-nbs
Getting started with diffusion
DM-VTON
👗 DM-VTON: Distilled Mobile Real-time Virtual Try-On
FashionMatrix
Fashion Matrix is dedicated to bridging various visual and language models and continuously refining its capabilities as a comprehensive fashion AI assistant. This project will continue to update new features and optimization effects.
fast-stable-diffusion
fast-stable-diffusion + DreamBooth
Glyph-ByT5
This is an official inference code of the paper "Glyph-ByT5: A Customized Text Encoder for Accurate Visual Text Rendering"
InstructCV
Codebase for "InstructCV: Instruction-Tuned Text-to-Image Diffusion Models as Vision Generalists"
InstructEdit
Implementation of InstructEdit
IP-Adapter
The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with image prompt.
Mini-DALLE3
Mini-DALLE3: Interactive Text to Image by Prompting Large Language Models
multimodal-garment-designer
This is the official repository for the paper "Multimodal Garment Designer: Human-Centric Latent Diffusion Models for Fashion Image Editing". ICCV 2023
Pinterest-Image-Downloader
Universal browser extension for batch downloading images from sites
RB-Modulation
Official code for "RB-Modulation: Training-Free Personalization of Diffusion Models using Stochastic Optimal Control"
ScaleCrafter
Official implementation of ScaleCrafter for higher-resolution visual generation at inference time.
sd-webui-inpaint-anything
Inpaint Anything extension performs stable diffusion inpainting on a browser UI using masks from Segment Anything.
shortrocity
Generate YouTube Shorts with AI
Visual-Style-Prompting
Official Pytorch implementation of "Visual Style Prompting with Swapping Self-Attention"
VLPart
[ICCV2023] VLPart: Going Denser with Open-Vocabulary Part Segmentation