Moein Heidari's starred repositories
consistency_models
Official repo for consistency models.
x-transformers
A simple but complete full-attention transformer with a set of promising experimental features from various papers
YOLO-World
[CVPR 2024] Real-Time Open-Vocabulary Object Detection
Dreambooth-Stable-Diffusion
Implementation of Dreambooth (https://arxiv.org/abs/2208.12242) by way of Textual Inversion (https://arxiv.org/abs/2208.01618) for Stable Diffusion (https://arxiv.org/abs/2112.10752). Tweaks focused on training faces, objects, and styles.
efficientvit
EfficientViT is a new family of vision models for efficient high-resolution vision.
magvit2-pytorch
Implementation of MagViT2 Tokenizer in Pytorch
AITreasureBox
🤖 Collect practical AI repos, tools, websites, papers and tutorials on AI. 实用的AI百宝箱 💎
flash-diffusion
Official implementation of ⚡ Flash Diffusion ⚡: Accelerating Any Conditional Diffusion Model for Few Steps Image Generation
video-generation-survey
A reading list of video generation
VideoBooth
[CVPR2024] VideoBooth: Diffusion-based Video Generation with Image Prompts
Segment-Anything-CLIP
Connecting segment-anything's output masks with the CLIP model; Awesome-Segment-Anything-Works
llmblueprint
[ICLR 2024] Official code for the paper "LLM Blueprint: Enabling Text-to-Image Generation with Complex and Detailed Prompts"
saros-dataset
Sparsely Annotated Region and Organ Segmentation (SAROS) - A large, heterogeneous, and sparsely annotated segmentation dataset on CT imaging data
advdiffuser
AdvDiffuser: Natural Adversarial Example Synthesis with Diffusion Models (ICCV 2021)