moeinheidari7829

Moein Heidari's starred repositories

pykan

Kolmogorov Arnold Networks

Language:Jupyter NotebookMIT13589 115 230

yolov10

YOLOv10: Real-Time End-to-End Object Detection

Language:PythonAGPL-3.08000 39 269

consistency_models

Official repo for consistency models.

Language:PythonMIT6022 60 51

x-transformers

A simple but complete full-attention transformer with a set of promising experimental features from various papers

Language:PythonMIT4343 52 199

YOLO-World

[CVPR 2024] Real-Time Open-Vocabulary Object Detection

Language:PythonGPL-3.03897 37 360

Implementation of Dreambooth (https://arxiv.org/abs/2208.12242) by way of Textual Inversion (https://arxiv.org/abs/2208.01618) for Stable Diffusion (https://arxiv.org/abs/2112.10752). Tweaks focused on training faces, objects, and styles.

Language:Jupyter NotebookMIT3181 39 107

Vim

[ICML 2024] Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model

Language:PythonApache-2.02555 32 91

torchio

Medical imaging toolkit for deep learning

Language:PythonApache-2.01985 18 453

MambaOut

MambaOut: Do We Really Need Mamba for Vision?

Language:PythonApache-2.01865 6 240

efficientvit

EfficientViT is a new family of vision models for efficient high-resolution vision.

Language:PythonApache-2.01588 33 114

magvit2-pytorch

Implementation of MagViT2 Tokenizer in Pytorch

Language:PythonMIT479 30 33

AITreasureBox

🤖 Collect practical AI repos, tools, websites, papers and tutorials on AI. 实用的AI百宝箱 💎

Language:RubyGPL-3.0417 16 2

flash-diffusion

Official implementation of ⚡ Flash Diffusion ⚡: Accelerating Any Conditional Diffusion Model for Few Steps Image Generation

Language:PythonNOASSERTION323 8 9

video-generation-survey

A reading list of video generation

297 26 1

GiT

🔥 [ECCV2024] Official Implementation of "GiT: Towards Generalist Vision Transformer through Universal Language Interface"

Language:PythonApache-2.0236 6 7

pycon2024

Tutorial Materials for "The Fundamentals of Modern Deep Learning with PyTorch" workshop at PyCon 2024

Language:Jupyter NotebookMIT212 110

VideoBooth

[CVPR2024] VideoBooth: Diffusion-based Video Generation with Image Prompts

Language:Python209 22 8

minRF

Minimal implementation of scalable rectified flow transformers, based on SD3's approach

Language:Jupyter NotebookApache-2.0195 3 4

gflownet

GFlowNet library specialized for graph & molecular data

Language:PythonMIT175 38 22

Segment-Anything-CLIP

Connecting segment-anything's output masks with the CLIP model; Awesome-Segment-Anything-Works

Language:Jupyter NotebookApache-2.0158 4 3

t2v-turbo

Code repository for T2V-Turbo

Language:Python128 2 10

CT-CLIP

A foundation model utilizing chest CT volumes and radiology reports for supervised-level zero-shot detection of abnormalities

Language:Python124 2 14

OIR

[ICLR 2024] Official PyTorch/Diffusers implementation of "Object-aware Inversion and Reassembly for Image Editing"

Language:Python68 9 1

llmblueprint

[ICLR 2024] Official code for the paper "LLM Blueprint: Enabling Text-to-Image Generation with Complex and Detailed Prompts"

Language:Jupyter Notebook60 2 4

FlowIE

This repository contains the official implementation of "FlowIE: Efficient Image Enhancement via Rectified Flow"

Language:PythonMIT45 2 5

saros-dataset

Sparsely Annotated Region and Organ Segmentation (SAROS) - A large, heterogeneous, and sparsely annotated segmentation dataset on CT imaging data

Language:PythonMIT2600

LHUNet

LHU-Net: A Light Hybrid U-Net for Cost-efficient, High-performance Volumetric Medical Image Segmentation

Language:PythonApache-2.017 20

advdiffuser

AdvDiffuser: Natural Adversarial Example Synthesis with Diffusion Models (ICCV 2021)

MIT1100

X-Diffusion

Language:Jupyter NotebookMIT600

MEDDAP

Official implementation of "MEDDAP: Medical Dataset Enhancement via Diversified Augmentation Pipeline"

Language:Python200