Maitreya Patel (Maitreyapatel)

Location: Tempe, Arizona, USA

Home Page: maitreyapatel.com

Twitter: @patelmaitreya

Organizations
eclipse-t2i

Maitreya Patel's starred repositories

fiftyone

The open-source tool for building high-quality datasets and computer vision models

Language: Python | License: Apache-2.0 | Stargazers: 8010 | Issues: 55 | Issues: 1494

VAR

[GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly yet state-of-the-art* codebase for autoregressive image generation!

Language: Python | License: MIT | Stargazers: 3937 | Issues: 114 | Issues: 77

Vim

[ICML 2024] Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model

Language: Python | License: Apache-2.0 | Stargazers: 2728 | Issues: 30 | Issues: 106

flash-linear-attention

Efficient implementations of state-of-the-art linear attention models in Pytorch and Triton

Language: Python | License: MIT | Stargazers: 1075 | Issues: 21 | Issues: 31

rcg

PyTorch implementation of RCG https://arxiv.org/abs/2312.03701

Language: Python | License: MIT | Stargazers: 784 | Issues: 7 | Issues: 33

sdxs

Official repo of our paper "SDXS: Real-Time One-Step Latent Diffusion Models with Image Conditions"

Language: Python | License: Apache-2.0 | Stargazers: 579 | Issues: 26 | Issues: 16

AnglE

Train and Infer Powerful Sentence Embeddings with AnglE | 🔥 SOTA on STS and MTEB Leaderboard

Language: Python | License: MIT | Stargazers: 431 | Issues: 11 | Issues: 46

Ctrl-Adapter

Official implementation of Ctrl-Adapter: An Efficient and Versatile Framework for Adapting Diverse Controls to Any Diffusion Model

Language: Python | License: Apache-2.0 | Stargazers: 364 | Issues: 22 | Issues: 22

minRF

Minimal implementation of scalable rectified flow transformers, based on SD3's approach

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 353 | Issues: 6 | Issues: 9

Video-MME

✨✨Video-MME: The First-Ever Comprehensive Evaluation Benchmark of Multi-modal LLMs in Video Analysis

FIFO-Diffusion_public

Official implementation of FIFO-Diffusion: Generating Infinite Videos from Text without Training

LaVi-Bridge

[ECCV 2024] Bridging Different Language Models and Generative Vision Models for Text-to-Image Generation

Language: Python | License: MIT | Stargazers: 297 | Issues: 16 | Issues: 16

MACE

[CVPR 2024] "MACE: Mass Concept Erasure in Diffusion Models" (Official Implementation)

Language: Python | License: MIT | Stargazers: 287 | Issues: 2 | Issues: 11

Language: Rust | License: Apache-2.0 | Stargazers: 276 | Issues: 33 | Issues: 15

awesome-video-generation

A collection of awesome video generation studies.

Language: TeX | License: MIT | Stargazers: 215 | Issues: 9 | Issues: 0

VTimeLLM

[CVPR'2024 Highlight] Official PyTorch implementation of the paper "VTimeLLM: Empower LLM to Grasp Video Moments".

Language: Python | License: NOASSERTION | Stargazers: 194 | Issues: 2 | Issues: 30

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 178 | Issues: 10 | Issues: 24

StyleID

[CVPR 2024 Highlight] Style Injection in Diffusion: A Training-free Approach for Adapting Large-scale Diffusion Models for Style Transfer

Language: Python | License: MIT | Stargazers: 153 | Issues: 3 | Issues: 12

d3po

[CVPR 2024] Code for the paper "Using Human Feedback to Fine-tune Diffusion Models without Any Reward Model"

Language: Python | License: MIT | Stargazers: 149 | Issues: 7 | Issues: 13

Awesome_Long_Form_Video_Understanding

Awesome papers & datasets specifically focused on long-term videos.

FouriScale

Official implementation of FouriScale (ECCV2024)

Language: Python | License: Apache-2.0 | Stargazers: 127 | Issues: 11 | Issues: 7

TokenCompose

(CVPR 2024) 🧩 TokenCompose: Text-to-Image Diffusion with Token-level Supervision

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 104 | Issues: 3 | Issues: 9

FreeStyle

FreeStyle: Free Lunch for Text-guided Style Transfer using Diffusion Models

Language: Python | License: Apache-2.0 | Stargazers: 58 | Issues: 6 | Issues: 9

edit-one-for-all

✏️ Edit One for All: Interactive Batch Image Editing (CVPR 2024)

SpLiCE

Sparse Linear Concept Embeddings

Language: Python | License: Apache-2.0 | Stargazers: 43 | Issues: 3 | Issues: 4

DAC

Repository for the paper "Dense and Aligned Captions (DAC) Promote Compositional Reasoning in VL Models"

Language: Python | License: NOASSERTION | Stargazers: 23 | Issues: 2 | Issues: 1

ID-Preserving-Facial-Aging

Identity-Preserving Aging of Face Images via Latent Diffusion Models [IJCB 2023]

Language: Jupyter Notebook | License: MIT | Stargazers: 17 | Issues: 2 | Issues: 4

WOUAF

WOUAF: Weight Modulation for User Attribution and Fingerprinting in Text-to-Image Diffusion Models (CVPR 2024)

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 10 | Issues: 1 | Issues: 0