Maitreya Patel (Maitreyapatel)

Location: Tempe, Arizona, USA

Home Page: maitreyapatel.com

Twitter: @patelmaitreya


Organizations
eclipse-t2i

Maitreya Patel's starred repositories

Open-Sora

Open-Sora: Democratizing Efficient Video Production for All

Language: Python, License: Apache-2.0, Stargazers: 22108, Issues: 186, Issues: 490

Open-Sora-Plan

This project aims to reproduce Sora (OpenAI's T2V model); we hope the open-source community will contribute to it.

Language: Python, License: MIT, Stargazers: 11503, Issues: 154, Issues: 344

VAR

[NeurIPS 2024 Oral][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly yet state-of-the-art* codebase for autoregressive image generation!

Language: Python, License: MIT, Stargazers: 4198, Issues: 115, Issues: 81

Vim

[ICML 2024] Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model

Language: Python, License: Apache-2.0, Stargazers: 2951, Issues: 30, Issues: 113

flash-linear-attention

Efficient implementations of state-of-the-art linear attention models in PyTorch and Triton

Language: Python, License: MIT, Stargazers: 1307, Issues: 27, Issues: 48

ELLA

ELLA: Equip Diffusion Models with LLM for Enhanced Semantic Alignment

Language: Python, License: Apache-2.0, Stargazers: 1084, Issues: 42, Issues: 47

rcg

PyTorch implementation of RCG https://arxiv.org/abs/2312.03701

Language: Python, License: MIT, Stargazers: 820, Issues: 7, Issues: 37

sdxs

Official repo of our paper "SDXS: Real-Time One-Step Latent Diffusion Models with Image Conditions"

Language: Python, License: Apache-2.0, Stargazers: 603, Issues: 24, Issues: 18

AnglE

Train and Infer Powerful Sentence Embeddings with AnglE | 🔥 SOTA on STS and MTEB Leaderboard

Language: Python, License: MIT, Stargazers: 474, Issues: 10, Issues: 49

piecewise-rectified-flow

PeRFlow: Piecewise Rectified Flow as Universal Plug-and-Play Accelerator (NeurIPS 2024)

Language: Jupyter Notebook, License: BSD-3-Clause, Stargazers: 437, Issues: 17, Issues: 11

awesome-text-to-image-studies

A collection of awesome text-to-image generation studies.

Language: TeX, License: MIT, Stargazers: 407, Issues: 12, Issues: 0

Ctrl-Adapter

Official implementation of Ctrl-Adapter: An Efficient and Versatile Framework for Adapting Diverse Controls to Any Diffusion Model

Language: Python, License: Apache-2.0, Stargazers: 388, Issues: 21, Issues: 25

MACE

[CVPR 2024] "MACE: Mass Concept Erasure in Diffusion Models" (Official Implementation)

Language: Python, License: MIT, Stargazers: 351, Issues: 2, Issues: 14

awesome-video-generation

A collection of awesome video generation studies.

Language: TeX, License: MIT, Stargazers: 325, Issues: 13, Issues: 1

LaVi-Bridge

[ECCV 2024] Bridging Different Language Models and Generative Vision Models for Text-to-Image Generation

Language: Python, License: MIT, Stargazers: 313, Issues: 16, Issues: 16

Language: Rust, License: Apache-2.0, Stargazers: 310, Issues: 33, Issues: 16

PnPInversion

[ICLR 2024] Official repo for paper "PnP Inversion: Boosting Diffusion-based Editing with 3 Lines of Code"

Language: Jupyter Notebook, Stargazers: 249, Issues: 6, Issues: 13

ml-veclip

The official repo for the paper "VeCLIP: Improving CLIP Training via Visual-enriched Captions"

Language: Jupyter Notebook, License: NOASSERTION, Stargazers: 228, Issues: 15, Issues: 0

VTimeLLM

[CVPR'2024 Highlight] Official PyTorch implementation of the paper "VTimeLLM: Empower LLM to Grasp Video Moments".

Language: Python, License: NOASSERTION, Stargazers: 218, Issues: 2, Issues: 37

StyleID

[CVPR 2024 Highlight] Style Injection in Diffusion: A Training-free Approach for Adapting Large-scale Diffusion Models for Style Transfer

Language: Python, License: MIT, Stargazers: 210, Issues: 3, Issues: 22

Awesome_Long_Form_Video_Understanding

Awesome papers & datasets specifically focused on long-term videos.

TokenCompose

(CVPR 2024) 🧩 TokenCompose: Text-to-Image Diffusion with Token-level Supervision

Language: Jupyter Notebook, License: Apache-2.0, Stargazers: 111, Issues: 3, Issues: 9

FreeStyle

FreeStyle: Free Lunch for Text-guided Style Transfer using Diffusion Models

RealCompo

[NeurIPS 2024] RealCompo: Balancing Realism and Compositionality Improves Text-to-Image Diffusion Models

SpLiCE

Sparse Linear Concept Embeddings

Language: Python, License: Apache-2.0, Stargazers: 64, Issues: 3, Issues: 4

edit-one-for-all

✏️ Edit One for All: Interactive Batch Image Editing (CVPR 2024)

DAC

Repository for the paper "Dense and Aligned Captions (DAC) Promote Compositional Reasoning in VL Models"

Language: Python, License: NOASSERTION, Stargazers: 25, Issues: 2, Issues: 1

WOUAF

WOUAF: Weight Modulation for User Attribution and Fingerprinting in Text-to-Image Diffusion Models (CVPR 2024)

Language: Jupyter Notebook, License: Apache-2.0, Stargazers: 12, Issues: 1, Issues: 2