Minxing Luo (Tony-Lowe)

Tony-Lowe

Geek Repo

Company:VCIP, Nankai University

Github PK Tool:Github PK Tool

Minxing Luo's starred repositories

Language:PythonStargazers:75Issues:0Issues:0
Language:MATLABLicense:GPL-3.0Stargazers:10309Issues:0Issues:0

EAST

EAST: An Efficient and Accurate Scene Text Detector.

Language:C++License:MITStargazers:14Issues:0Issues:0

AnyText

Official implementation code of the paper <AnyText: Multilingual Visual Text Generation And Editing>

Language:PythonLicense:Apache-2.0Stargazers:4088Issues:0Issues:0

MIGC

[CVPR 2024 Highlight] "MIGC: Multi-Instance Generation Controller for Text-to-Image Synthesis" (Official Implementation)

Language:PythonLicense:NOASSERTIONStargazers:498Issues:0Issues:0
Stargazers:4Issues:0Issues:0

diffusion_reward

[ECCV 2024] 💐Official implementation of the paper "Diffusion Reward: Learning Rewards via Conditional Video Diffusion"

Language:PythonLicense:MITStargazers:62Issues:0Issues:0

interactdiffusion

[CVPR 2024] Official repo for "InteractDiffusion: Interaction-Control for Text-to-Image Diffusion Model".

Language:PythonStargazers:84Issues:0Issues:0

Q-DiT

PyTorch code for Q-DiT: Accurate Post-Training Quantization for Diffusion Transformers

Language:PythonStargazers:19Issues:0Issues:0

Six-CD

Six-CD: Benchmarking Concept Removals for Benign Text-to-image Diffusion Models

Language:PythonLicense:MITStargazers:3Issues:0Issues:0

bioclip

This is the repository for the BioCLIP model and the TreeOfLife-10M dataset [CVPR'24 Oral, Best Student Paper].

Language:PythonLicense:NOASSERTIONStargazers:129Issues:0Issues:0

geneval

GenEval: An object-focused framework for evaluating text-to-image alignment

Language:HTMLLicense:MITStargazers:71Issues:0Issues:0

mmdit

Implementation of a single layer of the MMDiT, proposed in Stable Diffusion 3, in Pytorch

Language:PythonLicense:MITStargazers:205Issues:0Issues:0

SimpleTuner

A general fine-tuning kit geared toward Stable Diffusion 2.1, Stable Diffusion 3, DeepFloyd, and SDXL.

Language:PythonLicense:AGPL-3.0Stargazers:457Issues:0Issues:0

cutlass

CUDA Templates for Linear Algebra Subroutines

Language:C++License:NOASSERTIONStargazers:5034Issues:0Issues:0

make-it-count

Official implemention of "Make It Count: Text-to-Image Generation with an Accurate Number of Objects"

Language:PythonStargazers:47Issues:0Issues:0

VideoTetris

VideoTetris: Towards Compositional Text-To-Video Generation

Language:PythonStargazers:188Issues:0Issues:0

llama

Inference code for Llama models

Language:PythonLicense:NOASSERTIONStargazers:54854Issues:0Issues:0

JARVIS

JARVIS, a system to connect LLMs with ML community. Paper: https://arxiv.org/pdf/2303.17580.pdf

Language:PythonLicense:MITStargazers:23439Issues:0Issues:0

Omost

Your image is almost there!

Language:PythonLicense:Apache-2.0Stargazers:7024Issues:0Issues:0

PatchScaler

PatchScaler: An Efficient Patch-independent Diffusion Model for Super-Resolution

License:Apache-2.0Stargazers:29Issues:0Issues:0

LOVA3

The official repo of "Learning to Visual Question Answering, Asking and Assessment"

Language:PythonStargazers:9Issues:0Issues:0

MoRA

MoRA: High-Rank Updating for Parameter-Efficient Fine-Tuning

Language:PythonStargazers:309Issues:0Issues:0
Language:PythonLicense:NOASSERTIONStargazers:369Issues:0Issues:0

EditWorld

EditWorld: Simulating World Dynamics for Instruction-Following Image Editing

Language:PythonStargazers:101Issues:0Issues:0

Diff-BGM

official code for CVPR'24 paper Diff-BGM

Language:PythonStargazers:34Issues:0Issues:0

ViViD

ViViD: Video Virtual Try-on using Diffusion Models

Language:PythonLicense:Apache-2.0Stargazers:400Issues:0Issues:0

fast-kan

FastKAN: Very Fast Implementation of Kolmogorov-Arnold Networks (KAN)

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:300Issues:0Issues:0

Efficient-Vision-Language-Pre-training-by-Cluster-Masking

[CVPR 2024] Improving language-visual pretraining efficiency by perform cluster-based masking on images.

Language:PythonStargazers:20Issues:0Issues:0

ContextDiff

[ICLR 2024] Contextualized Diffusion Models for Text-Guided Image and Video Generation

Language:PythonStargazers:53Issues:0Issues:0