sunny2109

Long's starred repositories

Kolors

Kolors Team

Language:PythonApache-2.0286500

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Language:Jupyter NotebookApache-2.0687000

FVMD-frechet-video-motion-distance

Fréchet Video Motion Distance: A Metric for Evaluating Motion Consistency in Videos

Language:PythonApache-2.01800

how-do-vits-work

(ICLR 2022 Spotlight) Official PyTorch implementation of "How Do Vision Transformers Work?"

Language:PythonApache-2.080100

SAFMN

[ICCV 2023] Spatially-Adaptive Feature Modulation for Efficient Image Super-Resolution; runner-up method for the model complexity track in NTIRE2023 Efficient SR challenge

Language:Python24400

SMFANet

[ECCV 2024] SMFANet: A Lightweight Self-Modulation Feature Aggregation Network for Efficient Image Super-Resolution

Language:PythonApache-2.03100

UltraPixel

Implementation of UltraPixel: Advancing Ultra-High-Resolution Image Synthesis to New Peaks

Language:PythonAGPL-3.037200

MNN

MNN is a blazing fast, lightweight deep learning framework, battle-tested by business-critical use cases in Alibaba

Language:C++850500

SMFANet

[ECCV 2024] SMFANet: A Lightweight Self-Modulation Feature Aggregation Network for Efficient Image Super-Resolution

Language:PythonApache-2.0600

VideoGPT

Language:Jupyter NotebookMIT95000

CV-VAE

CV-VAE: A Compatible Video VAE for Latent Generative Video Models

Language:Jupyter Notebook18800

minRF

Minimal implementation of scalable rectified flow transformers, based on SD3's approach

Language:Jupyter NotebookApache-2.032900

IQA-PyTorch

👁️ 🖼️ 🔥PyTorch Toolbox for Image Quality Assessment, including LPIPS, FID, NIQE, NRQM(Ma), MUSIQ, TOPIQ, NIMA, DBCNN, BRISQUE, PI and more...

Language:PythonNOASSERTION169500

VideoTetris

VideoTetris: Towards Compositional Text-To-Video Generation

Language:Python18500

rfpp

The codebase of our paper "Improving the Training of Rectified Flows"

Language:Python5200

EVSSM

Efficient Visual State Space Model for Image Deblurring

MIT1700

DiT-Visualization

Visualization of DiT self attention features

Language:Python9400

Awesome-Multimodal-Large-Language-Models

:sparkles::sparkles:Latest Advances on Multimodal Large Language Models

1097100

understanding_dl

A lecture note for understanding deep learning

Language:Jupyter Notebook14400

VAR

[GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly yet state-of-the-art* codebase for autoregressive image generation!

Language:PythonMIT389700

sunny2109

Long's starred repositories

Kolors

segment-anything-2

FVMD-frechet-video-motion-distance

how-do-vits-work

SAFMN

SMFANet

UltraPixel

MNN

SMFANet

VideoGPT

CV-VAE

minRF

IQA-PyTorch

VideoTetris

rfpp

EVSSM

DiT-Visualization

Awesome-Multimodal-Large-Language-Models

understanding_dl

VAR

Open-Sora

Open-Sora-Plan

colormnet

NTIRE2024-ESR-SMFAN

NeRD-Rain

Awesome-Video-Diffusion

openISP

DSTNet-plus

Awesome-diffusion-model-for-image-processing

aimet