zero is not none's starred repositories

stablediffusion

High-Resolution Image Synthesis with Latent Diffusion Models

Language: Python · License: MIT · Stargazers: 37511 · Issues: 441 · Issues: 294

flash-attention

Fast and memory-efficient exact attention

Language: Python · License: BSD-3-Clause · Stargazers: 11920 · Issues: 103 · Issues: 865

denoising-diffusion-pytorch

Implementation of Denoising Diffusion Probabilistic Model in Pytorch

Language: Python · License: MIT · Stargazers: 7506 · Issues: 32 · Issues: 279

TensorRT-LLM

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.

Language: C++ · License: Apache-2.0 · Stargazers: 7417 · Issues: 85 · Issues: 1562

OutfitAnyone

Outfit Anyone: Ultra-high quality virtual try-on for Any Clothing and Any Person

FunASR

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

Language: Python · License: NOASSERTION · Stargazers: 4605 · Issues: 50 · Issues: 909

VideoCrafter

VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models

Language: Python · License: NOASSERTION · Stargazers: 4303 · Issues: 68 · Issues: 70

YOLO-World

[CVPR 2024] Real-Time Open-Vocabulary Object Detection

Language: Python · License: GPL-3.0 · Stargazers: 3934 · Issues: 37 · Issues: 365

T2I-Adapter

T2I-Adapter

Language: Python · License: Apache-2.0 · Stargazers: 3298 · Issues: 40 · Issues: 107

Video-LLaVA

Video-LLaVA: Learning United Visual Representation by Alignment Before Projection

Language: Python · License: Apache-2.0 · Stargazers: 2679 · Issues: 27 · Issues: 165

PixArt-alpha

PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis

Language: Python · License: AGPL-3.0 · Stargazers: 2533 · Issues: 46 · Issues: 0

DynamiCrafter

[ECCV 2024] DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors

Language: Python · License: Apache-2.0 · Stargazers: 2076 · Issues: 25 · Issues: 100

Monkey

【CVPR 2024 Highlight】Monkey (LMM): Image Resolution and Text Label Are Important Things for Large Multi-modal Models

Language: Python · License: MIT · Stargazers: 1551 · Issues: 20 · Issues: 95

Yuan-2.0

Yuan 2.0 Large Language Model

Language: Python · License: NOASSERTION · Stargazers: 669 · Issues: 5 · Issues: 91

Low-Level-Vision-Paper-Record

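A record of recent papers on 1) low-level vision tasks such as image/video super-resolution and enhancement; 2) image generation and related tasks, mainly DL-based methods from 2018 onward.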

2D-Gaussian-Splatting

A 2D Gaussian Splatting paper for no obvious reasons. Enjoy!

Language: Jupyter Notebook · License: MIT · Stargazers: 335 · Issues: 5 · Issues: 7

FudanOCR

A toolbox of scene text super-resolution and recognition

TimeChat

[CVPR 2024] TimeChat: A Time-sensitive Multimodal Large Language Model for Long Video Understanding

Language: Python · License: BSD-3-Clause · Stargazers: 241 · Issues: 5 · Issues: 38

deep-learning-dynamics-paper-list

This is a list of peer-reviewed representative papers on deep learning dynamics (optimization dynamics of neural networks). The success of deep learning is attributed to both network architecture and stochastic optimization. Thus, deep learning dynamics play an essential role in the theoretical foundation of deep learning.

License: MIT · Stargazers: 232 · Issues: 14 · Issues: 0

Python-Image-Morpher

Python Image Morpher (PIM) is a program that blends images to your content!

Language: Python · License: MIT · Stargazers: 162 · Issues: 7 · Issues: 9

GEM

[CVPR24] Official Implementation of GEM (Grounding Everything Module)

Language: Python · License: MIT · Stargazers: 61 · Issues: 4 · Issues: 5

PHDiffusion-Painterly-Image-Harmonization

[ACM MM 2023] The code used in our paper "Painterly Image Harmonization using Diffusion Model", ACM MM2023.

Language: Python · License: Apache-2.0 · Stargazers: 48 · Issues: 8 · Issues: 8

PICTURE

Official code for paper "PICTURE: PhotorealistIC virtual Try-on from UnconstRained dEsigns"

Language: Jupyter Notebook · License: MIT · Stargazers: 38 · Issues: 10 · Issues: 8

DDM-Public

code for paper: Decoupled diffusion models: image to zero and zero to noise

mix-bt

Official PyTorch Implementation of Guarding Barlow Twins Against Overfitting with Mixed Samples

Language: Python · License: MIT · Stargazers: 13 · Issues: 2 · Issues: 1

wechat-official-account-toolkit

A toolkit for processing WeChat Official Account articles

Language: Python · Stargazers: 6 · Issues: 1 · Issues: 0