Wenyi Hong's starred repositories
transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
Awesome-Diffusion-Models
A collection of resources and papers on Diffusion Models
taming-transformers
Taming Transformers for High-Resolution Image Synthesis
VisualGLM-6B
Chinese and English multimodal conversational language model | 多模态中英双语对话语言模型
Text2Video-Zero
[ICCV 2023 Oral] Text-to-Image Diffusion Models are Zero-Shot Video Generators
frame-interpolation
FILM: Frame Interpolation for Large Motion, In ECCV 2022.
MAE-pytorch
Unofficial PyTorch implementation of Masked Autoencoders Are Scalable Vision Learners
awesome-human-pose-estimation
A collection of awesome resources in Human Pose estimation.
webdataset
A high-performance Python-based I/O system for large (and small) deep learning problems, with strong support for PyTorch.
ImageReward
[NeurIPS 2023] ImageReward: Learning and Evaluating Human Preferences for Text-to-image Generation
Neighborhood-Attention-Transformer
Neighborhood Attention Transformer, arxiv 2022 / CVPR 2023. Dilated Neighborhood Attention Transformer, arxiv 2022
SwissArmyTransformer
SwissArmyTransformer is a flexible and powerful library to develop your own Transformer variants.
cycle-diffusion
[ICCV 2023] A latent space for stochastic diffusion models
RelayDiffusion
The official implementation of "Relay Diffusion: Unifying diffusion process across resolutions for image synthesis" [ICLR 2024 Spotlight]
ScreenAgent
ScreenAgent: A Computer Control Agent Driven by Visual Language Large Model (IJCAI-24)
kinetics-datasets-downloader
Download DeepMind's Kinetics dataset.