Xu Yang's starred repositories
YOLO-World
[CVPR 2024] Real-Time Open-Vocabulary Object Detection
latent-consistency-model
Latent Consistency Models: Synthesizing High-Resolution Images with Few-Step Inference
Video-LLaVA
Video-LLaVA: Learning United Visual Representation by Alignment Before Projection
Chat-UniVi
[CVPR 2024 Highlight🔥] Chat-UniVi: Unified Visual Representation Empowers Large Language Models with Image and Video Understanding
FlagEmbedding
Retrieval and Retrieval-augmented LLMs
stable-diffusion
Latent Text-to-Image Diffusion
stable-diffusion
A latent text-to-image diffusion model
ChatGLM2-6B
ChatGLM2-6B: An Open Bilingual Chat LLM
Beta-DARTS
Official implementation of β-DARTS: Beta-Decay Regularization for Differentiable Architecture Search (CVPR 2022 oral).
transformers
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
mini-imagenet-tools
Tools for generating mini-ImageNet dataset and processing batches
pytorch-cifar100
Practice on CIFAR-100 (ResNet, DenseNet, VGG, GoogleNet, InceptionV3, InceptionV4, Inception-ResNetv2, Xception, Resnet In Resnet, ResNext, ShuffleNet, ShuffleNetv2, MobileNet, MobileNetv2, SqueezeNet, NasNet, Residual Attention Network, SENet, WideResNet)
PyTorch-VAE
A Collection of Variational Autoencoders (VAE) in PyTorch.
vit-pytorch
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in PyTorch