Bingyin Zhao's repositories
3DCommonCorruptions
3D Common Corruptions and Data Augmentation, CVPR 2022 [Oral]
adv-training-corruptions
On the effectiveness of adversarial training against common corruptions [UAI 2022]
Awesome-Transformer-Attention
An ultimately comprehensive paper list of Vision Transformer/Attention, including papers, codes, and related websites
bxz9200.github.io
Under Construction
DiT
Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"
dspy
DSPy: The framework for programming—not prompting—foundation models
easyrobust
EasyRobust: an Easy-to-use library for state-of-the-art Robust Computer Vision Research with PyTorch.
FAN
Official PyTorch implementation of Fully Attentional Networks
frequency-backdoor
ICCV 2021, We find most existing triggers of backdoor attacks in deep learning contain severe artifacts in the frequency domain. This Repo. explores how we can use these artifacts to develop stronger backdoor defenses and attacks.
GAN-for-tabular-data
We well know GANs for success in the realistic image generation. However, they can be applied in tabular data generation. We will review and examine some recent papers about tabular GANs in action.
generative-ai-for-beginners
18 Lessons, Get Started Building with Generative AI đź”— https://microsoft.github.io/generative-ai-for-beginners/
imagen-pytorch
Implementation of Imagen, Google's Text-to-Image Neural Network, in Pytorch
kornia
Open Source Differentiable Computer Vision Library
LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning: LLaVA (Large Language-and-Vision Assistant) built towards GPT-4V level capabilities.
Machine-Learning-Interviews
This repo is meant to serve as a guide for Machine Learning/AI technical interviews.
MachineLearningPlayground
Implementation of basic mathematical pattern recognition/machine learning techniques for fun
mem0
The memory layer for Personalized AI
netron
Visualizer for neural network, deep learning, and machine learning models
pytorch-image-models
PyTorch image models, scripts, pretrained weights -- ResNet, ResNeXT, EfficientNet, EfficientNetV2, NFNet, Vision Transformer, MixNet, MobileNet-V3/V2, RegNet, DPN, CSPNet, and more
storm
An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.
Transformer-Explainability
[CVPR 2021] Official PyTorch implementation for Transformer Interpretability Beyond Attention Visualization, a novel method to visualize classifications by Transformer based networks.
TransNeXt
Code release for TransNeXt model
ttt-lm-pytorch
Official PyTorch implementation of Learning to (Learn at Test Time): RNNs with Expressive Hidden States
VCoder
VCoder: Versatile Vision Encoders for Multimodal Large Language Models, arXiv 2023 / CVPR 2024
ViT-pytorch
Pytorch reimplementation of the Vision Transformer (An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale)
vit-pytorch-playground
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch