Ivy's starred repositories
annotated_deep_learning_paper_implementations
🧑🏫 60 Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans(cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠
pytorch-image-models
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (ViT), MobileNetV4, MobileNet-V3 & V2, RegNet, DPN, CSPNet, Swin Transformer, MaxViT, CoAtNet, ConvNeXt, and more
paper-reading
深度学习经典、新论文逐段精读
deep-learning-for-image-processing
deep learning for image processing including classification and object-detection etc.
nndl.github.io
《神经网络与深度学习》 邱锡鹏著 Neural Network and Deep Learning
External-Attention-pytorch
🍀 Pytorch implementation of various Attention Mechanisms, MLP, Re-parameter, Convolution, which is helpful to further understand papers.⭐⭐⭐
Person_reID_baseline_pytorch
:bouncing_ball_person: Pytorch ReID: A tiny, friendly, strong pytorch implement of person re-id / vehicle re-id baseline. Tutorial 👉https://github.com/layumi/Person_reID_baseline_pytorch/tree/master/tutorial
MachineLearning
Machine learning resources
mmpretrain
OpenMMLab Pre-training Toolbox and Benchmark
Awesome-Backbones
Integrate deep learning models for image classification | Backbone learning/comparison/magic modification project
Transformer-in-Vision
Recent Transformer-based CV and related works.
conv-emotion
This repo contains implementation of different architectures for emotion recognition in conversations.
pytorch-video-recognition
PyTorch implemented C3D, R3D, R2Plus1D models for video activity recognition.
multimodal-deep-learning
This repository contains various models targetting multimodal representation learning, multimodal fusion for downstream tasks such as multimodal sentiment analysis.
MultiBench
[NeurIPS 2021] Multiscale Benchmarks for Multimodal Representation Learning
Face-Transformer
Face Transformer for Recognition
long-short-term-transformer
[NeurIPS 2021 Spotlight] Official implementation of Long Short-Term Transformer for Online Action Detection
AWESOME-MER
🔆 📝 A reading list focused on Multimodal Emotion Recognition (MER) 👂👄 👀 💬
MOSEI_UMONS
A Transformer-based joint-encoding for Emotion Recognition and Sentiment Analysis
Multimodal-End2end-Sparse
The code repository for NAACL 2021 paper "Multimodal End-to-End Sparse Model for Emotion Recognition".
CMU-MultimodalSDK-Tutorials
This is a short tutorial for using the CMU-MultimodalSDK.
Former-DFER
[MM'21] Former-DFER: Dynamic Facial Expression Recognition Transformer
EMO-AffectNetModel
Dynamic and static models for real-time facial emotion recognition