IvyXia

0

followers

following

stars

Ivy's starred repositories

d2l-zh

《动手学深度学习》：面向中文读者、能运行、可讨论。中英文版被70多个国家的500多所大学用于教学。

Language:PythonApache-2.058373 10420

annotated_deep_learning_paper_implementations

🧑‍🏫 60 Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans(cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠

Language:PythonMIT51005 429 127

pytorch-image-models

The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (ViT), MobileNetV4, MobileNet-V3 & V2, RegNet, DPN, CSPNet, Swin Transformer, MaxViT, CoAtNet, ConvNeXt, and more

Language:PythonApache-2.030567 312 882

paper-reading

深度学习经典、新论文逐段精读

Apache-2.024868 7010

deep-learning-for-image-processing

deep learning for image processing including classification and object-detection etc.

Language:PythonGPL-3.021566 156 405

nndl.github.io

《神经网络与深度学习》邱锡鹏著 Neural Network and Deep Learning

Language:HTML17205 753 626

External-Attention-pytorch

🍀 Pytorch implementation of various Attention Mechanisms, MLP, Re-parameter, Convolution, which is helpful to further understand papers.⭐⭐⭐

Language:PythonMIT11092 104 79

mmf

A modular framework for vision & language multimodal research from Facebook AI Research (FAIR)

Language:PythonNOASSERTION5441 115 652

Person_reID_baseline_pytorch

:bouncing_ball_person: Pytorch ReID: A tiny, friendly, strong pytorch implement of person re-id / vehicle re-id baseline. Tutorial 👉https://github.com/layumi/Person_reID_baseline_pytorch/tree/master/tutorial

Language:PythonMIT4002 77 377

MachineLearning

Machine learning resources

mmpretrain

OpenMMLab Pre-training Toolbox and Benchmark

Language:PythonApache-2.03272 31 758

scenic

Scenic: A Jax Library for Computer Vision Research and Beyond

Language:PythonApache-2.03116 40 240

Awesome-Backbones

Integrate deep learning models for image classification | Backbone learning/comparison/magic modification project

Language:Python1388 35 110

Transformer-in-Vision

Recent Transformer-based CV and related works.

conv-emotion

This repo contains implementation of different architectures for emotion recognition in conversations.

Language:PythonMIT1300 370

pytorch-video-recognition

PyTorch implemented C3D, R3D, R2Plus1D models for video activity recognition.

Language:PythonMIT1164 17 75

ast

Code for the Interspeech 2021 paper "AST: Audio Spectrogram Transformer".

Language:Jupyter NotebookBSD-3-Clause1055 18 130

multimodal-deep-learning

This repository contains various models targetting multimodal representation learning, multimodal fusion for downstream tasks such as multimodal sentiment analysis.

Language:OpenEdge ABLMIT682 5 8

MultiBench

[NeurIPS 2021] Multiscale Benchmarks for Multimodal Representation Learning

Language:HTMLMIT455 16 32

awesome-vit

Face-Transformer

Face Transformer for Recognition

Language:PythonMIT244 4 38

long-short-term-transformer

[NeurIPS 2021 Spotlight] Official implementation of Long Short-Term Transformer for Online Action Detection

Language:PythonApache-2.0123 7 30

AWESOME-MER

🔆 📝 A reading list focused on Multimodal Emotion Recognition (MER) 👂👄 👀 💬

MOSEI_UMONS

A Transformer-based joint-encoding for Emotion Recognition and Sentiment Analysis

Language:PythonMIT113 8 22

Multimodal-End2end-Sparse

The code repository for NAACL 2021 paper "Multimodal End-to-End Sparse Model for Emotion Recognition".

Language:Python89 7 24

CMU-MultimodalSDK-Tutorials

This is a short tutorial for using the CMU-MultimodalSDK.

Language:Jupyter Notebook78 5 11

Former-DFER

[MM'21] Former-DFER: Dynamic Facial Expression Recognition Transformer

Language:PythonMIT68 2 6

EMO-AffectNetModel

Dynamic and static models for real-time facial emotion recognition

Language:Jupyter NotebookMIT59 3 4

SDPSO

Strategy dynamics particle swarm optimizer (SDPSO)

Language:MATLAB2 10

MAFW

MAFW: A Large-scale, Multi-modal, Compound Affective Database for Dynamic Facial Expression Recognition in the Wild

MIT1 20