Ivy (IvyXia)

Ivy's starred repositories

d2l-zh

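Dive into Deep Learning: written for Chinese readers, runnable, and open to discussion. The Chinese and English editions are used for teaching at more than 500 universities in over 70 countries.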

Language: Python; License: Apache-2.0; Stargazers: 58373; Issues: 1042; Issues: 0

annotated_deep_learning_paper_implementations

🧑‍🏫 60 implementations/tutorials of deep learning papers with side-by-side notes 📝, including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans (cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠

Language: Python; License: MIT; Stargazers: 51005; Issues: 429; Issues: 127

pytorch-image-models

The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (ViT), MobileNetV4, MobileNet-V3 & V2, RegNet, DPN, CSPNet, Swin Transformer, MaxViT, CoAtNet, ConvNeXt, and more

Language: Python; License: Apache-2.0; Stargazers: 30567; Issues: 312; Issues: 882

paper-reading

Paragraph-by-paragraph close readings of classic and recent deep learning papers

License: Apache-2.0; Stargazers: 24868; Issues: 701; Issues: 0

deep-learning-for-image-processing

Deep learning for image processing, including classification, object detection, etc.

Language: Python; License: GPL-3.0; Stargazers: 21566; Issues: 156; Issues: 405

nndl.github.io

Neural Network and Deep Learning, by Xipeng Qiu (邱锡鹏)

External-Attention-pytorch

🍀 Pytorch implementation of various Attention Mechanisms, MLP, Re-parameter, Convolution, which is helpful to further understand papers.⭐⭐⭐

Language: Python; License: MIT; Stargazers: 11092; Issues: 104; Issues: 79

mmf

A modular framework for vision & language multimodal research from Facebook AI Research (FAIR)

Language: Python; License: NOASSERTION; Stargazers: 5441; Issues: 115; Issues: 652

Person_reID_baseline_pytorch

⛹️ Pytorch ReID: A tiny, friendly, strong PyTorch implementation of a person re-id / vehicle re-id baseline. Tutorial 👉 https://github.com/layumi/Person_reID_baseline_pytorch/tree/master/tutorial

Language: Python; License: MIT; Stargazers: 4002; Issues: 77; Issues: 377

mmpretrain

OpenMMLab Pre-training Toolbox and Benchmark

Language: Python; License: Apache-2.0; Stargazers: 3272; Issues: 31; Issues: 758

scenic

Scenic: A Jax Library for Computer Vision Research and Beyond

Language: Python; License: Apache-2.0; Stargazers: 3116; Issues: 40; Issues: 240

Awesome-Backbones

Integrate deep learning models for image classification | Backbone learning/comparison/magic modification project

Transformer-in-Vision

Recent Transformer-based CV and related works.

conv-emotion

This repo contains implementations of different architectures for emotion recognition in conversations.

Language: Python; License: MIT; Stargazers: 1300; Issues: 37; Issues: 0

pytorch-video-recognition

PyTorch implementations of the C3D, R3D, and R2Plus1D models for video activity recognition.

Language: Python; License: MIT; Stargazers: 1164; Issues: 17; Issues: 75

ast

Code for the Interspeech 2021 paper "AST: Audio Spectrogram Transformer".

Language: Jupyter Notebook; License: BSD-3-Clause; Stargazers: 1055; Issues: 18; Issues: 130

multimodal-deep-learning

This repository contains various models targeting multimodal representation learning and multimodal fusion for downstream tasks such as multimodal sentiment analysis.

Language: OpenEdge ABL; License: MIT; Stargazers: 682; Issues: 5; Issues: 8

MultiBench

[NeurIPS 2021] Multiscale Benchmarks for Multimodal Representation Learning

Language: HTML; License: MIT; Stargazers: 455; Issues: 16; Issues: 32

Face-Transformer

Face Transformer for Recognition

Language: Python; License: MIT; Stargazers: 244; Issues: 4; Issues: 38

long-short-term-transformer

[NeurIPS 2021 Spotlight] Official implementation of Long Short-Term Transformer for Online Action Detection

Language: Python; License: Apache-2.0; Stargazers: 123; Issues: 7; Issues: 30

AWESOME-MER

🔆 📝 A reading list focused on Multimodal Emotion Recognition (MER) 👂👄 👀 💬

MOSEI_UMONS

A Transformer-based joint encoding for emotion recognition and sentiment analysis

Language: Python; License: MIT; Stargazers: 113; Issues: 8; Issues: 22

Multimodal-End2end-Sparse

The code repository for the NAACL 2021 paper "Multimodal End-to-End Sparse Model for Emotion Recognition".

CMU-MultimodalSDK-Tutorials

This is a short tutorial for using the CMU-MultimodalSDK.

Language: Jupyter Notebook; Stargazers: 78; Issues: 5; Issues: 11

Former-DFER

[MM'21] Former-DFER: Dynamic Facial Expression Recognition Transformer

Language: Python; License: MIT; Stargazers: 68; Issues: 2; Issues: 6

EMO-AffectNetModel

Dynamic and static models for real-time facial emotion recognition

Language: Jupyter Notebook; License: MIT; Stargazers: 59; Issues: 3; Issues: 4

SDPSO

Strategy dynamics particle swarm optimizer (SDPSO)

Language: MATLAB; Stargazers: 2; Issues: 1; Issues: 0

MAFW

MAFW: A Large-scale, Multi-modal, Compound Affective Database for Dynamic Facial Expression Recognition in the Wild

License: MIT; Stargazers: 1; Issues: 2; Issues: 0