Artinacode's starred repositories
VideoLLaMA2
VideoLLaMA 2: Advancing Spatial-Temporal Modeling and Audio Understanding in Video-LLMs
HRNet-Facial-Landmark-Detection
This is an official implementation of facial landmark detection for our TPAMI paper "Deep High-Resolution Representation Learning for Visual Recognition". https://arxiv.org/abs/1908.07919
SpeechEmoRec
Speech Emotion Recognition Using Deep Convolutional Neural Network and Discriminant Temporal Pyramid Matching
MultiBench
[NeurIPS 2021] Multiscale Benchmarks for Multimodal Representation Learning
Multimodal-Transformer
[ACL'19] [PyTorch] Multimodal Transformer
llm-universe
本项目是一个面向小白开发者的大模型应用开发教程,在线阅读地址:https://datawhalechina.github.io/llm-universe/
Plug-and-Play
深度学习中各种即插即用小模块
torchkeras
Pytorch❤️ Keras 😋😋
Image-Classification-PyTorch
Learning and Building Convolutional Neural Networks using PyTorch
lightning-hydra-template
PyTorch Lightning + Hydra. A very user-friendly template for ML experimentation. ⚡🔥⚡
Swin-Transformer
This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".
multimodal-deep-learning
This repository contains various models targetting multimodal representation learning, multimodal fusion for downstream tasks such as multimodal sentiment analysis.
Multimodal-Emotion-Recognition
This repository contains the code for the paper `End-to-End Multimodal Emotion Recognition using Deep Neural Networks`.
Multimodal-Emotion-Recognition
A real time Multimodal Emotion Recognition web app for text, sound and video inputs
awesome-multimodal-ml
Reading list for research topics in multimodal machine learning
Chinese-Text-Classification-PyTorch
中文文本分类任务,基于PyTorch实现(TextCNN,TextRNN,FastText,TextRCNN,BiLSTM_Attention, DPCNN, Transformer,Bert,ERNIE),开箱即用!
magic-python
Python 黑魔法手册
X-AnyLabeling
Effortless data labeling with AI support from Segment Anything and other awesome models.