Jesjes1233's starred repositories
Baichuan-13B
A 13B large language model developed by Baichuan Intelligent Technology
multimodal-deep-learning
This repository contains various models targeting multimodal representation learning and multimodal fusion for downstream tasks such as multimodal sentiment analysis.
ch-sims-v2
Make Acoustic and Visual Cues Matter: CH-SIMS v2.0 Dataset and AV-Mixup Consistent Module
AWESOME-MSA
Paper List for Multimodal Sentiment Analysis
multimodal_emotion_recognition
Improved Multi-modal Emotion Recognition using Squeeze-and-Excitation Block in Cross-Modal Attention
MetaTransformer
Meta-Transformer for Unified Multimodal Learning
Multimodal-Transformer
[ACL'19] [PyTorch] Multimodal Transformer
EfficientAT
This repository aims at providing efficient CNNs for Audio Tagging. We provide AudioSet pre-trained models ready for downstream training and extraction of audio embeddings.
VisDrone-Dataset
The dataset for drone-based detection and tracking is released, including images, videos, and annotations.
Audio-signal-classification-and-identification
Signal classification and identification based on Mel spectrograms
Awesome-LLM-Safety
A curated list of safety-related papers, articles, and resources focused on Large Language Models (LLMs). This repository aims to provide researchers, practitioners, and enthusiasts with insights into the safety implications, challenges, and advancements surrounding these powerful models.