ChenWang's starred repositories
Qwen2-Audio
The official repo of Qwen2-Audio chat & pretrained large audio language model proposed by Alibaba Cloud.
vocalsound
Dataset and baseline code for the VocalSound dataset (ICASSP2022).
lp-music-caps
LP-MusicCaps: LLM-Based Pseudo Music Captioning [ISMIR23]
ai-audio-datasets
AI Audio Datasets (AI-ADS) 🎵, including Speech, Music, and Sound Effects, which can provide training data for Generative AI, AIGC, AI model training, intelligent audio tool development, and audio applications.
emotion2vec
[ACL 2024] Official PyTorch code for extracting features and training downstream models with emotion2vec: Self-Supervised Pre-Training for Speech Emotion Representation
prometheus-eval
Evaluate your LLM's response with Prometheus and GPT-4 💯
Spec-Bench
Spec-Bench: A Comprehensive Benchmark and Unified Evaluation Platform for Speculative Decoding (ACL 2024 Findings)
fish-speech
Brand new TTS solution
SpeculativeDecodingPapers
📰 Must-read papers and blogs on Speculative Decoding ⚡️
MT-Reading-List
A machine translation reading list maintained by Tsinghua Natural Language Processing Group
transformers
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
nndl.github.io
《神经网络与深度学习》 (Neural Networks and Deep Learning), a book by Xipeng Qiu (邱锡鹏)
nlp-tutorial
Natural Language Processing Tutorial for Deep Learning Researchers