Ziyang Ma's repositories
Speech-Resources
Speech-related labs, companies, resources, internships, and more. Recommendations and self-nominations are welcome.
emotion2vec
Official PyTorch code for extracting features and training downstream models with emotion2vec: Self-Supervised Pre-Training for Speech Emotion Representation
Awesome-Speech-Pretraining
Paper, Code and Statistics for Self-Supervised Learning and Pre-Training on Speech.
Awesome-Speech-Generation
Paper, Code and Statistics for Speech Generation.
pre-train-dockerfile
An introduction to setting up your speech Docker environment and debugging with VSCode
CS-BAOYAN-2022
Discussion group for computer science postgraduate recommendation (baoyan) applicants (QQ group number: 605176069)
DL-NLP-Readings
My Reading Lists of Deep Learning and Natural Language Processing
alpaca-lora
Instruct-tune LLaMA on consumer hardware
audio-ai-timeline
A timeline of the latest AI models for audio generation, starting in 2023!
Awesome-Video-Grounding
A reading list of papers about Visual Grounding.
CSLabInfo2022
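A collection of 2022 recruitment ads from labs and advisors for CS postgraduate recommendation (baoyan) applicants. Anyone who wants to advertise is welcome to open a PR. How about supporting the spirit of the internet?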
CSSummerCamp2022
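A collection of 2022 summer camp announcements for CS postgraduate recommendation (baoyan) applicants. Everyone is welcome to share summer camp information. How about supporting the spirit of the internet?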
FunASR
A Fundamental End-to-End Speech Recognition Toolkit
Large-Audio-Models
Keeps track of large models in the audio domain, including speech, singing, music, etc.
llama
Inference code for LLaMA models
Llama-X
Open Academic Research on Improving LLaMA to SOTA LLM
MovieChat
🎬💭 chat with over 10K frames of video!
NExT-GPT
Code and models for NExT-GPT: Any-to-Any Multimodal Large Language Model
stanford_alpaca
Code and documentation to train Stanford's Alpaca models, and generate the data.
T2A
Project page for "Improving Few-shot Learning for Talking Face System with TTS Data Augmentation" (ICASSP 2023)
team-learning-program
Mainly stores materials for the "Programming, Data Structures and Algorithms" track of Datawhale's team learning program.
UniSpeech
UniSpeech - Large Scale Self-Supervised Learning for Speech
whisper
Robust Speech Recognition via Large-Scale Weak Supervision