ddlBoJack

followers

following

stars

Shanghai Jiao Tong University

Ziyang Ma's repositories

emotion2vec

[ACL 2024] Official PyTorch code for extracting features and training downstream models with emotion2vec: Self-Supervised Pre-Training for Speech Emotion Representation

Language:Python561 14 38

Speech-Resources

语音方向实验室/公司/资源/实习等，欢迎推荐或自荐

Awesome-Speech-Pretraining

Paper, Code and Statistics for Self-Supervised Learning and Pre-Training on Speech.

MT4SSL

[INTERSPEECH 2023 Best Paper Shortlist] Official implementation for MT4SSL: Boosting Self-Supervised Speech Representation Learning by Integrating Multiple Targets

Language:PythonMIT42 4 1

Awesome-Speech-Generation

Paper, Code and Statistics for Speech Generatation.

pre-train-dockerfile

An Intro to set up your Speech Docker environment and debug using VSCode

Language:Dockerfile4 30

CS-BAOYAN-2022

计算机保研交流群（QQ群号：605176069）

Language:HTML1 10

ddlBoJack.github.io

Language:HTML1 10

DL-NLP-Readings

My Reading Lists of Deep Learning and Natural Language Processing

Language:TeXMIT1 10

alpaca-lora

Instruct-tune LLaMA on consumer hardware

Language:Jupyter NotebookApache-2.0000

amlt

A repo for amlt examples.

Language:Python020

audio-ai-timeline

A timeline of the latest AI models for audio generation, starting in 2023!

000

Awesome-Video-Grounding

A reading list of papers about Visual Grounding.

010

CSLabInfo2022

关于2022年CS保研实验室/导师招生广告的汇总。欢迎想要打广告的小伙伴积极pr，资瓷一下互联网精神吼不吼啊？

010

CSSummerCamp2022

关于2022年CS保研夏令营通知公告的汇总。欢迎大家积极分享夏令营信息，资瓷一下互联网精神吼不吼啊？

010

ddlBoJack

020

dynamic-superb

The official repository of Dynamic-SUPERB.

Language:Python000

fairseq

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Language:PythonMIT010

FastHuBERT

Language:Python000

FunASR

A Fundamental End-to-End Speech Recognition Toolkit

Language:PythonNOASSERTION000

Large-Audio-Models

Keep track of big models in audio domain, including speech, singing, music etc.

000

llama

Inference code for LLaMA models

Language:PythonGPL-3.0000

Llama-X

Open Academic Research on Improving LLaMA to SOTA LLM

Language:PythonApache-2.0000

MovieChat

🎬💭 chat with over 10K frames of video!

Language:PythonBSD-3-Clause000

NExT-GPT

Code and models for NExT-GPT: Any-to-Any Multimodal Large Language Model

Language:PythonBSD-3-Clause000

stanford_alpaca

Code and documentation to train Stanford's Alpaca models, and generate the data.

Language:PythonApache-2.0000

T2A

Project page for "Improving Few-shot Learning for Talking Face System with TTS Data Augmentation" for ICASSP2023

Language:Python000

team-learning-program

主要存储Datawhale组队学习中“编程、数据结构与算法”方向的资料。

Language:Jupyter Notebook010

UniSpeech

UniSpeech - Large Scale Self-Supervised Learning for Speech

Language:PythonNOASSERTION000

whisper

Robust Speech Recognition via Large-Scale Weak Supervision

Language:PythonMIT000