Ziyang Ma (ddlBoJack)



Company: Shanghai Jiao Tong University

Home Page: ziyang.tech


Ziyang Ma's repositories

Speech-Resources

Speech-related labs, companies, resources, internships, and more. Recommendations and self-nominations are welcome.

emotion2vec

Official PyTorch code for extracting features and training downstream models with emotion2vec: Self-Supervised Pre-Training for Speech Emotion Representation

SLAM-LLM

Speech, Language, Audio, Music Processing with Large Language Model

Language: Python, License: MIT, Stargazers: 302, Issues: 0, Issues: 0

Awesome-Speech-Pretraining

Paper, Code and Statistics for Self-Supervised Learning and Pre-Training on Speech.

MT4SSL

Official implementation for MT4SSL: Boosting Self-Supervised Speech Representation Learning by Integrating Multiple Targets

Language: Python, License: MIT, Stargazers: 41, Issues: 4, Issues: 1

Awesome-Speech-Generation

Paper, Code and Statistics for Speech Generation.

pre-train-dockerfile

An intro to setting up your speech Docker environment and debugging with VSCode

Language: Dockerfile, Stargazers: 4, Issues: 3, Issues: 0

CS-BAOYAN-2022

Discussion group for computer science postgraduate recommendation (baoyan) applicants (QQ group number: 605176069)

Language: HTML, Stargazers: 1, Issues: 1, Issues: 0

DL-NLP-Readings

My Reading Lists of Deep Learning and Natural Language Processing

Language: TeX, License: MIT, Stargazers: 1, Issues: 1, Issues: 0

alpaca-lora

Instruct-tune LLaMA on consumer hardware

Language: Jupyter Notebook, License: Apache-2.0, Stargazers: 0, Issues: 0, Issues: 0

amlt

A repo for amlt examples.

Language: Python, Stargazers: 0, Issues: 2, Issues: 0

audio-ai-timeline

A timeline of the latest AI models for audio generation, starting in 2023!

Stargazers: 0, Issues: 0, Issues: 0

Awesome-Video-Grounding

A reading list of papers about Visual Grounding.

Stargazers: 0, Issues: 1, Issues: 0

CSLabInfo2022

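A collection of 2022 recruitment announcements from CS labs and advisors for postgraduate recommendation (baoyan) applicants. Anyone who wants to post an announcement is welcome to open a PR. How about showing some support for the spirit of the Internet?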

Stargazers: 0, Issues: 1, Issues: 0

CSSummerCamp2022

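A collection of 2022 CS summer camp announcements for postgraduate recommendation (baoyan) applicants. Everyone is welcome to share summer camp information. How about showing some support for the spirit of the Internet?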

Stargazers: 0, Issues: 1, Issues: 0
Stargazers: 0, Issues: 2, Issues: 0

fairseq

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Language: Python, License: MIT, Stargazers: 0, Issues: 1, Issues: 0
Language: Python, Stargazers: 0, Issues: 0, Issues: 0

FunASR

A Fundamental End-to-End Speech Recognition Toolkit

Language: Python, License: NOASSERTION, Stargazers: 0, Issues: 0, Issues: 0

Large-Audio-Models

Keep track of big models in audio domain, including speech, singing, music etc.

Stargazers: 0, Issues: 0, Issues: 0

llama

Inference code for LLaMA models

Language: Python, License: GPL-3.0, Stargazers: 0, Issues: 0, Issues: 0

Llama-X

Open Academic Research on Improving LLaMA to SOTA LLM

Language: Python, License: Apache-2.0, Stargazers: 0, Issues: 0, Issues: 0

MovieChat

🎬💭 chat with over 10K frames of video!

Language: Python, License: BSD-3-Clause, Stargazers: 0, Issues: 0, Issues: 0

NExT-GPT

Code and models for NExT-GPT: Any-to-Any Multimodal Large Language Model

Language: Python, License: BSD-3-Clause, Stargazers: 0, Issues: 0, Issues: 0

stanford_alpaca

Code and documentation to train Stanford's Alpaca models, and generate the data.

Language: Python, License: Apache-2.0, Stargazers: 0, Issues: 0, Issues: 0

T2A

Project page for "Improving Few-shot Learning for Talking Face System with TTS Data Augmentation" for ICASSP2023

Language: Python, Stargazers: 0, Issues: 0, Issues: 0

team-learning-program

Mainly stores materials for the "Programming, Data Structures, and Algorithms" track of Datawhale team learning.

Language: Jupyter Notebook, Stargazers: 0, Issues: 1, Issues: 0

UniSpeech

UniSpeech - Large Scale Self-Supervised Learning for Speech

Language: Python, License: NOASSERTION, Stargazers: 0, Issues: 0, Issues: 0

whisper

Robust Speech Recognition via Large-Scale Weak Supervision

Language: Python, License: MIT, Stargazers: 0, Issues: 0, Issues: 0