Kai-Wei Chang (張凱爲)'s repositories
speech-trident
Awesome speech/audio LLMs, representation learning, and codec models
ML2021-Spring
**Official** 李宏毅 (Hung-yi Lee) Machine Learning 2021 Spring
Speech-Prompts-Adapters
This repository surveys papers on prompting and adapters for speech processing.
SpeechPrompt
**Interspeech 2022** "SpeechPrompt: An Exploration of Prompt Tuning on Generative Spoken Language Model for Speech Processing Tasks". Speech processing with the prompting paradigm
FastSpeech2
Multi-Speaker PyTorch FastSpeech2: Fast and High-Quality End-to-End Text to Speech :fist:
SpeechPrompt-v2
"SpeechPrompt v2: Prompt Tuning for Speech Classification Tasks". Speech processing with the prompting paradigm
Taiwanese-Whisper
Fine-tune the Whisper model for Taiwanese speech recognition
AudioCodec-Hub
AudioCodec-Hub is a Python library for encoding and decoding audio data, supporting various neural audio codec models
Taiwanese-Speech-Synthesis
Taiwanese Speech Synthesis with Tacotron2
Taiwanese-Translation
Taiwanese translation with a BERT-based model and an RNN. Includes a collection of Taiwanese text corpora
FlappyBird
:fire: Super Flappy Bird in p5.js
Kai-Wei-Chang-Talks
Slides from talks I have given
seamless_communication_emo
Foundational Models for State-of-the-Art Speech and Text Translation
speech-language-model
A collection of papers related to speech language models
TensorFlowTTS
:stuck_out_tongue_closed_eyes: TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for TensorFlow 2 (supports English, Korean, and Chinese, and is easy to adapt to other languages)
awesome-llm-role-playing-with-persona
Awesome-llm-role-playing-with-persona: a curated list of resources for large language models for role-playing with assigned personas
Codec-SUPERB
Audio Codec Speech processing Universal PERformance Benchmark