ga642381

Kai-Wei Chang (張凱爲)'s repositories

speech-trident

Awesome speech/audio LLMs, representation learning, and codec models

864 50 3

ML2021-Spring

**Official** 李宏毅 (Hung-yi Lee) 機器學習 Machine Learning 2021 Spring

Language:Jupyter Notebook860 240

Speech-Prompts-Adapters

This Repository surveys the paper focusing on Prompting and Adapters for Speech Processing.

107 10 2

SpeechPrompt

**Interspeech 2022** 《SpeechPrompt: An Exploration of Prompt Tuning on Generative Spoken Language Model for Speech Processing Tasks》Speech processing with prompting paradigm

Language:Python98 6 3

FastSpeech2

Multi-Speaker Pytorch FastSpeech2: Fast and High-Quality End-to-End Text to Speech :fist:

Language:Python95 8 6

SpeechPrompt-v2

《SpeechPrompt v2: Prompt Tuning for Speech Classification Tasks》Speech processing with prompting paradigm

Language:Python81 6 6

SpeechGen

《SpeechGen: Unlocking the Generative Power of Speech Language Models with Prompts》

74 8 2

Taiwanese-Whisper

fine-tune Whipser model for Taiwanese speech recognition

Language:Python28 6 1

RobustVC

**ICASSP 2022** 《Toward Degradation-Robust Voice Conversion》Using speech enhancement and end-to-end denoising training to improve degradation / adversarial robustness of VC models.

Language:PythonMIT23 40

AudioCodec-Hub

AudioCodec-Hub is a Python library for encoding and decoding audio data, supporting various neural audio codec models

Language:PythonMIT22 30

Taiwanese-Speech-Synthesis

Taiwanese Speech Synthesis with Tacotron2

Language:PythonMIT19 3 1

Taiwanese-Translation

Taiwanese Translation with BERT based model and RNN. Collection of Taiwanese text corpus

Language:Python11 20

FlappyBird

:fire: Super Flappy Bird in p5.js

Language:JavaScript9 20

moth

虫我研所 Moth Institute 新一代設計展 https://ga642381.github.io/moth

8 20

TaiwaneseTTS

Language:Python8 2 1

Kai-Wei-Chang-Talks

A repository sharing slides of the talks I gave

6 20

FinanceWeb

Language:JavaScript5 10

CA2021-Final

Language:Jupyter Notebook2 20

neurips2021-sas-react

Language:JavaScriptMIT2 20

S2VC

Language:Python2 10

seamless_communication_emo

Foundational Models for State-of-the-Art Speech and Text Translation

Language:Jupyter NotebookNOASSERTION2 10

s3prl

Self-Supervised Speech Pre-training and Representation Learning Toolkit.

Language:PythonMIT1 10

speech-language-model

A collection of papers related to speech language models

1 10

speech_quality

Language:Jupyter Notebook1 20

TensorFlowTTS

:stuck_out_tongue_closed_eyes: TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for Tensorflow 2 (supported including English, Korean, Chinese and Easy to adapt for other languages)

Language:PythonApache-2.01 10

AudioDec

An Open-source Streaming High-fidelity Neural Audio Codec

Language:PythonNOASSERTION010

awesome-llm-role-playing-with-persona

Awesome-llm-role-playing-with-persona: a curated list of resources for large language models for role-playing with assigned personas

000

Codec-SUPERB

Audio Codec Speech processing Universal PERformance Benchmark

Language:Python000

Linguistics-111

Language:Jupyter Notebook020

vision

Datasets, Transforms and Models specific to Computer Vision

Language:PythonBSD-3-Clause010