Kai-Wei Chang (張凱爲) (ga642381)

ga642381

Geek Repo

Company:National Taiwan University (NTU)

Location:Taipei, Taiwan

Home Page:kwchang.org

Github PK Tool:Github PK Tool

Kai-Wei Chang (張凱爲)'s repositories

ML2021-Spring

**Official** 李宏毅 (Hung-yi Lee) 機器學習 Machine Learning 2021 Spring

Language:Jupyter NotebookStargazers:771Issues:23Issues:0

speech-trident

Awesome speech/audio LLMs, representation learning, and codec models

SpeechPrompt

**Interspeech 2022** 《SpeechPrompt: An Exploration of Prompt Tuning on Generative Spoken Language Model for Speech Processing Tasks》Speech processing with prompting paradigm

Speech-Prompts-Adapters

This Repository surveys the paper focusing on Prompting and Adapters for Speech Processing.

FastSpeech2

Multi-Speaker Pytorch FastSpeech2: Fast and High-Quality End-to-End Text to Speech :fist:

SpeechPrompt-v2

《SpeechPrompt v2: Prompt Tuning for Speech Classification Tasks》Speech processing with prompting paradigm

SpeechGen

《SpeechGen: Unlocking the Generative Power of Speech Language Models with Prompts》

RobustVC

**ICASSP 2022** 《Toward Degradation-Robust Voice Conversion》Using speech enhancement and end-to-end denoising training to improve degradation / adversarial robustness of VC models.

Language:PythonLicense:MITStargazers:24Issues:4Issues:0

Taiwanese-Whisper

fine-tune Whipser model for Taiwanese speech recognition

AudioCodec-Hub

AudioCodec-Hub is a Python library for encoding and decoding audio data, supporting various neural audio codec models

Language:PythonLicense:MITStargazers:19Issues:1Issues:0

Taiwanese-Speech-Synthesis

Taiwanese Speech Synthesis with Tacotron2

Language:PythonLicense:MITStargazers:18Issues:3Issues:1

Taiwanese-Translation

Taiwanese Translation with BERT based model and RNN. Collection of Taiwanese text corpus

Language:PythonStargazers:11Issues:3Issues:0

FlappyBird

:fire: Super Flappy Bird in p5.js

Language:JavaScriptStargazers:9Issues:2Issues:0

moth

虫我研所 Moth Institute 新一代設計展 https://ga642381.github.io/moth

Language:PythonStargazers:8Issues:0Issues:0
Language:JavaScriptStargazers:5Issues:1Issues:0

Kai-Wei-Chang-Talks

A repository sharing slides of the talks I gave

Deep-Q-learning

Playing Atari game (breakout) with deep reinforcement learning

Language:PythonStargazers:3Issues:0Issues:0
Language:Jupyter NotebookStargazers:2Issues:0Issues:0
Language:JavaScriptLicense:MITStargazers:2Issues:2Issues:0
Language:PythonStargazers:2Issues:0Issues:0

seamless_communication_emo

Foundational Models for State-of-the-Art Speech and Text Translation

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:2Issues:1Issues:0

s3prl

Self-Supervised Speech Pre-training and Representation Learning Toolkit.

Language:PythonLicense:MITStargazers:1Issues:0Issues:0

speech-language-model

A collection of papers related to speech language models

Language:Jupyter NotebookStargazers:1Issues:0Issues:0

TensorFlowTTS

:stuck_out_tongue_closed_eyes: TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for Tensorflow 2 (supported including English, Korean, Chinese and Easy to adapt for other languages)

Language:PythonLicense:Apache-2.0Stargazers:1Issues:1Issues:0

AudioDec

An Open-source Streaming High-fidelity Neural Audio Codec

Language:PythonLicense:NOASSERTIONStargazers:0Issues:1Issues:0

Codec-SUPERB

Audio Codec Speech processing Universal PERformance Benchmark

Stargazers:0Issues:0Issues:0
Language:Jupyter NotebookStargazers:0Issues:0Issues:0

vision

Datasets, Transforms and Models specific to Computer Vision

License:BSD-3-ClauseStargazers:0Issues:0Issues:0