Tsai Meng-Ting (mengting7tw)

mengting7tw

Geek Repo

Location:Taipei, Taiwan

Twitter:@mttsai_

Github PK Tool:Github PK Tool

Tsai Meng-Ting's starred repositories

gradio

Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!

Language:PythonLicense:Apache-2.0Stargazers:31362Issues:166Issues:4560

llamafile

Distribute and run LLMs with a single file.

Language:C++License:NOASSERTIONStargazers:18074Issues:167Issues:383

ml-visuals

🎨 ML Visuals contains figures and templates which you can reuse and customize to improve your scientific writing.

approachingalmost

Approaching (Almost) Any Machine Learning Problem

Language:Rich Text FormatLicense:MITStargazers:6369Issues:64Issues:137

espeak-ng

eSpeak NG is an open source speech synthesizer that supports more than hundred languages and accents.

Language:CLicense:GPL-3.0Stargazers:3962Issues:102Issues:994

distil-whisper

Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.

Language:PythonLicense:MITStargazers:3390Issues:65Issues:97

Resemblyzer

A python package to analyze and compare voices with deep learning

Language:PythonLicense:Apache-2.0Stargazers:2696Issues:73Issues:80

chatgpt-prompts-for-academic-writing

This list of writing prompts covers a range of topics and tasks, including brainstorming research ideas, improving language and style, conducting literature reviews, and developing research plans.

whisper-vits-svc

Core Engine of Singing Voice Conversion & Singing Voice Clone

Language:PythonLicense:MITStargazers:2578Issues:28Issues:163

RealtimeTTS

Converts text to speech in realtime

40-questions

Questions that I ask myself at the end of each year and each decade.

Python-Wrapper-for-World-Vocoder

A Python wrapper for the high-quality vocoder "World"

Language:CythonLicense:MITStargazers:717Issues:26Issues:57

FreeVC

FreeVC: Towards High-Quality Text-Free One-Shot Voice Conversion

Language:PythonLicense:MITStargazers:567Issues:19Issues:84

Matcha-TTS

[ICASSP 2024] 🍵 Matcha-TTS: A fast TTS architecture with conditional flow matching

Language:Jupyter NotebookLicense:MITStargazers:533Issues:15Issues:52

UniAudio

The Open Source Code of UniAudio

Large-Audio-Models

Keep track of big models in audio domain, including speech, singing, music etc.

mustango

Mustango: Toward Controllable Text-to-Music Generation

Language:PythonLicense:MITStargazers:312Issues:15Issues:11

Pengi

An Audio Language model for Audio Tasks

Language:PythonLicense:MITStargazers:272Issues:14Issues:13

dvector

Speaker embedding (d-vector) trained with GE2E loss

SpeechMOS

Easy-to-Use Speech MOS predictors

Language:PythonLicense:MITStargazers:189Issues:7Issues:15

UTMOS22

UT-Sarulab MOS prediction system using SSL models

Language:PythonLicense:MITStargazers:151Issues:7Issues:10
Language:PythonLicense:NOASSERTIONStargazers:107Issues:8Issues:348

AQUA-Tk

AQUA-Tk = Audio QUality Assessment-Toolkit. (In development)

Language:PythonLicense:GPL-3.0Stargazers:91Issues:3Issues:3
Language:PythonLicense:BSD-3-ClauseStargazers:65Issues:5Issues:9

Matcha-TTS-2

E2E TTS using Conditional Flow Matching (Experimental*)

Language:Jupyter NotebookLicense:MITStargazers:59Issues:10Issues:3
Language:PythonLicense:Apache-2.0Stargazers:25Issues:0Issues:0

naplib-python

Tools and functions for neural data processing and analysis in python

Language:PythonLicense:MITStargazers:19Issues:4Issues:51
Language:PythonStargazers:1Issues:1Issues:0