Tsai Meng-Ting (mengting7tw)

mengting7tw

Geek Repo

Location:Taipei, Taiwan

Twitter:@mttsai_

Github PK Tool:Github PK Tool

Tsai Meng-Ting's starred repositories

gradio

Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!

Language:PythonLicense:Apache-2.0Stargazers:31360Issues:166Issues:4560

llamafile

Distribute and run LLMs with a single file.

Language:C++License:NOASSERTIONStargazers:18074Issues:167Issues:383

ml-visuals

🎨 ML Visuals contains figures and templates which you can reuse and customize to improve your scientific writing.

Language:PythonLicense:NOASSERTIONStargazers:8241Issues:153Issues:0

approachingalmost

Approaching (Almost) Any Machine Learning Problem

Language:Rich Text FormatLicense:MITStargazers:6369Issues:64Issues:137

distil-whisper

Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.

Language:PythonLicense:MITStargazers:3390Issues:65Issues:97

chatgpt-prompts-for-academic-writing

This list of writing prompts covers a range of topics and tasks, including brainstorming research ideas, improving language and style, conducting literature reviews, and developing research plans.

audio2photoreal

Code and dataset for photorealistic Codec Avatars driven from audio

Language:PythonLicense:NOASSERTIONStargazers:2642Issues:30Issues:52

so-vits-svc-5.0

Core Engine of Singing Voice Conversion & Singing Voice Clone

Language:PythonLicense:MITStargazers:2491Issues:29Issues:159

RealtimeTTS

Converts text to speech in realtime

Language:PythonLicense:Apache-2.0Stargazers:856Issues:9Issues:0

40-questions

Questions that I ask myself at the end of each year and each decade.

FreeVC

FreeVC: Towards High-Quality Text-Free One-Shot Voice Conversion

Language:PythonLicense:MITStargazers:567Issues:19Issues:84

UniAudio

The Open Source Code of UniAudio

mirdata

Python library for working with Music Information Retrieval datasets

Language:PythonLicense:BSD-3-ClauseStargazers:356Issues:14Issues:307

mustango

Mustango: Toward Controllable Text-to-Music Generation

Language:PythonLicense:MITStargazers:312Issues:15Issues:11

Pengi

An Audio Language model for Audio Tasks

Language:PythonLicense:MITStargazers:272Issues:14Issues:13

dvector

Speaker embedding (d-vector) trained with GE2E loss

SpeechMOS

Easy-to-Use Speech MOS predictors

Language:PythonLicense:MITStargazers:189Issues:7Issues:15

UTMOS22

UT-Sarulab MOS prediction system using SSL models

Language:PythonLicense:MITStargazers:151Issues:7Issues:10

SC_VALL-E

Style-Controllable Zero-Shot Text to Speech Synthesizer based on VALL-E

Language:PythonLicense:MITStargazers:133Issues:7Issues:1
Language:PythonLicense:NOASSERTIONStargazers:107Issues:8Issues:348

AQUA-Tk

AQUA-Tk = Audio QUality Assessment-Toolkit. (In development)

Language:PythonLicense:GPL-3.0Stargazers:91Issues:3Issues:3

Matcha-TTS-2

E2E TTS using Conditional Flow Matching (Experimental*)

Language:Jupyter NotebookLicense:MITStargazers:59Issues:10Issues:3
Language:PythonLicense:Apache-2.0Stargazers:25Issues:0Issues:0

naplib-python

Tools and functions for neural data processing and analysis in python

Language:PythonLicense:MITStargazers:19Issues:4Issues:51
Language:PythonStargazers:1Issues:1Issues:0