semicarryispig

semicarryispig's starred repositories

downkyi

哔哩下载姬downkyi，哔哩哔哩网站视频下载工具，支持批量下载，支持8K、HDR、杜比视界，提供工具箱（音视频提取、去水印等）。

Language:C#GPL-3.02045100

ChatTTS_colab

🚀 一键部署（含离线整合包）！基于 ChatTTS ，支持流式输出、音色抽卡、长音频生成和分角色朗读。简单易用，无需复杂安装。

Language:Python182500

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.

Language:PythonApache-2.01082300

audio-SNR

Mixing an audio file with a noise file at any Signal-to-Noise Ratio (SNR)

Language:Python21300

SpeechMOS

Easy-to-Use Speech MOS predictors

Language:PythonMIT20100

DeepFilterNet

Noise supression using deep filtering

Language:PythonNOASSERTION231100

xiaoyuzhoufmdownload

下载小宇宙播客中的音频

Language:Python1700

AcademiCodec

AcademiCodec: An Open Source Audio Codec Model for Academic Research

Language:Python55300

chinese_speech_pretrain

chinese speech pretrained models

Language:Shell99500

brouhaha-vad

Predicts the level of noise and reverberation on your audiofiles

Language:Jupyter NotebookMIT13000

parler-tts

Inference and training library for high-quality TTS models.

Language:PythonApache-2.0396300

g2p-kd

Token-Level Ensemble Distillation for Grapheme-to-Phoneme Conversion

Language:PythonNOASSERTION2000

mlm-scoring

Python library & examples for Masked Language Model Scoring (ACL 2020)

Language:PythonApache-2.033200

NISQA

NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment

Language:PythonMIT64600

demucs

Code for the paper Hybrid Spectrogram and Waveform Source Separation

Language:PythonMIT79500

demucs

Code for the paper Hybrid Spectrogram and Waveform Source Separation

Language:PythonMIT807800

hume-python-sdk

Python client for Hume AI APIs

Language:PythonMIT7200

Amphion

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.

Language:PythonMIT440300

semicarryispig

semicarryispig's starred repositories

downkyi

asvspoof5

seed-tts-eval

ChatTTS_colab

PaddleSpeech

audio-SNR

SpeechMOS

DeepFilterNet

xiaoyuzhoufmdownload

AcademiCodec

chinese_speech_pretrain

brouhaha-vad

parler-tts

g2p-kd

mlm-scoring

NISQA

demucs

demucs

hume-python-sdk

Amphion

Montreal-Forced-Aligner

aeneas

g2p-zh-en

spear-tts-pytorch

MediaCrawler

you-get

seamless_communication

metavoice-src

laughter-detection

SpeechT5