semicarryispig

semicarryispig

Geek Repo

Github PK Tool:Github PK Tool

semicarryispig's starred repositories

downkyi

哔哩下载姬downkyi,哔哩哔哩网站视频下载工具,支持批量下载,支持8K、HDR、杜比视界,提供工具箱(音视频提取、去水印等)。

Language:C#License:GPL-3.0Stargazers:20451Issues:0Issues:0
Language:PythonStargazers:26Issues:0Issues:0
Language:PythonStargazers:893Issues:0Issues:0

ChatTTS_colab

🚀 一键部署(含离线整合包)!基于 ChatTTS ,支持流式输出、音色抽卡、长音频生成和分角色朗读。简单易用,无需复杂安装。

Language:PythonStargazers:1825Issues:0Issues:0

PaddleSpeech

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.

Language:PythonLicense:Apache-2.0Stargazers:10823Issues:0Issues:0

audio-SNR

Mixing an audio file with a noise file at any Signal-to-Noise Ratio (SNR)

Language:PythonStargazers:213Issues:0Issues:0

SpeechMOS

Easy-to-Use Speech MOS predictors

Language:PythonLicense:MITStargazers:201Issues:0Issues:0

DeepFilterNet

Noise supression using deep filtering

Language:PythonLicense:NOASSERTIONStargazers:2311Issues:0Issues:0

xiaoyuzhoufmdownload

下载小宇宙播客中的音频

Language:PythonStargazers:17Issues:0Issues:0

AcademiCodec

AcademiCodec: An Open Source Audio Codec Model for Academic Research

Language:PythonStargazers:553Issues:0Issues:0

chinese_speech_pretrain

chinese speech pretrained models

Language:ShellStargazers:995Issues:0Issues:0

brouhaha-vad

Predicts the level of noise and reverberation on your audiofiles

Language:Jupyter NotebookLicense:MITStargazers:130Issues:0Issues:0

parler-tts

Inference and training library for high-quality TTS models.

Language:PythonLicense:Apache-2.0Stargazers:3963Issues:0Issues:0

g2p-kd

Token-Level Ensemble Distillation for Grapheme-to-Phoneme Conversion

Language:PythonLicense:NOASSERTIONStargazers:20Issues:0Issues:0

mlm-scoring

Python library & examples for Masked Language Model Scoring (ACL 2020)

Language:PythonLicense:Apache-2.0Stargazers:332Issues:0Issues:0

NISQA

NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment

Language:PythonLicense:MITStargazers:646Issues:0Issues:0

demucs

Code for the paper Hybrid Spectrogram and Waveform Source Separation

Language:PythonLicense:MITStargazers:795Issues:0Issues:0

demucs

Code for the paper Hybrid Spectrogram and Waveform Source Separation

Language:PythonLicense:MITStargazers:8078Issues:0Issues:0

hume-python-sdk

Python client for Hume AI APIs

Language:PythonLicense:MITStargazers:72Issues:0Issues:0

Amphion

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.

Language:PythonLicense:MITStargazers:4403Issues:0Issues:0

Montreal-Forced-Aligner

Command line utility for forced alignment using Kaldi

Language:PythonLicense:MITStargazers:1277Issues:0Issues:0

aeneas

aeneas is a Python/C library and a set of tools to automagically synchronize audio and text (aka forced alignment)

Language:PythonLicense:AGPL-3.0Stargazers:2470Issues:0Issues:0

g2p-zh-en

Chinese and English Bilinguish G2P

Language:PythonLicense:NOASSERTIONStargazers:18Issues:0Issues:0

spear-tts-pytorch

Implementation of Spear-TTS - multi-speaker text-to-speech attention network, in Pytorch

Language:PythonLicense:MITStargazers:249Issues:0Issues:0

MediaCrawler

小红书笔记 | 评论爬虫、抖音视频 | 评论爬虫、快手视频 | 评论爬虫、B 站视频 | 评论爬虫、微博帖子 | 评论爬虫、百度贴吧帖子 | 百度贴吧评论回复爬虫

Language:PythonLicense:NOASSERTIONStargazers:16115Issues:0Issues:0

you-get

:arrow_double_down: Dumb downloader that scrapes the web

Language:PythonLicense:NOASSERTIONStargazers:49849Issues:0Issues:0

seamless_communication

Foundational Models for State-of-the-Art Speech and Text Translation

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:10693Issues:0Issues:0

metavoice-src

Foundational model for human-like, expressive TTS

Language:PythonLicense:Apache-2.0Stargazers:3665Issues:0Issues:0
Language:PythonLicense:MITStargazers:197Issues:0Issues:0

SpeechT5

Unified-Modal Speech-Text Pre-Training for Spoken Language Processing

Language:PythonLicense:MITStargazers:1139Issues:0Issues:0