splinter21's repositories

BcutASR

Reverse-engineered speech recognition API for Bcut (必剪)

Language: Go · Stargazers: 0 · Issues: 0

codec-bpe

Implementation of Acoustic BPE (Shen et al., 2024), extended for RVQ-based Neural Audio Codecs

License: MIT · Stargazers: 0 · Issues: 0

contentvec

speech self-supervised representations

Language: Python · License: MIT · Stargazers: 0 · Issues: 0

ControlNetPlus

ControlNet++: All-in-one ControlNet for image generation and editing!

License: Apache-2.0 · Stargazers: 0 · Issues: 0

e2-tts-pytorch

Implementation of E2-TTS, "Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS", in PyTorch

License: MIT · Stargazers: 0 · Issues: 0

FasterLivePortrait

Bring portraits to life in real time! ONNX/TensorRT support!

Stargazers: 0 · Issues: 0

fftw3

DO NOT CHECK OUT THESE FILES FROM GITHUB UNLESS YOU KNOW WHAT YOU ARE DOING. (See below.)

License: GPL-2.0 · Stargazers: 0 · Issues: 0

inferStreamHiFiGAN

StreamHiFiGAN offers a HiFiGAN vocoder model optimized for streaming inference, providing real-time audio synthesis capabilities.

Stargazers: 0 · Issues: 0

Kolors

Kolors Team

License: Apache-2.0 · Stargazers: 0 · Issues: 0

models

All my self-trained and released AI upscaling models. After gathering and applying over 600 different upscaling models, I learned how to train my own, and these are the results.

Stargazers: 0 · Issues: 0

Music-Source-Separation-Training

Repository for training models for music source separation.

Language: Python · Stargazers: 0 · Issues: 0

noise-reduction

noise reduction

Stargazers: 0 · Issues: 0

Paints-UNDO

Understand Human Behavior to Align True Needs

License: Apache-2.0 · Stargazers: 0 · Issues: 0

phrex

Phrex is a PyTorch model for inferring speaker-independent embeddings and pitch from speech audio spectrograms.

License: MIT · Stargazers: 0 · Issues: 0

promonet

Prosody and Pronunciation Modification Network

License: MIT · Stargazers: 0 · Issues: 0

SenseVoice-python

SenseVoice with ONNX Runtime

Stargazers: 0 · Issues: 0

SOFA

SOFA: Singing-Oriented Forced Aligner

License: MIT · Stargazers: 0 · Issues: 0

SpeechDenoiser

SpeechDenoiser: Real-Time Speech Denoising with ONNX. Welcome to SpeechDenoiser, a simple and effective solution for real-time speech denoising using an ONNX model. This repository contains everything you need to get started with enhancing audio quality by reducing noise, making it well suited to improving voice recordings and live communication.

Stargazers: 0 · Issues: 0

split-lang

✨ Split text by language (i18n) powered by wtpsplit and langdetect (fasttext) [e.g. 你喜欢看アニメ吗 -> 你喜欢看 | アニメ | 吗]

License: MIT · Stargazers: 0 · Issues: 0

vampnet

Music generation with masked transformers!

Language: Jupyter Notebook · Stargazers: 0 · Issues: 0

vs_deepdeinterlace

AI deinterlacing functions for VapourSynth

Stargazers: 0 · Issues: 0

wetext

Python runtime for WeTextProcessing (does not depend on Pynini)

License: Apache-2.0 · Stargazers: 0 · Issues: 0

YOLO-Stutter

YOLO-Stutter: End-to-end Region-Wise Speech Dysfluency Detection

License: MIT · Stargazers: 0 · Issues: 0