isaac (juntengzhang)

0 followers, 0 following

isaac's starred repositories

CosyVoice

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

Language: Python | License: Apache-2.0 | Stargazers: 3950 | Issues: 0

vocos

Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis

Language: Python | License: MIT | Stargazers: 735 | Issues: 0

ChatTTS

A generative speech model for daily dialogue.

Language: Python | License: AGPL-3.0 | Stargazers: 29384 | Issues: 0

webdataset

A high-performance Python-based I/O system for large (and small) deep learning problems, with strong support for PyTorch.

Language: Python | License: BSD-3-Clause | Stargazers: 2163 | Issues: 0

parler-tts

Inference and training library for high-quality TTS models.

Language: Python | License: Apache-2.0 | Stargazers: 3715 | Issues: 0

GPT-SoVITS

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Language: Python | License: MIT | Stargazers: 31187 | Issues: 0

dpm-solver

Official code for "DPM-Solver: A Fast ODE Solver for Diffusion Probabilistic Model Sampling in Around 10 Steps" (NeurIPS 2022 Oral)

Language: Python | License: MIT | Stargazers: 1488 | Issues: 0

MELD

MELD: A Multimodal Multi-Party Dataset for Emotion Recognition in Conversation

Language: Python | License: GPL-3.0 | Stargazers: 780 | Issues: 0

Language: Python | Stargazers: 123 | Issues: 0

Emu

Emu Series: Generative Multimodal Models from BAAI

Language: Python | License: Apache-2.0 | Stargazers: 1589 | Issues: 0

fish-speech

Brand new TTS solution

Language: Python | License: NOASSERTION | Stargazers: 7161 | Issues: 0

OpenVoice

Instant voice cloning by MyShell.

Language: Python | License: MIT | Stargazers: 27968 | Issues: 0

InsightFace_Pytorch

PyTorch 0.4.1 code for InsightFace

Language: Jupyter Notebook | License: MIT | Stargazers: 1725 | Issues: 0

TalkNet-ASD

ACM MM 2021: 'Is Someone Speaking? Exploring Long-term Temporal Features for Audio-visual Active Speaker Detection'

Language: Python | License: MIT | Stargazers: 290 | Issues: 0

StyleTTS2

StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models

Language: Python | License: MIT | Stargazers: 4586 | Issues: 0

HierSpeechpp

The official implementation of HierSpeech++

Language: Python | License: MIT | Stargazers: 1147 | Issues: 0

AudioLDM2

Text-to-Audio/Music Generation

Language: Python | License: NOASSERTION | Stargazers: 2188 | Issues: 0

Diffusion-Models-Papers-Survey-Taxonomy

Diffusion model papers, survey, and taxonomy

Stargazers: 2854 | Issues: 0

wenet

Production First and Production Ready End-to-End Speech Recognition Toolkit

Language: Python | License: Apache-2.0 | Stargazers: 4017 | Issues: 0

diffusion

Denoising Diffusion Probabilistic Models

Language: Python | Stargazers: 3566 | Issues: 0

AgentGPT

🤖 Assemble, configure, and deploy autonomous AI Agents in your browser.

Language: TypeScript | License: GPL-3.0 | Stargazers: 31125 | Issues: 0

agents

An Open-source Framework for Data-centric, Self-evolving Autonomous Language Agents

Language: Python | License: Apache-2.0 | Stargazers: 5099 | Issues: 0

diff-svc

Singing Voice Conversion via diffusion model

Language: Jupyter Notebook | License: AGPL-3.0 | Stargazers: 2613 | Issues: 0

DeepFilterNet

Noise suppression using deep filtering

Language: Python | License: NOASSERTION | Stargazers: 2277 | Issues: 0

DNS-Challenge

This repo contains the scripts, models, and required files for the Deep Noise Suppression (DNS) Challenge.

Language: Python | License: CC-BY-4.0 | Stargazers: 1045 | Issues: 0

Stargazers: 3 | Issues: 0

speech-rate-meter

The Speech Rate Meter (hereinafter SRM) software module is designed to measure a set of characteristics of the tempo (rate) of spoken speech.

Language: QML | License: MIT | Stargazers: 17 | Issues: 0

Language: Python | License: MIT | Stargazers: 427 | Issues: 0

VQ-Diffusion

Official implementation of VQ-Diffusion

Language: Python | License: MIT | Stargazers: 873 | Issues: 0