hildazzz

hildazzz's starred repositories

Capital

A study of "Das Kapital" (Capital).

License: MIT | Stargazers: 44 | Issues: 0

nvitop

An interactive NVIDIA-GPU process viewer and beyond, the one-stop solution for GPU process management.

Language: Python | License: Apache-2.0 | Stargazers: 4680 | Issues: 0

Bert-VITS2-ext

Facial expression and animation testing based on Bert-VITS2.

Language: Python | License: AGPL-3.0 | Stargazers: 516 | Issues: 0

chn_text_norm

Chinese text normalization.

Language: Python | Stargazers: 48 | Issues: 0

python_speech_features

This library provides common speech features for ASR including MFCCs and filterbank energies.

Language: Python | License: MIT | Stargazers: 2367 | Issues: 0

AniTalker

[ACM MM 2024] This is the official code for "AniTalker: Animate Vivid and Diverse Talking Faces through Identity-Decoupled Facial Motion Encoding"

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 1389 | Issues: 0

zuko

Normalizing flows in PyTorch

Language: Python | License: MIT | Stargazers: 309 | Issues: 0

flow-matching

Annotated Flow Matching paper

Language: Jupyter Notebook | Stargazers: 96 | Issues: 0

nnAudio

Audio processing using PyTorch 1D convolution networks

Language: Python | License: MIT | Stargazers: 1013 | Issues: 0

torch-stft

An STFT/iSTFT for PyTorch.

Language: Python | License: BSD-3-Clause | Stargazers: 342 | Issues: 0

vocos

Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis

Language: Python | License: MIT | Stargazers: 771 | Issues: 0

silero-models

Silero Models: pre-trained speech-to-text, text-to-speech and text-enhancement models made embarrassingly simple

Language: Jupyter Notebook | License: NOASSERTION | Stargazers: 4898 | Issues: 0

piper

A fast, local neural text to speech system

Language: C++ | License: MIT | Stargazers: 5990 | Issues: 0

python-soundfile

SoundFile is an audio library based on libsndfile, CFFI, and NumPy

Language: Python | License: BSD-3-Clause | Stargazers: 701 | Issues: 0

adam-atan2-pytorch

Implementation of the proposed Adam-atan2 from Google Deepmind in Pytorch

Language: Python | License: MIT | Stargazers: 90 | Issues: 0

CosyVoice_For_Windows

A version of CosyVoice for use on Windows.

Language: Python | License: Apache-2.0 | Stargazers: 389 | Issues: 0

sap-voicebox

Speech Processing Toolbox for MATLAB

Language: MATLAB | Stargazers: 235 | Issues: 0

RUCM3ED

M3ED: Multi-modal Multi-scene Multi-label Emotional Dialogue Database. ACL 2022

Stargazers: 87 | Issues: 0

Language: Python | Stargazers: 131 | Issues: 0

persona-hub

Official repo for the paper "Scaling Synthetic Data Creation with 1,000,000,000 Personas"

Language: Python | Stargazers: 806 | Issues: 0

ScriptsForVoxBlink2

Official Repository For VoxBlink2

Language: Python | License: NOASSERTION | Stargazers: 43 | Issues: 0

modelscope

ModelScope: bring the notion of Model-as-a-Service to life.

Language: Python | License: Apache-2.0 | Stargazers: 6863 | Issues: 0

best-rq-pytorch

Implementation of BEST-RQ - a model for self-supervised learning of speech signals using a random projection quantizer, in Pytorch.

Language: Python | License: MIT | Stargazers: 83 | Issues: 0

BEST-RQ

Implementation of the paper "Self-supervised Learning with Random-projection Quantizer for Speech Recognition" in Pytorch.

Language: Python | License: Apache-2.0 | Stargazers: 57 | Issues: 0

mamba

Mamba SSM architecture

Language: Python | License: Apache-2.0 | Stargazers: 12720 | Issues: 0

ChatTTS_colab

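🚀 One-click deployment (offline bundle included)! Based on ChatTTS, with support for streaming output, random voice-timbre draws, long-audio generation, and multi-role reading. Easy to use, with no complicated installation.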

Language: Python | Stargazers: 1957 | Issues: 0

I_am_a_person

A real-time interactive GPT digital human.

Language: Python | License: Apache-2.0 | Stargazers: 206 | Issues: 0

EmoBox

[INTERSPEECH 2024] EmoBox: Multilingual Multi-corpus Speech Emotion Recognition Toolkit and Benchmark

Language: Python | Stargazers: 126 | Issues: 0

CosyVoice

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

Language: Python | License: Apache-2.0 | Stargazers: 5214 | Issues: 0

SenseVoice

Multilingual Voice Understanding Model

Language: Python | License: NOASSERTION | Stargazers: 2811 | Issues: 0