润心's starred repositories

Language:PythonStargazers:195Issues:0Issues:0

MARS5-TTS

MARS5 speech model (TTS) from CAMB.AI

Language:Jupyter NotebookLicense:AGPL-3.0Stargazers:2360Issues:0Issues:0

v2rayA

A web GUI client of Project V which supports VMess, VLESS, SS, SSR, Trojan, Tuic and Juicity protocols. 🚀

Language:GoLicense:AGPL-3.0Stargazers:10702Issues:0Issues:0

CosyVoice

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

Language:PythonLicense:Apache-2.0Stargazers:4034Issues:0Issues:0

vosk-api

Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:7587Issues:0Issues:0

slidev-theme-frankfurt

A theme for Slidev, inspired by the Frankfurt theme in Beamer.

Language:VueStargazers:18Issues:0Issues:0

yutto

:ice_cube: 一个可爱且任性的 B 站视频下载器(bilili V2)

Language:PythonLicense:GPL-3.0Stargazers:938Issues:0Issues:0

audio-diffusion-pytorch

Audio generation using diffusion models, in PyTorch.

Language:PythonLicense:MITStargazers:1897Issues:0Issues:0

media-get

Get the media through the url

Language:GoLicense:Apache-2.0Stargazers:251Issues:0Issues:0

JianZiPu

A font for writing Guqin music in JianZiPu.

Language:JavaScriptLicense:MITStargazers:14Issues:0Issues:0

FontDiffuser

[AAAI2024] FontDiffuser: One-Shot Font Generation via Denoising Diffusion with Multi-Scale Content Aggregation and Style Contrastive Learning

Language:PythonStargazers:248Issues:0Issues:0
Language:PythonStargazers:97Issues:0Issues:0

SDT

This repository is the official implementation of Disentangling Writer and Character Styles for Handwriting Generation (CVPR23).

Language:PythonLicense:MITStargazers:954Issues:0Issues:0

Python-Wrapper-for-World-Vocoder

A Python wrapper for the high-quality vocoder "World"

Language:CythonLicense:MITStargazers:717Issues:0Issues:0

World

A high-quality speech analysis, manipulation and synthesis system

Language:C++License:NOASSERTIONStargazers:1161Issues:0Issues:0

pits

PITS: Variational Pitch Inference for End-to-end Pitch-controllable TTS without External Pitch Predictor

Language:PythonLicense:MITStargazers:273Issues:0Issues:0

WenetSpeech

A 10000+ hours dataset for Chinese speech recognition

Language:ShellLicense:Apache-2.0Stargazers:486Issues:0Issues:0

BELLE

BELLE: Be Everyone's Large Language model Engine(开源中文对话大模型)

Language:HTMLLicense:Apache-2.0Stargazers:7779Issues:0Issues:0

encodec

State-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 kHz audio.

Language:PythonLicense:MITStargazers:3370Issues:0Issues:0

fish-speech

Brand new TTS solution

Language:PythonLicense:NOASSERTIONStargazers:7187Issues:0Issues:0
Language:PythonStargazers:8Issues:0Issues:0

eindex

Multidimensional indexing for tensors

Language:Jupyter NotebookStargazers:107Issues:0Issues:0

einops

Flexible and powerful tensor operations for readable and reliable code (for pytorch, jax, TF and others)

Language:PythonLicense:MITStargazers:8296Issues:0Issues:0

torchcrepe

Pytorch implementation of the CREPE pitch tracker

Language:PythonLicense:MITStargazers:394Issues:0Issues:0

ppgs

High-Fidelity Neural Phonetic Posteriorgrams

Language:PythonLicense:MITStargazers:69Issues:0Issues:0

OCR_DataSet

收集并整理有关OCR的数据集并统一标注格式,以便实验需要

Language:PythonStargazers:858Issues:0Issues:0

librime-lua

Extending RIME with Lua scripts

Language:C++License:BSD-3-ClauseStargazers:297Issues:0Issues:0
Language:ShellStargazers:18Issues:0Issues:0

css10

CSS10: A Collection of Single Speaker Speech Datasets for 10 Languages

Language:HTMLLicense:Apache-2.0Stargazers:456Issues:0Issues:0

zm-text-tts

[IJCAI'23] Learning to Speak from Text for Low-Resource TTS

Language:PythonLicense:Apache-2.0Stargazers:63Issues:0Issues:0