yxfy

yxfy

Geek Repo

0

followers

0

following

Github PK Tool:Github PK Tool

yxfy's repositories

Language:PythonStargazers:0Issues:0Issues:0

SenseVoice

Multilingual Voice Understanding Model

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

FunASR

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

License:NOASSERTIONStargazers:0Issues:0Issues:0

CosyVoice

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

License:Apache-2.0Stargazers:0Issues:0Issues:0

MARS5-TTS

MARS5 speech model (TTS) from CAMB.AI

License:AGPL-3.0Stargazers:0Issues:0Issues:0

ChatTTS

ChatTTS is a generative speech model for daily dialogue.

License:NOASSERTIONStargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0
License:MITStargazers:0Issues:0Issues:0

VoiceprintRecognition-Pytorch

This project uses a variety of advanced voiceprint recognition models such as EcapaTdnn, ResNetSE, ERes2Net, CAM++, etc. It is not excluded that more models will be supported in the future. At the same time, this project also supports MelSpectrogram, Spectrogram data preprocessing methods

License:Apache-2.0Stargazers:0Issues:0Issues:0

bark

🔊 Text-Prompted Generative Audio Model

License:MITStargazers:0Issues:0Issues:0

LLaSM

第一个支持中英文双语语音-文本多模态对话的开源可商用对话模型。便捷的语音输入将大幅改善以文本为输入的大模型的使用体验,同时避免了基于 ASR 解决方案的繁琐流程以及可能引入的错误。

License:Apache-2.0Stargazers:0Issues:0Issues:0

wenet

Production First and Production Ready End-to-End Speech Recognition Toolkit

License:Apache-2.0Stargazers:0Issues:0Issues:0

ChatGLM-6B

ChatGLM-6B: An Open Bilingual Dialogue Language Model | 开源双语对话语言模型

License:Apache-2.0Stargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0

AuxFormer

AuxFormer: Robust Approach to Audiovisual Emotion Recognition

License:MITStargazers:0Issues:0Issues:0

SoundLabel

语音数据集制作标记工具

Stargazers:0Issues:0Issues:0

PaddleSpeech

Easy-to-use Speech Toolkit including SOTA/Streaming ASR with punctuation, influential TTS with text frontend, Speaker Verification System and End-to-End Speech Simultaneous Translation.

License:Apache-2.0Stargazers:0Issues:0Issues:0

espnet

End-to-End Speech Processing Toolkit

License:Apache-2.0Stargazers:0Issues:0Issues:0

AdaSpeech

An implementation of Microsoft's "AdaSpeech: Adaptive Text to Speech for Custom Voice"

Stargazers:0Issues:0Issues:0

MOSNettf

Implementation of "MOSNet: Deep Learning based Objective Assessment for Voice Conversion"

License:NOASSERTIONStargazers:0Issues:0Issues:0

MOSNet-pytorch

The pytorch implement of MOSNet

License:NOASSERTIONStargazers:0Issues:0Issues:0

Robust_Fine_Grained_Prosody_Control

PyTorch Implementation of Robust and fine-grained prosody control of end-to-end speech synthesis

License:BSD-3-ClauseStargazers:0Issues:0Issues:0

Multimodal-Emotion-Recognition

This repository contains the code for the paper `End-to-End Multimodal Emotion Recognition using Deep Neural Networks`.

License:BSD-3-ClauseStargazers:0Issues:0Issues:0

FastSpeech2

An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"

License:MITStargazers:0Issues:0Issues:0

Information-Extraction-Chinese

Chinese Named Entity Recognition with IDCNN/biLSTM+CRF, and Relation Extraction with biGRU+2ATT 中文实体识别与关系提取

Stargazers:0Issues:0Issues:0

FastSpeech

The Implementation of FastSpeech based on pytorch.

Stargazers:0Issues:0Issues:0

TensorFlowTTS

:stuck_out_tongue_closed_eyes: TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for Tensorflow 2 (supported including English, Korean, Chinese)

License:Apache-2.0Stargazers:0Issues:0Issues:0