Beast code in Giters

FunASR

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, supporting Speech Recognition, Voice Activity Detection, Text Post-processing, etc.

Language: Python | License: NOASSERTION

Information-Extraction-Chinese

Chinese Named Entity Recognition with IDCNN/biLSTM+CRF, and Relation Extraction with biGRU+2ATT


LLaSM

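The first open-source, commercially usable dialogue model to support bilingual Chinese-English speech-text multimodal conversation. Convenient speech input greatly improves the experience of using large models that take text as input, while avoiding the cumbersome pipeline of ASR-based solutions and the errors they may introduce.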

License: Apache-2.0

MARS5-TTS

MARS5 speech model (TTS) from CAMB.AI

License: AGPL-3.0

MOSNet-pytorch

A PyTorch implementation of MOSNet

License: NOASSERTION

MOSNettf

Implementation of "MOSNet: Deep Learning based Objective Assessment for Voice Conversion"

Language: Python | License: NOASSERTION

Multimodal-Emotion-Recognition

This repository contains the code for the paper `End-to-End Multimodal Emotion Recognition using Deep Neural Networks`.

License: BSD-3-Clause

PaddleSpeech

Easy-to-use Speech Toolkit including SOTA/Streaming ASR with punctuation, influential TTS with text frontend, Speaker Verification System and End-to-End Speech Simultaneous Translation.

Language: Python | License: Apache-2.0

Robust_Fine_Grained_Prosody_Control

PyTorch implementation of "Robust and fine-grained prosody control of end-to-end speech synthesis"

License: BSD-3-Clause

SenseVoice

Multilingual Voice Understanding Model

Language: Python | License: NOASSERTION

TensorFlowTTS

TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for TensorFlow 2 (supported languages include English, Korean, and Chinese)

Language: Jupyter Notebook | License: Apache-2.0

VoiceprintRecognition-Pytorch

This project uses several advanced voiceprint recognition models, such as EcapaTdnn, ResNetSE, ERes2Net, and CAM++; more models may be supported in the future. It also supports the MelSpectrogram and Spectrogram data preprocessing methods.

License: Apache-2.0

wenet

Production First and Production Ready End-to-End Speech Recognition Toolkit

Language: C++ | License: Apache-2.0

yxfy

yxfy's repositories

AdaSpeech

AuxFormer

bark

ChatGLM-6B

ChatTTS

CosyVoice

espnet

FastSpeech

FastSpeech2

FunASR

Information-Extraction-Chinese

LLaSM

MARS5-TTS

MOSNet-pytorch

MOSNettf

Multimodal-Emotion-Recognition

NeuralSpeech

PaddleSpeech

Robust_Fine_Grained_Prosody_Control

seed-tts-eval

SenseVoice

SoundLabel

speech

TeleSpeech-ASR

TensorFlowTTS

VoiceprintRecognition-Pytorch

wenet