sangeet2020

Sangeet Sagar's starred repositories

Real-Time-Voice-Cloning

Clone a voice in 5 seconds to generate arbitrary speech in real-time

Language:PythonNOASSERTION52319 939 1080

speechbrain

A PyTorch-based Speech Toolkit

Language:PythonApache-2.08673 133 1085

Speech-to-text, text-to-speech, speaker recognition, and VAD using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Android, iOS, Raspberry Pi, RISC-V, x86_64 servers, websocket server/client, C/C++, Python, Kotlin, C#, Go, NodeJS, Java, Swift, Dart, JavaScript, Flutter, Object Pascal, Lazarus, Rust

Language:C++Apache-2.03252 51 488

whisper_streaming

Whisper realtime streaming for long speech-to-text transcription and translation

Language:PythonMIT1889 36 97

pyroomacoustics

Pyroomacoustics is a package for audio signal processing for indoor applications. It was developed as a fast prototyping platform for beamforming algorithms in indoor scenarios.

Language:PythonMIT1428 43 226

sherpa-ncnn

Real-time speech recognition and voice activity detection (VAD) using next-gen Kaldi with ncnn without Internet connection. Support iOS, Android, Linux, macOS, Windows, Raspberry Pi, VisionFive2, LicheePi4A etc.

Language:C++Apache-2.0997 36 144

punctuator2

A bidirectional recurrent neural network model with attention mechanism for restoring missing punctuation in unsegmented text

Language:PythonMIT658 28 79

ctc-segmentation

Segment an audio file and obtain utterance alignments. (Python package)

Language:PythonApache-2.0319 13 29

speech-emotion-recognition

Speaker independent emotion recognition

Language:PythonMIT315 17 34

deepsegment

A sentence segmenter that actually works!

Language:PythonGPL-3.0302 14 38

VBx

Variational Bayes HMM over x-vectors diarization

Language:Python251 21 63

brouhaha-vad

Predicts the level of noise and reverberation on your audiofiles

Language:Jupyter NotebookMIT138 10 16

deepspeare

Code for Deep-speare: a joint neural model of poetic language, meter and rhyme

Language:HTMLApache-2.070 5 3

wordwise

N-gram keyword extraction using spaCy and pretrained language models

Language:PythonMIT62 4 7

Text-Classification-CNN-PyTorch

The aim of this repository is to show a baseline model for text classification through convolutional neural networks in the PyTorch framework. The architecture implemented in this model was inspired by the one proposed in the paper: Convolutional Neural Networks for Sentence Classification.

Language:PythonMIT47 3 1

sangeet2020

Sangeet Sagar's starred repositories

Real-Time-Voice-Cloning

speechbrain

sherpa-onnx

whisper_streaming

pyroomacoustics

sherpa-ncnn

punctuator2

ctc-segmentation

speech-emotion-recognition

deepsegment

VBx

brouhaha-vad

deepspeare

wordwise

Text-Classification-CNN-PyTorch

BaySMM

WSJ2WAV

fast_matrix_multiplication

online-text-flow

benchmarks

speechbrain