Sangeet Sagar (sangeet2020)

sangeet2020

Geek Repo

Company:Universität des Saarlandes, UFAL Charles Uni

Location:Munich

Home Page:https://sangeet2020.github.io/

Github PK Tool:Github PK Tool

Sangeet Sagar's starred repositories

Real-Time-Voice-Cloning

Clone a voice in 5 seconds to generate arbitrary speech in real-time

Language:PythonLicense:NOASSERTIONStargazers:52319Issues:939Issues:1080

speechbrain

A PyTorch-based Speech Toolkit

Language:PythonLicense:Apache-2.0Stargazers:8673Issues:133Issues:1085

sherpa-onnx

Speech-to-text, text-to-speech, speaker recognition, and VAD using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Android, iOS, Raspberry Pi, RISC-V, x86_64 servers, websocket server/client, C/C++, Python, Kotlin, C#, Go, NodeJS, Java, Swift, Dart, JavaScript, Flutter, Object Pascal, Lazarus, Rust

Language:C++License:Apache-2.0Stargazers:3252Issues:51Issues:488

whisper_streaming

Whisper realtime streaming for long speech-to-text transcription and translation

Language:PythonLicense:MITStargazers:1889Issues:36Issues:97

pyroomacoustics

Pyroomacoustics is a package for audio signal processing for indoor applications. It was developed as a fast prototyping platform for beamforming algorithms in indoor scenarios.

Language:PythonLicense:MITStargazers:1428Issues:43Issues:226

sherpa-ncnn

Real-time speech recognition and voice activity detection (VAD) using next-gen Kaldi with ncnn without Internet connection. Support iOS, Android, Linux, macOS, Windows, Raspberry Pi, VisionFive2, LicheePi4A etc.

Language:C++License:Apache-2.0Stargazers:997Issues:36Issues:144

punctuator2

A bidirectional recurrent neural network model with attention mechanism for restoring missing punctuation in unsegmented text

Language:PythonLicense:MITStargazers:658Issues:28Issues:79

ctc-segmentation

Segment an audio file and obtain utterance alignments. (Python package)

Language:PythonLicense:Apache-2.0Stargazers:319Issues:13Issues:29

speech-emotion-recognition

Speaker independent emotion recognition

Language:PythonLicense:MITStargazers:315Issues:17Issues:34

deepsegment

A sentence segmenter that actually works!

Language:PythonLicense:GPL-3.0Stargazers:302Issues:14Issues:38

VBx

Variational Bayes HMM over x-vectors diarization

brouhaha-vad

Predicts the level of noise and reverberation on your audiofiles

Language:Jupyter NotebookLicense:MITStargazers:138Issues:10Issues:16

deepspeare

Code for Deep-speare: a joint neural model of poetic language, meter and rhyme

Language:HTMLLicense:Apache-2.0Stargazers:70Issues:5Issues:3

wordwise

N-gram keyword extraction using spaCy and pretrained language models

Language:PythonLicense:MITStargazers:62Issues:4Issues:7

Text-Classification-CNN-PyTorch

The aim of this repository is to show a baseline model for text classification through convolutional neural networks in the PyTorch framework. The architecture implemented in this model was inspired by the one proposed in the paper: Convolutional Neural Networks for Sentence Classification.

Language:PythonLicense:MITStargazers:47Issues:3Issues:1

BaySMM

Model for learning document embeddings along with their uncertainties

Language:PythonStargazers:35Issues:4Issues:0

WSJ2WAV

Convert WSJ sphere format to waveform and do data simulation.

Language:PythonLicense:MITStargazers:16Issues:2Issues:0

fast_matrix_multiplication

Different matrix multiplication implementation and benchmarking on CPUs

Language:C++License:MITStargazers:5Issues:2Issues:0

online-text-flow

Online event streaming to improve data and text flows

Language:PythonLicense:MITStargazers:1Issues:5Issues:14

benchmarks

This repository contains the SpeechBrain Benchmarks

Language:PythonLicense:Apache-2.0Stargazers:1Issues:0Issues:0

speechbrain

A PyTorch-based Speech Toolkit

Language:PythonLicense:Apache-2.0Stargazers:1Issues:1Issues:0