There are 0 repository under speechbrain topic.
Extensions to YAML syntax for better python interaction
StutterFormer is an AI model that aims to be able to receive a speech sample with stuttering disfluencies, and return it with the disfluencies attenuated or eliminated.
Target speaker automatic speech recognition (TS-ASR)
Incremental learning for automatic speech recognition (ASR)
Record voice, transcribe a prompt, picturize the prompt, create variations, get description of a celebrity and upload, other use cases on KB
Implementation of different curriculum learning (CL) methods for speechbrain's ASR recipes.
Processing EEG data using Speechbrain-MOABB and model tuning to get best results
[Research] A Perceptual Loss Based Complex Neural Beamforming for AmbiX 3D Speech Enhancement
pretrained SpeechBrain wav2vec seq2seq+CTC model trained on TIMIT dataset. Created by Kip McCharen, Siddharth Surapaneni, and Pavan Bondalapati
Speech transcription and speech diarization
A Speech Recognition Framework for Banking Interactions using Convolutional Recurrent Dense Neural Networks and Language Models
AudioSpeakerVerification: FastAPI-based API for Speaker Matching and Verification using SpeechBrain. Compare and verify speaker identities from audio files.
Speech Emotion Recognition SE&R 2022
Speaker verification of virtual assistants using ECAPA-TDNN model from SpeechBrain toolkit and transfer learning approach emphasizing on inter and intra comparision (text independent and dependent).
A short test to determine the distribution of similarity scores for different SpeechBrain speaker identification models.
Dockerized Zeroc-ICE architecture processing voice commands from a Xamarin mobile application via an Automatic Speech Recognition (ASR) AI model using SpeechBrain.
Research on speech processing, speaker identification and audio diarization
Speech synthesis with conditioning on very small dataset. Using Nvidia's Tacotron2 and WaveGlow models with Pytorch.
Chat-Bot made using whisper live, speechbrain and open AI API