speechbrain

There are 0 repository under speechbrain topic.

speechbrain / HyperPyYAML
Extensions to YAML syntax for better python interaction
python speechbrain yaml
Language:Python 74
jordicapde / stutter-former
StutterFormer is an AI model that aims to be able to receive a speech sample with stuttering disfluencies, and return it with the disfluencies attenuated or eliminated.
speech-enhancement stuttering transformer speechbrain
Language:Jupyter Notebook 18
nuaazs / VAF
Backend of anti-fraud system based on speaker identification technology. 基于声纹识别的反诈系统后端
speaker-identification speaker-recognition speechbrain
Language:Python 17
lucadellalib / ts-asr
Target speaker automatic speech recognition (TS-ASR)
conformer pytorch rnn speech-recognition speechbrain transducer asr
Language:Python 11
shahad-mahmud / incremental_learning_for_asr
Incremental learning for automatic speech recognition (ASR)
continual-learning incremental-learning knowledge-distillation pytorch speech-recognition speech-to-text speechbrain
Language:Python 8
amitpuri / Ask-picturize-it
Record voice, transcribe a prompt, picturize the prompt, create variations, get description of a celebrity and upload, other use cases on KB
natural-language-processing openai openai-dall-e openai-whisper cloudinary gradio huggingface assemblyai elevenlabs rapidapi generative-ai gpt4 speechbrain stable-diffusion stabilityai runway-generated runwayml azure-openai machine-learning langchain
Language:Jupyter Notebook 7
aalto-speech / speechbrain-cl
Implementation of different curriculum learning (CL) methods for speechbrain's ASR recipes.
asr curriculum python speechbrain
Language:Python 5
aspyridakos / EEG-Based-Motor-Imagery-Decoding-with-Deep-Learning
Processing EEG data using Speechbrain-MOABB and model tuning to get best results
eeg-classification eegnet machine-learning moabb python speechbrain
Language:Jupyter Notebook 4
Hguimaraes / 3Denoiser
[Research] A Perceptual Loss Based Complex Neural Beamforming for AmbiX 3D Speech Enhancement
speechbrain speech-enhancement 3d-audio
Language:Python 2
kipmccharen / sys6016_DL_project
pretrained SpeechBrain wav2vec seq2seq+CTC model trained on TIMIT dataset. Created by Kip McCharen, Siddharth Surapaneni, and Pavan Bondalapati
timit speechbrain per apr wav2vec2
Language:Python 2
adam-aalah / Speech-transcription
Speech transcription and speech diarization
diarization python speech-diarization speech-to-text speech-transcription transcription whisper-ai speechbrain
Language:Python 1
albinjm / FinSpeech
A Speech Recognition Framework for Banking Interactions using Convolutional Recurrent Dense Neural Networks and Language Models
automatic-speech-recognition banking convolutional-neural-networks customer-service data-preprocessing deep-learning dense-neural-networks hugging-face language-modeling performance-evaluation pytorch recurrent-neural-networks speech-recognition speechbrain torchaudio
Language:Jupyter Notebook 1
AnkushRathour / AudioSpeakerVerification
AudioSpeakerVerification: FastAPI-based API for Speaker Matching and Verification using SpeechBrain. Compare and verify speaker identities from audio files.
api fastapi python3 speaker-verification speechbrain
Language:Python 1
gabrielziegler3 / speech-emotion-recogntion-ser2022
Speech Emotion Recognition SE&R 2022
speech-recognition sentiment-analysis deep-learning speechbrain
Language:Jupyter Notebook 1
harshita-bfly / Speaker_verification
Speaker verification of virtual assistants using ECAPA-TDNN model from SpeechBrain toolkit and transfer learning approach emphasizing on inter and intra comparision (text independent and dependent).
ecapa-tdnn speaker-recognition speaker-verification speechbrain transfer-learning
Language:Jupyter Notebook 1
wla-98 / speakers_recognition
speakers_recognition
speechbrain speakers-recognition python
Language:Python 1
OwenWaldron / speaker-test
A short test to determine the distribution of similarity scores for different SpeechBrain speaker identification models.
commonvoice speaker-identification speechbrain
Language:Python 0
PhiltasticGuy / voxia
Dockerized Zeroc-ICE architecture processing voice commands from a Xamarin mobile application via an Automatic Speech Recognition (ASR) AI model using SpeechBrain.
csharp docker dotnet speechbrain asr zeroc-ice xamarin xamarin-forms zeroc libvlcsharp
Language:C# 0
DonBraulio / SpeechEmbeddings
Research on speech processing, speaker identification and audio diarization
diarization speaker-identification speech-processing speechbrain
Language:Jupyter Notebook
isabelleysseric / voice-cloning
Speech synthesis with conditioning on very small dataset. Using Nvidia's Tacotron2 and WaveGlow models with Pytorch.
speech-recognition speech-synthesis tacotron2 text-to-speech tts waveglow nvidia speechbrain signal-processing
Language:Jupyter Notebook
zaid-24 / Document-Retrieval-Based-Chat-Bot
Chat-Bot made using whisper live, speechbrain and open AI API
chatbot document-retrieval openai-api rag speechbrain whisper-ai whisper-live
Language:Python

speechbrain

speechbrain / HyperPyYAML

jordicapde / stutter-former

nuaazs / VAF

lucadellalib / ts-asr

shahad-mahmud / incremental_learning_for_asr

amitpuri / Ask-picturize-it

aalto-speech / speechbrain-cl

aspyridakos / EEG-Based-Motor-Imagery-Decoding-with-Deep-Learning

Hguimaraes / 3Denoiser

kipmccharen / sys6016_DL_project

adam-aalah / Speech-transcription

albinjm / FinSpeech

AnkushRathour / AudioSpeakerVerification

gabrielziegler3 / speech-emotion-recogntion-ser2022

harshita-bfly / Speaker_verification

wla-98 / speakers_recognition

OwenWaldron / speaker-test

PhiltasticGuy / voxia

DonBraulio / SpeechEmbeddings

isabelleysseric / voice-cloning

zaid-24 / Document-Retrieval-Based-Chat-Bot