SOMIL JAIN's repositories
data-structure
Data structure implementations in multiple languages
DisVoice
Feature extraction from speech signals
EditAnything
Edit anything in images powered by segment-anything, ControlNet, StableDiffusion, etc.
IndicWav2Vec
Pretraining, fine-tuning and evaluation scripts for Indic-Wav2Vec2
kaldi-serve
Server framework for Kaldi ASR Toolkit
mozilla-vpn-client
A fast, secure and easy to use VPN. Built by the makers of Firefox.
pykaldi
A Python wrapper for Kaldi
SpeakerDiarization_api
A speaker diarization API
speech-to-text-wavenet
Speech-to-Text-WaveNet: End-to-end sentence-level English speech recognition based on DeepMind's WaveNet and TensorFlow
speech_emo_recognition
Speech emotion recognition using fully convolutional and convolutional-recurrent models
whisper-at
Code and Pretrained Models for Interspeech 2023 Paper "Whisper-AT: Noise-Robust Automatic Speech Recognizers are Also Strong Audio Event Taggers"