eziolotta's repositories
DeepSpeech
DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.
DeepSpeech-Italian-Model
Tooling for producing Italian model for Common Voice
Montreal-Forced-Aligner
Command line utility for forced alignment using Kaldi
rasa_nlu
đź’¬ Open source library for natural language understanding with intent classification and entity extraction - DIY NLP for chatbots
rVADfast
This is the Python library for an unsupervised, fast method for robust voice activity detection (rVAD), as in the paper rVAD: An Unsupervised Segment-Based Robust Voice Activity Detection Method.
whisper-app
This repository contains all the work I have done (and I'm doing) in developing a web app for speech-to-text, based on OpenAI Whisper