Alexander Veysov's repositories
silero-models
Silero Models: pre-trained speech-to-text, text-to-speech and text-enhancement models made embarrassingly simple
silero-vad
Silero VAD: pre-trained enterprise-grade Voice Activity Detector
russian_stt_text_normalization
Russian text normalization pipeline for speech-to-text and other applications based on tagging s2s networks
deepspeech.pytorch
Speech Recognition using DeepSpeech2.
cft-contest-2018
Repository with illustrations for cft-contest-2018
awesome_doom_quickstart
Many young people have not played Doom ... this is a small guide to help them start their journey
open-speech-corpora
A list of accessible speech corpora for ASR, TTS, and other Speech Technologies
MNASNet-pytorch-1
MNASNet implementation and pre-trained model in PyTorch
speechbrain
A PyTorch-based Speech Toolkit
speech-recognition-uk
Speech Recognition for Ukrainian
azure-sdk-for-python
This repository is for active development of the Azure SDK for Python. For consumers of the SDK we recommend visiting our public developer docs at https://docs.microsoft.com/en-us/python/azure/ or our versioned developer docs at https://azure.github.io/azure-sdk-for-python.
CRAFT-pytorch
Official implementation of Character Region Awareness for Text Detection (CRAFT)
deep-learning-german-tts
The free German voice dataset.
NPTEL2020-Indian-English-Speech-Dataset
NPTEL2020: Speech2Text dataset for Indian-English accent
pandoc-latex-template
A pandoc LaTeX template to convert markdown files to PDF or LaTeX.