There are 5 repositories under silero topic.
Turn PDFs and EPUBs into audiobooks, subtitles or videos into dubbed videos (including translation), and more. For free. Pandrator uses local models, notably XTTS, including voice-cloning (instant, RVC-enhanced, XTTS fine-tuning) and LLM processing. It aspires to be a user-friendly app with a GUI, an installer and all-in-one packages.
Android Voice Activity Detection (VAD) library. Supports WebRTC VAD GMM, Silero VAD DNN, Yamnet VAD DNN models.
Telegram bot with voice message recognition and generation. Speech to Text and Text to Speech
Silero TTS backend service. Can be used with Home Assistant and Rhasspy.
Llama cute voice assistant
In this repository, I show you how to use SILERO VAD with ONNX-WEB runtime to run the VAD compeletely in the browser.
This is a simple server that uses Silero models to convert text to audio files over HTTP
voice to voice with ai text generator that can be hooked up to vtube studio like an ai assistant.
Silero TTS Enhanced is a Python library that enhances the original Silero TTS project, providing a convenient way to synthesize speech from text using Silero TTS models. It offers a user-friendly interface for both standalone script usage and integration into Python projects, along with additional features
A web-app/library for transcribing speech
Automatically cuts out parts without speech from given video, making it shorter and more enjoyable to watch (look examples). Usage on google.collab in several clicks.
iOS Voice Activity Detection (VAD). Supports WebRTC VAD GMM, Silero VAD DNN, Yamnet VAD DNN models.
It's typescript based VAD that uses silero ai VAD under the hood. It's highly robust for Voice Activity Detection. It only works in the browser.
Server for running Silero TTS models (or other compatible models) with OpenTTS-like API. This server allows users to generate speech using different models and provides an easy-to-use REST API.