There are 8 repositories under the tts-api topic.
Dockerized FastAPI wrapper for the Kokoro-82M text-to-speech model, with CPU ONNX and NVIDIA GPU PyTorch support, handling, and auto-stitching
[ICASSP 2024] 🍵 Matcha-TTS: A fast TTS architecture with conditional flow matching
A simple VITS HTTP API, developed by extending Moegoe with additional features.
Self-host the powerful Chatterbox TTS model. This server offers a user-friendly Web UI, flexible API endpoints (incl. OpenAI compatible), predefined voices, voice cloning, and large audiobook-scale text processing. Runs accelerated on NVIDIA (CUDA), AMD (ROCm), and CPU.
A simple FastAPI Server to run XTTSv2
A feature-rich portal to chat with GPT-4, Claude, Gemini, Mistral, & OpenAI Assistant APIs via a lightweight Node.js web app; supports customizable multimodality for voice, images, & files.
Self-host the powerful Dia TTS model. This server offers a user-friendly Web UI, flexible API endpoints (incl. OpenAI compatible), support for SafeTensors/BF16, voice cloning, dialogue generation, and GPU/CPU execution.
Self-host the ultra-lightweight Kitten TTS model with this enhanced API server with an intuitive Web UI, large text processing for audiobooks, and GPU acceleration.
openai-whisper-talk is a sample voice conversation application powered by OpenAI technologies such as Whisper, Completions, Embeddings, and the latest Text-to-Speech. The application is built using Nuxt, a JavaScript framework based on Vue.js.
NoneBot DeepSeek plugin. Integrates the DeepSeek model to provide intelligent conversation and question answering.
AivisSpeech Engine: AI Voice Imitation System - Text to Speech Engine
🌻 VITS ONNX TTS server designed for fast inference 🔥
Open-Audio TTS: A robust web app leveraging OpenAI's powerful Text-to-Speech (TTS) models to generate natural-sounding audio from text. Built with modern web technologies for an intuitive user experience, including customizable voice and speech speed settings, and the ability to download audio files directly.
An unofficial ElevenLabs RESTful API client for .NET
An AI-powered chatbot integrated with Telegram, using OpenAI GPT-3.5 Turbo, language embeddings, and FAISS for similarity search to provide more contextually relevant responses to user queries
Simple Python script to interact with the TikTok TTS Voices.
An unofficial API for Microsoft speech synthesis, based on the Read Aloud feature of the Microsoft Edge web browser
Official AllVoiceLab Model Context Protocol (MCP) server, supporting interaction with powerful text-to-speech and video translation APIs.
Twitch Streamer GPT is a NodeJS-based Twitch enhancement tool, offering interactive stream experiences with AI-powered automated responses, voice command activations, and advanced modules. It's easy to set up and suited for users of all tech levels.
Text-to-speech with multilingual support (20+ languages)
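any4any is an enterprise-grade multimodal AI platform that provides a complete intelligent-interaction solution. It integrates core features such as large language model chat, a digital human system, intelligent SQL querying, speech processing, and a knowledge base system, and supports OpenAI-compatible API interfaces for seamless integration into all kinds of AI applications.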
Your speech assistant. Communicate with text-to-speech in games, on voice chat, on stream or simply on your speakers!
Some simple wrappers around eSpeak NG intended to make using this excellent TTS for waveform and IPA generation as convenient as possible.
A Chrome extension for high-quality Text-to-Speech APIs like Google's WaveNet / OpenAI TTS API. Contributions Welcome!
🗣️🎤 elevenlabs-api is an open source Java wrapper around the ElevenLabs Voice Synthesis and Cloning Web API.
Uses the OpenAI API to clean a PDF, then converts it to a professional-grade audiobook with text-to-speech.
TTS RHVoice REST API
Go API wrapper around Piper TTS. Supports TTS inference and listing available voices.
This is a simple HTTP service that uses the Edge-TTS library to generate text-to-speech audio files.