There are 33 repositories under voice-recognition topic.
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.
A PyTorch-based Speech Toolkit
Silero VAD: pre-trained enterprise-grade Voice Activity Detector
:microphone: React Native Voice Recognition library for iOS and Android (Online and Offline Support)
🔊 A comprehensive list of open-source datasets for voice and sound computing (95+ datasets).
💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies
A nearly-live implementation of OpenAI's Whisper.
Python AI assistant 🧠
A lightweight, simple-to-use, RNN wake word listener
Captains log and 3d star map for Elite Dangerous
Wunjo AI: Synthesize & clone voices in English, Russian & Chinese, real-time speech recognition, deepfake face & lips animation, face swap with one photo, change video by text prompts, segmentation, and retouching. Open-source, local & free.
This project uses a variety of advanced voiceprint recognition models such as EcapaTdnn, ResNetSE, ERes2Net, CAM++, etc. It is not excluded that more models will be supported in the future. At the same time, this project also supports MelSpectrogram, Spectrogram data preprocessing methods
speech to text benchmark framework
🗣 An overlay that gets your user’s voice permission and input as text in a customizable UI
In this video, we're going to build a Conversational Voice Controlled React News Application using Alan AI. Alan AI is a revolutionary speech recognition software that allows you to add voice capabilities to your applications.
Source code for my blog post "Getting started with TensorFlow on iOS"
Record audio from a user's microphone and display a cool visualization.
A speech recognition framework designed for SwiftUI.
Your personal voice assistant
Virtually controlling computer using hand-gestures and voice commands. Using MediaPipe, OpenCV Python.
🗣️ A book and repo to get you started programming voice computing applications in Python (10 chapters and 200+ scripts).
A dynamic, scalable AI chatbot built with Django REST framework, supporting custom training from PDFs, documents, websites, and YouTube videos. Leveraging OpenAI's GPT-3.5, Pinecone, FAISS, and Celery for seamless integration and performance.
Dragonfly-Based Voice Programming and Accessibility Toolkit
Start recording when the user speaks
Allows you to have an engaging and safely emotive spoken / CLI conversation with the AI ChatGPT / GPT-4 while giving you the option to let it remember things discussed.
使用Tensorflow实现声纹识别