voice-recognition

There are 35 repositories under the voice-recognition topic.

PaddlePaddle / PaddleSpeech
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.
transformer conformer speech-translation streaming-asr speech-alignment punctuation-restoration streaming-tts speech-synthesis tts asr kws speech-recognition sound-classification voice-cloning vocoder voice-recognition self-supervised-learning wav2vec2 whisper code-switch
Language:Python 10248
speechbrain / speechbrain
A PyTorch-based Speech Toolkit
speech-recognition speech-toolkit speaker-recognition speech-to-text speech-enhancement speech-separation audio audio-processing speech-processing speechrecognition asr voice-recognition spoken-language-understanding speaker-diarization speaker-verification pytorch huggingface transformers language-model deep-learning
Language:Python 7964
alphacep / vosk-api
Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
speech-recognition asr voice-recognition speech-to-text android ios raspberry-pi deep-learning deep-neural-networks speech-to-text-android speaker-identification speaker-verification python offline privacy kaldi deepspeech google-speech-to-text vosk stt
Language:Jupyter Notebook 7131
snakers4 / silero-vad
Silero VAD: pre-trained enterprise-grade Voice Activity Detector
onnx pytorch voice-activity-detection voice-commands voice-control voice-detection voice-recognition
Language:Python 2935
theajack / cnchar
🇨🇳 A full-featured Chinese character utility library (pinyin, strokes, radicals, idioms, speech, visualization, etc.)
spell-stroke draw chinese-characters pinyin speak voice-recognition
Language:TypeScript 2268
coqui-ai / STT
🐸STT - The deep learning toolkit for Speech-to-Text. Training and deploying STT models has never been so easy.
stt speech-to-text tensorflow deep-learning automatic-speech-recognition asr voice-recognition speech-recognition speech-recognizer speech-recognition-api
Language:C++ 2159
react-native-voice / voice
🎤 React Native Voice Recognition library for iOS and Android (Online and Offline Support)
react-native android ios speech-recognition voice-recognition
Language:Objective-C 1726
jim-schwoebel / voice_datasets
🔊 A comprehensive list of open-source datasets for voice and sound computing (95+ datasets).
voice-dataset voice-datasets audio-dataset audio-datasets datasets dataset voice data voice-computing voice-control voice-synthesis voice-commands voice-assistant voice-recognition voice-chat voice-activity-detection voice-conversion noise
1559
collabora / WhisperLive
A nearly-live implementation of OpenAI's Whisper.
dictation obs openai text-to-speech translation voice-recognition whisper tensorrt tensorrt-llm whisper-tensorrt
Language:Python 1283
coqui-ai / open-speech-corpora
💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies
tts stt speech-to-text text-to-speech speech-recognition speech-synthesis speech-processing voice-recognition voice-activity-detection voice-cloning speech-emotion-recognition speech-separation
1214
ggeop / Python-ai-assistant
Python AI assistant 🧠
python35 python voice-recognition voice-assistant voice-control voice-activity-detection voice-chat nlp-machine-learning voice-commands linux-assistant nlp voice-recognition-experiment ai sklearn wolfram-language nltk google-speech-recognition google-speech-to-text mongodb pymongo
Language:Python 862
MycroftAI / mycroft-precise
A lightweight, simple-to-use, RNN wake word listener
wake-word-detection keyword-spotting hotword-detection voice-recognition voice-control speech-recognition embedded-systems raspberry-pi
Language:Python 802
alexylem / jarvis
Jarvis.sh is a simple configurable multi-lang assistant.
jarvis raspberry-pi assistant jasper home-automation voice-recognition voice-control voice-commands personal-assistant sarah
Language:Shell 799
EDDiscovery / EDDiscovery
Captain's log and 3D star map for Elite Dangerous
elite-dangerous elite-journal journal-logs eddiscovery elite 3d-map captain-log text-to-speech voice voice-recognition eddn edsm inara
Language:C# 750
wladradchenko / wunjo.wladradchenko.ru
Wunjo AI: Synthesize & clone voices in English, Russian & Chinese, real-time speech recognition, deepfake face & lips animation, face swap with one photo, change video by text prompts, segmentation, and retouching. Open-source, local & free.
free image-animation tacotron2 talking-face talking-face-generation talking-head tts wunjo face-swap face-swapping voice-recognition voice-cloning deepfake-emotion retouching-video controlnet diffusion segment-anything vid2vid deepfake deepfakes
Language:Python 720
yeyupiaoling / VoiceprintRecognition-Pytorch
This project uses a variety of advanced voiceprint recognition models, such as EcapaTdnn, ResNetSE, ERes2Net, and CAM++, and more models may be supported in the future. It also supports the MelSpectrogram and Spectrogram data preprocessing methods.
pytorch voice-recognition arcface speaker-recognition ecapa-tdnn
Language:Python 646
evancohen / sonus
💬 /so.nus/ STT (speech to text) for Node with offline hotword detection
speech speech-recognition speech-to-text voice-control stt node hotword-detection keyword-spotting alexa voice-recognition
Language:JavaScript 618
Picovoice / rhino
On-device Speech-to-Intent engine powered by deep learning
natural-language-understanding voice-recognition nlu spoken-language-understanding voice-assistant voice-ui voice-user-interface speech-recognition voice-commands voice-control vui on-device slu entity-resolution intent-inference slot-filling voice-command voice-command-control
Language:Python 594
Picovoice / speech-to-text-benchmark
speech to text benchmark framework
speech-recognition speech-to-text deepspeech voice-recognition offline privacy deep-learning deep-neural-networks google-speech-to-text aws-transcribe pocketsphinx mozilla-deepspeech cheetah picovoice edge-ai
Language:Python 586
Picovoice / cheetah
On-device streaming speech-to-text engine powered by deep learning
speech-to-text asr automatic-speech-recognition online-speech-recognition speech-recognition stt transcription voice-recognition streaming-speech-to-text
Language:Python 558
algolia / voice-overlay-ios
🗣 An overlay that gets your user’s voice permission and input as text in a customizable UI
voice voice-recognition voice-assistant voicetext instant-search instantsearch input search overlay speech-to-text conversation conversational-ui conversational-interface conversational-bots chatbots permissions speech-recognition ios swift objective-c
Language:Swift 540
Picovoice / picovoice
On-device voice assistant platform powered by deep learning
voice-recognition speech-recoginition voice-assistant voice-interface voice-user-interface natural-language-understanding nlu on-device voice-command voice-commands wake-word-detection
Language:Python 508
adrianhajdin / project_news_alan_ai
In this video, we're going to build a Conversational Voice Controlled React News Application using Alan AI. Alan AI is a revolutionary speech recognition software that allows you to add voice capabilities to your applications.
react react-project reactjs voice-assistant voice-recognition
Language:JavaScript 498
hackingbeauty / react-mic
Record audio from a user's microphone and display a cool visualization.
microphone voice-recognition speech-recognition-apps speech-to-text audio-visualizer record-audio wav-audio voice audio-recorder reactjs voice-activated mp3-audio voice-app voice-applications
Language:JavaScript 443
hollance / TensorFlow-iOS-Example
Source code for my blog post "Getting started with TensorFlow on iOS"
tensorflow metal ios machine-learning logistic-regression voice-recognition
Language:Swift 442
Cay-Zhang / SwiftSpeech
A speech recognition framework designed for SwiftUI.
swift swiftui ios speech-recognition combine voice-recognition user-voice audio
Language:Swift 434
rcbyron / hey-athena-client
Your personal voice assistant
voice assistant voice-control voice-recognition voice-commands alexa siri cortana cross-platform
Language:Python 419
Picovoice / leopard
On-device speech-to-text engine powered by deep learning
stt speech-to-text asr automatic-speech-recognition on-device speech-recognition transcription voice-recognition voice-to-text
Language:Python 411
xenon-19 / Gesture-Controlled-Virtual-Mouse
Virtually control a computer using hand gestures and voice commands. Built with MediaPipe, OpenCV, and Python.
python3 opencv mediapipe mediapipe-hands python final-year-project cse-project gesture-recognition machine-learning voice-assistant voice-recognition chat-bot eel-python computer-vision
Language:Python 394
jim-schwoebel / voicebook
🗣️ A book and repo to get you started programming voice computing applications in Python (10 chapters and 200+ scripts).
voice voice-assistant voice-recognition voice-recording transcription featurization data data-cleaning visualization generation voice-activity-detection voice-control server security encryption-decryption python3 machine-learning wake-word-detection voice-computing
Language:Python 368
alphacep / vosk
VOSK Speech Recognition Toolkit
speech-recognition voice-recognition speech-to-text python lifelong-learning semi-supervised-learning multilingual
Language:C 362
shamspias / customizable-gpt-chatbot
A dynamic, scalable AI chatbot built with Django REST framework, supporting custom training from PDFs, documents, websites, and YouTube videos. Leveraging OpenAI's GPT-3.5, Pinecone, FAISS, and Celery for seamless integration and performance.
artificial-intelligence chatbot data-preprocessing django django-rest-framework gpt-3 machine-learning nlp python conversational-ai voice-chat voice-recognition voice-to-text voice-transcription gpt-voice natural-language-processing langchain langchain-python longchain autogpt
Language:Python 342
dictation-toolbox / Caster
Dragonfly-Based Voice Programming and Accessibility Toolkit
python programming voice-recognition voice-programming voice-commands grammars accessibility rsi dragonfly open-source voice-control voice accessibility-automation
Language:Python 334
Nikorasu / LiveWhisper
A nearly-live implementation of OpenAI's Whisper, using sounddevice. Requires existing Whisper install.
assistant dictation openai python sounddevice speech-recognition speech-to-text text-to-speech transcription whisper ai chatbot openai-whisper tts voice voice-assistant voice-recognition numpy translation terminal
Language:Python 298
Adri6336 / gpt-voice-conversation-chatbot
Allows you to have an engaging and safely emotive spoken / CLI conversation with the AI ChatGPT / GPT-4 while giving you the option to let it remember things discussed.
chatbot gpt-3 openai speech-to-text speech-recognition tts elevenlabs conversational-ai voice-recognition memory conversational conversational-bots personalized user-friendly chatgpt python ai customizable cli gpt-4
Language:Python 289
reriiasu / speech-to-text
Real-time transcription using faster-whisper
faster-whisper speech-recognition whisper voice-recognition openai speech-to-text
Language:HTML 288
