There are 0 repository under speach-recognition topic.
A tool for summarizing dialogues from videos or audio
Versatile framework designed to streamline the integration of your models, as well as those sourced from Hugging Face, into complex programs
Robot niko with raspberry pi and python
VTuber application which only requires your voice and microphone, no need for a webcam or other tracking nonsense.
A small script that types what you say using whisper while holding a hotkey
Simple voice recognition with vue js
A real-time speech recognition system powered by Groq and ElevenLabs, designed to listen for audio input, recognize speech, and respond with AI-driven dialogue. Customize the trigger word and personality for dynamic conversations. Includes speech synthesis for vocal replies and a live console interface with Rich library visuals.
A web application where you can track all of your expenses.
ROS2 STT node. An out of the box speach to text recognizer using standalone Vosk speech recognition toolkit. This is a MIRRORED REPOSITORY Refer to the GitLab page for the origin.
This is a web application is created with the aim to provide quality education through AI.
AI Personal Desktop Assistant built using python libraries. It does almost anything, including sending emails, Opens any website with just a voice command, Plays Music, and Wikipedia searching.
This repository contains a Convolutional Neural Network (CNN) model for classifying urban sound events using the UrbanSound8K dataset. The model leverages deep learning techniques to accurately categorize different environmental sounds such as sirens, dog barks, and car horns.
gA easy to install/use speach recognition using a webUI with Gradio and faster wisper models (guillaumekln/faster-whisper-large-v2 Is the default)
automate telegram account voice to text
Description The Voice Assistant is a powerful tool designed to interact with users through voice commands, providing a seamless and intuitive way to perform tasks, obtain information, and control applications. This project demonstrates the integration of speech recognition and synthesis technologies to create a responsive and user-friendly voice in
Progetto per il cdl in Informatica, per la materia Technologies for Advanced Programming.
Bot na prywatne serwery discord z wykorzystaniem komend głosowych, slash oraz z prefixem
An application which contains an AI chatbot, AI code generator and an AI image generator
This is Laravel Project, You can submit any Audio file, AI will transcript it, and you can further ask Question about this Audio script.
Automate subtitle generation for videos using OpenAI's Whisper API and Golang. This project extracts audio from videos, transcribes it, and embeds generated subtitles back into the videos.
This is a simple speech recognition algorithm. This was a Learning Project, in which I learned how to apply machine learning to sounds, using concepts like Mfcc and the role of Fourier transform.
Применение функционала библиотеки моделирования импульсных нейронных сетей Norse для задачи распознавания речи.
Text-to-Speech & AI Bot With OSC Integration
Polish voice to text translator with word search function using fastapi.
:clipboard: speech recognition using javascript, part of Javascript30 Youtube series by Wes Bos
Python powered Intelligent System J.A.R.V.I.S
Rejestrator video z detekcją obiektów sterowany głosowo.
📿 Dhikr Counter - Count Dhikr Using Google Speech to Text Library
Simple voice assistant powered by openai api
Voice translator - https://little-brother.github.io/english-translator/
Voice Id Door Lock Web-App is a Speaker-Identification and Sentence-Verification using Voice MFCCs Feature and GMM
SwarSathi makes communication accessible for people with hearing and speech impairments in India. Over 70 million people can benefit from this platform!