voice-processing

There are 0 repository under voice-processing topic.

Picovoice / web-voice-processor
A library for real-time voice processing in web browsers
javascript browser web-browser real-time realtime wake-word-detection voice-commands speech-recognition speech-to-text voice-processing audio-processing webaudio-api worker downsampling microphone pcm
Language:TypeScript 232
rembertdesigns / gemma3n-disaster-assistant
AI-powered disaster response platform with offline-first architecture using Gemma 3n. Provides computer vision hazard detection, voice analysis with emergency keywords, PDF report generation, and multi-user coordination - all working without internet access.
ai computer-vision kaggle kaggle-competition multimodal triageservice voice-processing gemma3n offline-disaster-response ondevice-ai emergency-response fastapi offline-sync pwa
Language:HTML 12
kristofferv98 / SemanthaVoiceAssistant
A comprehensive AI companion leveraging advanced semantic analysis, sentiment detection, and voice processing to provide personalized and context-aware interactions using Autogen, semantic-router, and VoiceProcessingToolkit.
autogen elevenlabs intent-detection local-llm natural-language-processing openai picovoice python rag sentiment-analysis text-to-speech voice-activity-detection voice-assistant voice-processing voice-recognition websearch whisper ai-companion personalized-interactions semantic-routing
Language:Python 7
Paschalis / VoiceMeld
Advanced Topics in Speech and Language Processing
voice voice-processing voice-recognition
Language:MATLAB 6
AI-CallConnect
madhurimarawat / AI-CallConnect
A cutting-edge AI-powered phone agent designed for seamless voice interactions, dynamic data handling, and scalable communication. Perfect for modern sales and customer engagement solutions.
code-documentation data-science-projects deployed-app deployment encode-hackathon fuzzy-matching-algorithm intemediate-project pandas python question-answering-system streamlit streamlit-deployment system-architecture voice-processing ai-callconnect artificial-generated-dataset cold-calling-app dataset-generated unstop-hackathon deployment-production
Language:Jupyter Notebook 4
AmirMahdyJebreily / Microphone-quality-evaloution
Live microphone quality detection system in browser Js
browser javascript microphone microphone-array-processing voice-processing voice-quality
Language:TypeScript 3
kristofferv98 / VoiceProcessingToolkit
The VoiceProcessingToolkit is an all-encompassing suite designed for sophisticated voice detection, wake word recognition, text-to-speech synthesis, and advanced audio processing. It offers intuitive interfaces to streamline the integration of voice processing capabilities into your applications
api audio automation elevenlabs gpt-4 multithreading openai picovoice python speech text-to-speech transcription utility voice voice-processing wake-word whisper whisper-api
Language:Python 3
Ceviess / tgvoice2text
A Telegram bot that processes voice messages using Sber's speech recognition API. This bot converts audio formats, generates authentication tokens, and transcribes voice messages into text, enabling seamless communication via Telegram.
audio-conversion chatbots python speech-recognition telegram-api telegram-bot text-to-speech voice-processing sber-api
Language:Python 1
Chintan2108 / Consumer-Complaint-Classification-OPEN-AI
This repository is made in lieu of submission towards the solution of problem statement 2 of the OPEN AI NLP hackathon. The objective here is to classify the voice recordings of a call center proceeding by treating them as consumer complaints into the said categories of the automotive industry.
nlp complaints text-classification speech-to-text voice-processing complaint-classification
Language:Jupyter Notebook 1
Erfanafshar / speech-gender-detection
An audio signal processing project that detects speaker gender from recorded voice samples and enhances speech using spectral subtraction techniques in MATLAB.
audio-analysis fourier-transform gender-classification matlab signal-processing speech-recognition voice-processing
Language:MATLAB 1
hhoangphuoc / R2D2TimbreTransfer
Timbre Transfer for R2D2-alike Robot voice turning into instrument using Diffusion Model
diffusion-models robot timbre-transfer voice-processing
Language:Python 1
Gordon-Yeh / Memory-Frame
🖼️ framed picture cloud base smart photo frame with voice activation paired with an android app
digital-photoframe voice-processing android
Language:Java 0
Kammann123 / vocoder
Coursework 1 of the Voice Signal Processing course at ITBA. Real-time LPC Vocoder written in Python
dsp lpc portaudio pyaudio python real-time vocoder voice-processing
Language:Jupyter Notebook 0
mohammad-safari / Speech-Spectral-Substraction-and-Noise-Remove
Final_Project_of_Siganls_&_Sytems_Spring_1401
fourier-analysis noise-reduction signal-processing signals-and-systems spectrum visualization voice-processing white-noise
Language:Jupyter Notebook 0
Namratha2301 / dogcat
Web Application that Identifies Animal from their Sound. Right now restricted to binary classification between cat and dog sounds.
python-3-9 ann azure bashscript cnn keras librosa tailwindcss tensorflow voice-processing flask flake8
Language:PureBasic 0
asadsandhu / Whisper-Audio-To-Text
🎧 Transcribe any audio to text in seconds using OpenAI Whisper — right in Google Colab. No setup needed! Upload your MP3, WAV, M4A, or FLAC file and get accurate, multilingual transcriptions powered by Whisper’s medium model — all free in the cloud. ☁️
ai asr audio-to-text audio-transcription automation cloud colab-notebook deep-learning ffmpeg google-colab machine-learning multilingual nlp openai pytorch speech-recognition transcription voice-processing whisper openai-whisper
Language:Jupyter Notebook
cbasitodx / Voice_Processing_Course
Curso de procesado digital de la señal (24-25) : Aplicación al procesado de la voz.
signal-processing voice-processing
Language:Python
ducnt18121997 / Viet-SASV
This repository presents a comprehensive PyTorch implementation of an end-to-end Speaker Verification system, incorporating state-of-the-art deep learning architectures and language models. The system features robust speaker recognition capabilities, with specialized support for the Vietnamese
speaker-verification vietnamese voice-processing voice-spoofing
Language:Python
KenyonY / guang
Universal Function Library of Scientific Calculation
tts utility-library machine-learning deep-learning voice-processing
Language:Jupyter Notebook
sezer-muhammed / Anadolu-Ajans--Medya-Teknolojileri-hackathon
AI-powered platform for creative content generation and management, featuring advanced AI integrations, seamless accessibility, and community collaboration.
ai-content-creation generative-models user-interface voice-processing
Language:Python
simonnchong / Human-Voice-Segmentation
This is an algorithm to identify human voice and do segmentation automatically. The result will be compared to the manual segmentation data, then a accuracy report will be generated based on match rate, insertion rate and omission rate.
matlab multimedia-processing voice voice-processing voice-segmentation
Language:MATLAB

voice-processing

Picovoice / web-voice-processor

rembertdesigns / gemma3n-disaster-assistant

kristofferv98 / SemanthaVoiceAssistant

Paschalis / VoiceMeld

madhurimarawat / AI-CallConnect

AmirMahdyJebreily / Microphone-quality-evaloution

kristofferv98 / VoiceProcessingToolkit

Ceviess / tgvoice2text

Chintan2108 / Consumer-Complaint-Classification-OPEN-AI

Erfanafshar / speech-gender-detection

hhoangphuoc / R2D2TimbreTransfer

Gordon-Yeh / Memory-Frame

Kammann123 / vocoder

mohammad-safari / Speech-Spectral-Substraction-and-Noise-Remove

Namratha2301 / dogcat

asadsandhu / Whisper-Audio-To-Text

cbasitodx / Voice_Processing_Course

ducnt18121997 / Viet-SASV

KenyonY / guang

sezer-muhammed / Anadolu-Ajans--Medya-Teknolojileri-hackathon

simonnchong / Human-Voice-Segmentation