There are 3 repositories under google-cloud-speech topic.
R client for the Google Translation API, Google Cloud Natural Language API and Google Cloud Speech API
A Vue2 Performs synchronous speech recognition Speech to text Google Cloud Speech With Progressive Web App
A Vue2 Streaming Speech Recognition Speech to text with Google Cloud Speech
Convert English Speech into American Sign Language using Google Cloud APIs and play animations for the gesture in Blender Game Engine (Blender 2.79).
how to use the Google Cloud Speech API to transcribe audio/video files.
Context based video seek and search
An open-source framework for modeling real-time conversations in spoken dialogue systems.
Using Snowboy and Google Cloud speech api in Electron for voice recognition
Transcribe live audio using Google Cloud Speech to Text API
Examples of using Google Cloud APIs with Raspberry Pi
A simple app to convert audio files to text using speech-to-text APIs
Transcript Audio/Video Call in a real time
Speech Recognition using Google Cloud Speech API
API server for our Speech translator extension
Jupyter notebook for turning textual dialogue into voice audio.
網頁語音記帳程式 - 利用Google Cloud Speech API 實現快速語音記帳
👻 신해철의 고스트스테이션의 스크립트를 제작 중입니다 | Crom's Ghoststation Radio Transcript
My undergraduate Final Year Project uses a few R-Pis with microphone array hats and allows recording to local disk, streaming to central computer and streaming voice recognition using local VAD and Google Cloud
Google Speech-to-text Hermes module for Rhasspy 2.5
Google Speech-to-text module for Rhasspy 2.5
Project for Multimodal Interaction course (A.Y. 2019/2020), GesturePad
Text to speech note taking service for meetings.
A Speech to Text to Text to Speech script
Web app provides live captions through Chrome's Google Cloud Speech API
谷歌云声音转文字之SRT字幕中文语言版 | Google Cloud SpeechToText, SRT for Chinese
Koala's Toy Project
Large Language Models (OpenAI, Anthropic, Groq, Ollama) tested in NodeJS CLI app environment. Various features like voice recognition, speech synthesis, function calling tools, agents etc. are implemented to the library.
HackWestern 5 - A video calling platform that bridges the language barriers across the world. Wish to talk to another person who's language you don't understand? Don't worry, we got you covered with a real-time video call experience!
A video transcription application built with Nuxt.js version 3
Efficient profanity-filtering tool to combat Tourette Syndrome