audio-to-text

There are 12 repositories under audio-to-text topic.

pluja / whishper
Transcribe any audio to text, translate and edit subtitles 100% locally with a web UI. Powered by whisper models!
ai audio-to-text golang subtitles sveltekit transcription whisper ui webapp speech-recognition speech-to-text stt web web-whisper
Language:Svelte 905
Saik0s / Whisperboard
The open-source iOS app that's making quality voice transcription more accessible on mobile devices.
openai ios speech-recognition speech-to-text swiftui transcription audio-to-text composable-architecture tca tuist whisper whisper-cpp
Language:Swift 613
Kabanosk / whisper-website
Simple web application, which can be used to convert audio to subtitles by OpenAI's Whisper model
fastapi openai speech-to-text whisper python3 uvicorn website audio-to-text subtitles subtitles-generator open-source hacktoberfest
Language:Python 237
URUWorks / TeroSubtitler
Tero Subtitler is an open source, cross-platform, and free subtitle editing software.
editor linux macos subtitles windows free captions open-source subtitle-editor transcription audio-to-text ffmpeg mpv smpte whisper yt-dlp subtitler tero ai blu-ray
Language:Pascal 182
javedali99 / audio-to-text-transcription
This repository contains a Python script that allows users to download the audio from a YouTube video, transcribe it into text, detect the language and save the transcription in txt file automatically.
audio-to-text open-source openai python transcription whisper audio youtube
Language:Python 83
HenestrosaDev / audiotext
A desktop application that transcribes audio from files, microphone input or YouTube videos with the option to translate the content and create subtitles.
customtkinter python speech-recognition audio-to-text video-to-text transcriber speech-to-text speech-to-text-api subtitles-generator whisperx
Language:Python 78
bai0012 / Whisper_auto2lrc
Use Whisper to convert audio files into LRC subtitle files in bulk. 使用whisper实现将音频文件批量转换为lrc字幕文件
audio-to-text lrc python whisper windows pytorch
Language:Python 54
KostasEreksonas / Audio-transcriber
Simple Python audio transcriber using OpenAI's Whisper speech recognition model
audio audio-to-text openai openai-whisper pip python text transcription whisper youtube youtube-dl
Language:Python 32
persiandataset / PersianSpeech
Persian ASR dataset
dataset asr persian-speech-dataset persian-speech-recognition audio-to-text
23
GabrieleRisso / aiyu
core shell functions building blocks for advanced AI pipelines
ai audio-to-image audio-to-text gpt-3 stable-diffusion sutitles text-to-audio text-to-code text-to-image text-to-speech tts whisper
13
ur_audio_sub
thinh-vu / ur_audio_sub
Generate text captions for audio files & youtube video using OpenAI Whisper on Google Colab. Multiple languages support.
audio-to-text audio-transcription caption-generator speech-recognition whisper
Language:Jupyter Notebook 13
Journal.it
markydoodled / Journal.it
A SwiftUI App For People Who Need To Take Down Important Information Quickly.
swiftui texteditor voicerecorder camera ios macos audio-editing photo-editing audio-to-text swift
Language:Swift 12
gisty-org / chrome-extension
Chrome Extension to capture captions of ongoing meetings by using webkitspeechrecognition api for all the web video conferencing platforms (for google meet, it directly extracts the captions) and sends to flask api for summarization.
javascript meet-captions audio-to-text google-meet normal-video webkitspeechrecognition chrome-extension capture-captions
Language:JavaScript 9
AzizBenAli / YouTube-AI-Assistant
Develop a python application that allows you to extract valuable insights, engage in meaningful conversations, and explore video content in a whole new way.
agents chatbot langchian-app retrieval-augmented-generation streamlit youtube-api audio-to-text embeddings openai pineconedb conversational-agents conversational-bots python-app memory generative-ai
Language:Python 6
ascender1729 / AudioDictate
An efficient desktop application for transcribing audio files into text using Vosk speech recognition.
audio-processing audio-to-text offline-transcription python speech-recognition transcription vosk
Language:Python 4
gabrielsenadev / audioinsight
AudioInsight is a web application that processes audio and generates transcriptions, summaries, titles, and allows users to ask questions about the related audio.
cloudflare-ai-challenge cloudflare-d1 cloudflare-pages cloudflare-r2 cloudflare-workers-ai ai-audio audio-processing audio-to-text
Language:TypeScript 4
yjg30737 / whisper_transcribe_youtube_video_example_gui
GUI Showcase of using Whisper to transcribe and analyze Youtube video
audio-to-text pyqt pyqt5 pyqt5-desktop-application pytube whisper python qt
Language:Python 4
lxvdnl / audio-to-text-convertor-app
An application in which you can record the output audio stream and turn it into text format.
audio-to-text qt cpp
Language:C++ 3
maximebories / YT-knowledge
Extract textual meaning and knowledge from all videos of a YouTube user's playlists
audio-to-text natural-language-understanding transcription youtube-api youtube-api-v3
Language:Python 3
robinvriens / openai-whisper
Transcribe Audio to Text with node.js using the Whisper model from OpenAI.
ai audio-to-text audiototext nodejs openai transcribe transcription whisper
Language:JavaScript 3
Chauhan-Aniket / Deepgram-AudioToText
Convert Audio to Text using Django
audio-to-text deepgram django
Language:Python 2
gustavz / audio-to-text
streamlit app to transcript audio to text using openai's whisper library
audio-to-text streamlit whisper
Language:Python 2
MitchellShibilski-Unkel / PyAI
Open-source AI library (audio to text, simple NLP and algorithms)
ai knn mitchell-shibilski-unkel python rnn pyai nlp nlp-machine-learning whisper audio-to-text machine-learning machine-learning-library pytorch relu-activation softmax part-of-speech part-of-speech-tagger
Language:Python 2
Siddp278 / NLP_deeplearning
Implemented some of the models and techniques learned in NLP to help build systems that help in daily life.
nlp-machine-learning word-vectors document-vector audio-to-text logistic-regression naive-bayes-classifier autocorrection translators
Language:Python 2
Anujesh-Ansh / AudioTranscribe
Transform audio recordings into text transcripts effortlessly with AudioTranscribe! 🎙️📝 Simplify your transcription process and enhance accessibility with top-notch accuracy. Explore the power of text-to-speech conversion today! 🚀🎧
audio-to-text beginner-project jupyter-notebook python whisper-ai
Language:Jupyter Notebook 1
athrvadeshmukh / DARPG-HACKATHON
ai audio-processing audio-to-text darpg-challenge-2024 hackathon-project hindi-to-english machine-learning speech-recognition speech-to-text darpg zero-ai zero-transciber artificial-intelligence artificial-neural-networks chatgpt open-ai translation-api translation-model translation-tool whisper-ai
Language:Jupyter Notebook 1
chandan0709 / extract-text-from-image-and-audio-using-google-vision-api
I have used the Google Cloud Vision API to transcript the audio file and extract the text from the image.
python3 google-cloud-vision api-service jupyter-notebook audio-to-text image-to-text
Language:HTML 1
imgta / vialect
Streamline your video/audio intake by transforming multimedia content into navigable collections of transcribed text and summaries!
audio-to-text openai python streamlit text-to-speech video-to-text whisper
Language:Python 1
LOKAL_for_Kafka
jbolns / LOKAL_for_Kafka
Event-driven AI > A Python-Kafka event-driven micro-services solution for distributed audio transcriptions.
ai audio-to-text kafka open-source python transcription
Language:Python 1
LOKAL_transcriptions
jbolns / LOKAL_transcriptions
Edge AI > AI app to easily perform transcriptions on regular computers. Quality on par with on-cloud alternatives. Lower costs. Reduced privacy risks.
ai audio-to-text edge-ai open-source python standalone-app transcription
Language:Python 1
Darveivoldavara / whisper-timestamped
Timestamped ASR microservice
asr audio-to-text automatic-speech-recognition data-analysis data-science deep-learning docker fastapi mlops mssqlserver openai prompt-engineering python resource-management whisper timestamps monitoring uvicorn-gunicorn
Language:Jupyter Notebook 0
Darveivoldavara / whisper_model_evaluator
WER, MER, WIL of Whisper vs Vosk vs Google transcribators comparator
asr audio-to-text automatic-speech-recognition data-analysis evaluation google-speech-recognition python tuning-parameters visualization vosk whisper
Language:Jupyter Notebook 0
sameemul-haque / TranscribeTool
📼 A streamlit web interface designed to extract words from video/audio files into text • Python, FFmpeg, Whisper, YT-DLP
audio-to-text audio-transcription audiototext ffmpeg openai-whisper personal-project project streamlit transcribe transcribe-tool transcriber transcribetool transcription video-to-text video-transcription videototext whisper whisper-ai whisper-api yt-dlp
Language:Python 0
breadrock1 / audio-to-text
There is simple backend project to use whisper-rs.
actix-web audio-to-text rust swagger-ui whisper
Language:Rust
rbgo404 / whisper-large-v3
Whisper Large V3 is a pre-trained model developed by OpenAI and designed for tasks like automatic speech recognition (ASR), speech translation and language identification.
audio-to-text
Language:Python
sahasourav17 / Meeting-Notes
Transcribe Bangla Audio into Text
audio-to-text bengali transcribe banglaconformer bengaliai
Language:Jupyter Notebook

audio-to-text

pluja / whishper

Saik0s / Whisperboard

Kabanosk / whisper-website

URUWorks / TeroSubtitler

javedali99 / audio-to-text-transcription

HenestrosaDev / audiotext

bai0012 / Whisper_auto2lrc

KostasEreksonas / Audio-transcriber

persiandataset / PersianSpeech

GabrieleRisso / aiyu

thinh-vu / ur_audio_sub

markydoodled / Journal.it

gisty-org / chrome-extension

AzizBenAli / YouTube-AI-Assistant

ascender1729 / AudioDictate

gabrielsenadev / audioinsight

yjg30737 / whisper_transcribe_youtube_video_example_gui

lxvdnl / audio-to-text-convertor-app

maximebories / YT-knowledge

robinvriens / openai-whisper

Chauhan-Aniket / Deepgram-AudioToText

gustavz / audio-to-text

MitchellShibilski-Unkel / PyAI

Siddp278 / NLP_deeplearning

Anujesh-Ansh / AudioTranscribe

athrvadeshmukh / DARPG-HACKATHON

chandan0709 / extract-text-from-image-and-audio-using-google-vision-api

imgta / vialect

jbolns / LOKAL_for_Kafka

jbolns / LOKAL_transcriptions

Darveivoldavara / whisper-timestamped

Darveivoldavara / whisper_model_evaluator

sameemul-haque / TranscribeTool

breadrock1 / audio-to-text

rbgo404 / whisper-large-v3

sahasourav17 / Meeting-Notes