There are 10 repositories under video-to-text topic.
Video to Text: Natural language description generator for some given video. [Video Captioning]
A real-time video caption to conversation bot that captures frames generates captions and creates conversational responses using a Large Language Models base to create interactive video descriptions.
Summary about Video-to-Text datasets. This repository is part of the review paper *Bridging Vision and Language from the Video-to-Text Perspective: A Comprehensive Review*
Everything is very simple: you either download a picture file or specify its link when running a python script, and output you get a text file, and you can immediately view on the command line how it will look the result of your conversion.
A desktop application that transcribes and optionally translates audio from a file or microphone using WhisperX or the Google Speech-to-Text API.
Source code of the paper titled *Improving Video Captioning with Temporal Composition of a Visual-Syntactic Embedding*
Generating video descriptions using deep learning in Keras
A curated list of video-text datasets in a variety of languages. These datasets can be used for video captioning (video description) or video retrieval.
Source code of the paper titled *Attentive Visual Semantic Specialized Network for Video Captioning*
Generate subtitles for all the videos in a folder with OpenAI's Whisper privately in your computer.
A curated list of zero-shot captioning papers
Generate automatic transcripts and subtitles for your videos with the help of the neural network-based.
An AI tools which helps to analyze any YouTube video, give the sentiment of the video and suggest description and topics related the content. Lastly, It extract the subtitles from the video by understanding the audio then transcribe it in any language with timestamps and also embed the subtitles into the video
Text from the video is extracted and saved into a .docx file in the form of notes.
A Flask API to convert speech to text using Offline Transcription methods - CMU Sphinx and DeepSpeech.
SolutionAI App which can solve any problems or summarize any Image or Youtube video of any duration to the shortest summary you need.
Python program able to transcribe a Youtube video to text with the help of AI.
It includes our two recent papers on text-to-video retrieval along with a technical report.
ONLY FRONT-END DISPLAY
Convert videos into colourful ASCII art for terminal display using Python and OpenCV.
A Python tool for transcribing videos using Whisper
Unlimited Youtube-Transcript-Generator
📼 A streamlit web interface designed to extract words from video/audio files into text • Python, FFmpeg, Whisper, YT-DLP
This repository is an implementation of the Wav2Vec2 model for converting speech into text through a series of speech recognition, noise removal and STT to transcribe the text from a video file.
Convert a video file or camera captured to display as text.