There are 13 repositories under video-to-text topic.
Video to Text: Natural language description generator for some given video. [Video Captioning]
A desktop application that transcribes audio from files, microphone input or YouTube videos with the option to translate the content and create subtitles.
A real-time video caption to conversation bot that captures frames generates captions and creates conversational responses using a Large Language Models base to create interactive video descriptions.
Summary about Video-to-Text datasets. This repository is part of the review paper *Bridging Vision and Language from the Video-to-Text Perspective: A Comprehensive Review*
Everything is very simple: you either download a picture file or specify its link when running a python script, and output you get a text file, and you can immediately view on the command line how it will look the result of your conversion.
Source code of the paper titled *Improving Video Captioning with Temporal Composition of a Visual-Syntactic Embedding*
A curated list of video-text datasets in a variety of languages. These datasets can be used for video captioning (video description) or video retrieval.
Generate subtitles for all the videos in a folder with OpenAI's Whisper privately in your computer.
Generating video descriptions using deep learning in Keras
A curated list of zero-shot captioning papers
Source code of the paper titled *Attentive Visual Semantic Specialized Network for Video Captioning*
Convert a video tutorial in a blog post using Claude 3
A video call application that recognizes gestures (signal language) and converts them into text and sound.
Generate automatic transcripts and subtitles for your videos with the help of the neural network-based.
An AI tools which helps to analyze any YouTube video, give the sentiment of the video and suggest description and topics related the content. Lastly, It extract the subtitles from the video by understanding the audio then transcribe it in any language with timestamps and also embed the subtitles into the video
Text from the video is extracted and saved into a .docx file in the form of notes.
A Flask API to convert speech to text using Offline Transcription methods - CMU Sphinx and DeepSpeech.
SolutionAI App which can solve any problems or summarize any Image or Youtube video of any duration to the shortest summary you need.
Python program able to transcribe a Youtube video to text with the help of AI.
Convert videos into colourful ASCII art for terminal display using Python and OpenCV.
It includes our two recent papers on text-to-video retrieval along with a technical report.
Youtube video to text generation
ONLY FRONT-END DISPLAY
A Python tool for transcribing videos using Whisper
This repository is an implementation of the Wav2Vec2 model for converting speech into text through a series of speech recognition, noise removal and STT to transcribe the text from a video file.
Convert YouTube videos to text files. Why spend 30 minutes watching a video when you can skim the transcript in a couple minutes?
📼 A streamlit web interface designed to extract words from video/audio files into text • Python, FFmpeg, Whisper, YT-DLP
Unlimited Youtube-Transcript-Generator
Convert a video file or camera captured to display as text.