There are 0 repository under video-text-recognition topic.
The Paper List of Large Multi-Modality Model (Perception, Generation, Unification), Parameter-Efficient Finetuning, Vision-Language Pretraining, Conventional Image-Text Matching for Preliminary Insight.
Text from the video is extracted and saved into a .docx file in the form of notes.