There are 0 repositories under the text-image topic.
An editor for ASCII graphics that combines a graphical editor with an image-to-text converter. Decorate your text and surprise your readers with an original social media post or blog post using ASCII graphics. The tool does not require an internet connection and can work offline in a browser.
A paper collection of recent diffusion models for text-image generation tasks, e.g., visual text generation, font generation, text removal, text image super-resolution, text editing, handwriting generation, scene text recognition, and scene text detection.
[CVPR 2024] Official code for "Text-Driven Image Editing via Learnable Regions"
Image Textualization: An Automatic Framework for Generating Rich and Detailed Image Descriptions (NeurIPS 2024)
🐛🐛🐛 Text image can "textify" text, images, and videos, and needs only simple configuration to use.
[BMVC 2023] Zero-shot Composed Text-Image Retrieval
This repository features three demos that can be effortlessly integrated into your AWS environment. They serve as a practical guide to leveraging AWS services for crafting a sophisticated Large Language Model (LLM) Generative AI, geared towards creating a responsive Question and Answer Bot and localizing content generation.
A version of the Stable Diffusion model fine-tuned on a self-translated 10k diffusiondb Chinese corpus and then "extended"
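A text-and-image editing feature for WeChat Mini Programs: apply simple style adjustments to the text in a single input box, and insert or delete images within the text.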
Line and Word Segmentation for Bangla Handwritten Text Recognition
iOS Rich Text Editor with native mixed text-and-image layout: NSAttributedString to HTML, HTML to NSAttributedString, and base64 image upload
A Light Neural Network To Control Stable Diffusion Spatial Information, tuned on Chinese
This repository is based on the work done for the Bangla Handwritten Line Segmentation
Use CLIP to create matching texts + embeddings for given images; useful for XAI, adversarial training
lmmtoolkit is a toolkit for Multi-Modal Learning
A small script for CLIP attn entropy plots
Paging menu controller having text and imageview in the Tab
An adversarial learning system that generates images from text descriptions using self-attention modules
Software tool that compresses text binary images (lossless compression) to less than 0.002% of their original size on average.
Replication Code for: Making Text-Image Connection Formal and Practical
Super Portrait Engine
Text-Image-Text is a bidirectional system that enables seamless retrieval of images based on text descriptions, and vice versa. It leverages state-of-the-art language and vision models to bridge the gap between textual and visual representations.
This project represents a graphic design technique that uses printable characters from the ASCII standard to create images and animations.
11000-Image-Video-caption-data-of-human-action
20011--Image-Caption-Data-Of-OCR-In-Natural-Scenes
To Fuse Semantic and Positional Clues with Cross-Attention for Scene Text Recognition
A PyTorch implementation of "TextFuseNet: Scene Text Detection with Richer Fused Features".
Tracks the latest papers on embeddings (including text, text-code, and text-image embeddings)