There are 0 repository under text-image topic.
The editor for ASCII-graphics, combining a graphical editor and an image to text converter. Decorate your text and surprise your readers with an original social media post or blog post using ASCII graphics. The tool does not require an internet connection and can work offline in a browser.
A paper collection of recent diffusion models for text-image generation tasks, e,g., visual text generation, font generation, text removal, text image super resolution, text editing, handwritten generation, scene text recognition and scene text detection.
[CVPR 2024] Official code for "Text-Driven Image Editing via Learnable Regions"
Image Textualization: An Automatic Framework for Generating Rich and Detailed Image Descriptions (NeurIPS 2024)
[BMVC 2023] Zero-shot Composed Text-Image Retrieval
This repository features three demos that can be effortlessly integrated into your AWS environment. They serve as a practical guide to leveraging AWS services for crafting a sophisticated Large Language Model (LLM) Generative AI, geared towards creating a responsive Question and Answer Bot and localizing content generation.
RichTextVista is a high-performance, extensible rich text component designed for HarmonyOS applications, supporting diverse rich text styling capabilities.
A fine tune version of Stable Diffusion model on self-translate 10k diffusiondb Chinese Corpus and "extend" it
ๅพฎไฟกๅฐ็จๅบ็ๅพๆ็ผ่พๅ่ฝ๏ผๅฏ้ๅฏนๅไธช่พๅ ฅๆก็ๆๅญ่ฟ่ก็ฎๅๆ ทๅผ่ฐๆด๏ผๅจๆๅญไธญ้ดๆๅ ฅใๅ ้คๅพ็๏ผ
Line and Word Segmentation for Bangla Handwritten Text Recognition
iOS ๅฏๆๆฌ็ผ่พ๏ผๅ็ๅพๆๆททๆ ๅพๆๅนถ่ NSAttributedString่ฝฌhtml html่ฝฌNSAttributedString base64ๅพ็ไธไผ ,Rich Text Editor
A Light Neural Network To Control Stable Diffusion Spatial Information tuned by Chinese
This repository is based on the work done for the Bangla Handwritten Line Segmentation
Use CLIP to create matching texts + embeddings for given images; useful for XAI, adversarial training
๐๐๐ Text image can "textify" text, images, and videos, and can be used with simple configuration ๅฎๅฏไปฅๅฐๆๅญใๅพ็ใ่ง้ข่ฟ่กใๆๆฌๅใ ๅช้่ฆ้่ฟ็ฎๅ็้ ็ฝฎๅณๅฏไฝฟ็จ
A small script for CLIP attn entropy plots
Paging menu controller having text and imageview in the Tab
lmmtoolkit is a toolkit for Multi-Modal Learning
Adversarial learning system which generate image from text description using self-attention modules
Software tool that compresses text binary images (lossless compression) to less than 0.002% of their original size on average.
Replication Code for: Making Text-Image Connection Formal and Practical
Super Portrait Engine
Text-Image-Text is a bidirectional system that enables seamless retrieval of images based on text descriptions, and vice versa. It leverages state-of-the-art language and vision models to bridge the gap between textual and visual representations.
This project represents a graphic design technique that uses printable characters from the ASCII standard to create images and animations.
11000-Image-Video-caption-data-of-human-action
20011--Image-Caption-Data-Of-OCR-In-Natural-Scenes
To Fuse Semantic and Positional Clues with Cross-Attention for Scene Text Recognition
A PyTorch implementation of "TextFuseNet: Scene Text Detection with Richer Fused Features".
To track the latest paper for embedding (including text/text-code/text-image embeddings)