There are 0 repositories under the cross-attention topic.
Unofficial implementation of "Prompt-to-Prompt Image Editing with Cross Attention Control" with Stable Diffusion
[TPAMI'23] Unifying Flow, Stereo and Depth Estimation
Pocket-Sized Multimodal AI for content understanding and generation across multilingual texts, images, and video, up to 5x faster than OpenAI CLIP and LLaVA
T-GATE: Cross-Attention Makes Inference Cumbersome in Text-to-Image Diffusion Models
Implementation of CALM from the paper "LLM Augmented LLMs: Expanding Capabilities through Composition", out of Google DeepMind
1-shot image segmentation using Stable Diffusion
Code for selecting an action based on multimodal inputs; in this case, the inputs are voice and text.
Cross attention map tools for huggingface/diffusers
The official repository of "Energy-Based Cross Attention for Bayesian Context Update in Text-to-Image Diffusion Models".
A lightweight PyTorch implementation of the Transformer-XL architecture proposed by Dai et al. (2019)
Unsupervised Hybrid Network of Transformer and CNN for Blind Hyperspectral and RGB Image Fusion
This model synthesises high-fidelity fashion videos from single images featuring spontaneous and believable movements.
Cross attention mechanism in PyTorch, C, and C++ for merging two 3D images
[ISMB 2024] Official PyTorch Code for "PhiHER2: Phenotype-informed weakly supervised model for HER2 status prediction from WSIs"
Transcription factor binding site prediction for novel DNA sequence data aiding in mutation identification and drug discovery
Clickbait detection using custom cross attention transformer model