A simple CUDA-accelerated implementation of band attention for faster Diffusion Transformers in sequential generation tasks.
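
For readers new to the idea, here is a minimal pure-Python sketch of band attention, not this repository's CUDA kernel. It assumes a symmetric band in which query position i attends only to key positions j with |i - j| <= bandwidth; the function name `band_attention` and the `bandwidth` parameter are illustrative choices and may not match the actual API or band definition used here.

```python
import math


def band_attention(q, k, v, bandwidth):
    """Single-head attention restricted to a band around the diagonal.

    q, k: lists of n vectors of length d; v: list of n vectors.
    Assumption: query i sees keys j with |i - j| <= bandwidth.
    """
    n = len(q)
    d = len(q[0])
    scale = 1.0 / math.sqrt(d)
    out = []
    for i in range(n):
        lo = max(0, i - bandwidth)
        hi = min(n, i + bandwidth + 1)
        # Scaled dot-product scores, computed only inside the band.
        scores = [
            scale * sum(a * b for a, b in zip(q[i], k[j]))
            for j in range(lo, hi)
        ]
        # Softmax over the band; subtract the max for numerical stability.
        m = max(scores)
        weights = [math.exp(s - m) for s in scores]
        total = sum(weights)
        # Weighted sum of the value vectors inside the band.
        row = [0.0] * len(v[0])
        for w, j in zip(weights, range(lo, hi)):
            for c, x in enumerate(v[j]):
                row[c] += (w / total) * x
        out.append(row)
    return out
```

Each query touches at most 2 * bandwidth + 1 keys, so the cost grows as O(n * bandwidth) rather than O(n^2) for full attention. That reduction is what a banded kernel can exploit on long sequences.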