Beast code in Giters

data's repositories

a-PyTorch-Tutorial-to-Image-Captioning

Show, Attend, and Tell | a PyTorch Tutorial to Image Captioning

Language:PythonMIT000

attention-is-all-you-need-pytorch

A PyTorch implementation of the Transformer model in "Attention is All You Need".

Language:PythonMIT000

Bidirectional_DALLE

Translation-equivariant Image Quantizer for Bi-directional Image-Text Generation, Stage 2

Language:PythonMIT000

CLIP

Contrastive Language-Image Pretraining

MIT000

clip-gen

CLIP-GEN: Language-Free Training of a Text-to-Image Generator with CLIP

MIT000

CogView2

official code repo for paper "CogView2: Faster and Better Text-to-Image Generation via Hierarchical Transformers"

Apache-2.0000

EVP

Code for paper 'Audio-Driven Emotional Video Portraits'.

000

glide-text2im

GLIDE: a diffusion-based text-conditional image synthesis model

MIT000

lantern

Lantern官方版本下载蓝灯翻墙代理科学上网外网加速器梯子路由 lantern proxy vpn censorship-circumvention censorship gfw accelerator

Language:Go000

MKGformer

Code for the SIGIR 2022 paper "Hybrid Transformer with Multi-level Fusion for Multimodal Knowledge Graph Completion"

MIT000

pytorch-tutorial

PyTorch Tutorial for Deep Learning Researchers

MIT000

Semantic-Communication-Systems

pytorch implementation of "Deep Learning-Enabled Semantic Communication Systems with Task-Unaware Transmitter and Dynamic Data"

MIT000

Style-AttnGAN

Improves Text to Image synthesis from AttnGAN by integrating the scale-specific control from StyleGAN; can optionally use GPT-2 as text encoder

NOASSERTION000

TE-VQGAN

Translation-equivariant Image Quantizer for Bi-directional Image-Text Generation, Stage 1

MIT000

Text-to-Image-ReIdentification

A pytorch re-implementation attempt of paper "Improving description-based person re-identification by multi-granularity image-text alignment." by Niu et al. (partially implemented)

000

Text2Video

ICASSP 2022: "Text2Video: text-driven talking-head video synthesis with phonetic dictionary"

000

Thin-Plate-Spline-Motion-Model

[CVPR 2022] Thin-Plate Spline Motion Model for Image Animation.

MIT000

train-CLIP

A PyTorch Lightning solution to training OpenAI's CLIP from scratch.

MIT000

[ICCV 2021- Oral] Official PyTorch implementation for Generic Attention-model Explainability for Interpreting Bi-Modal and Encoder-Decoder Transformers, a novel method to visualize any Transformer-based network. Including examples for DETR, VQA.

MIT000

TuPian

000

txt2vid

NOASSERTION000

VQA_ReGAT1

Language:PythonMIT000

ZS-F-VQA

Code and Data for the paper: Zero-shot Visual Question Answering using Knowledge Graph [ ISWC 2021 ]

MIT000

Lily11223344

data's repositories

a-PyTorch-Tutorial-to-Image-Captioning

attention-is-all-you-need-pytorch

Bidirectional_DALLE

CLIP

clip-gen

CogView2

DDPM

DeepSC-ST_demonstration

EVP

glide-text2im

lantern

MKGformer

pytorch-tutorial

qll

ReasoningConsistency-VQA

Semantic-Communication-Systems

Style-AttnGAN

swapmix

TE-VQGAN

Test-Git

Test-gitb

Text-to-Image-ReIdentification

Text2Video

Thin-Plate-Spline-Motion-Model

train-CLIP

Transformer-MM-Explainability

TuPian

txt2vid

VQA_ReGAT1

ZS-F-VQA