HHHH17's repositories
smallcap
SmallCap: Lightweight Image Captioning Prompted with Retrieval Augmentation
CLIP_prefix_caption
Simple image captioning model
ExpansionNet_v2
Implementation code of the work "ExpansionNet v2: Block Static Expansion in fast end to end training for Image Captioning"
grit
GRIT: Faster and Better Image-captioning Transformer (ECCV 2022)
HHHH17
Config files for my GitHub profile.
PureT
Implementation of 'End-to-End Transformer Based Model for Image Captioning' [AAAI 2022]
Xmodal-Ctx
Official PyTorch implementation of our CVPR 2022 paper: Beyond a Pre-Trained Object Detector: Cross-Modal Textual and Visual Context for Image Captioning
xmodaler
X-modaler is a versatile and high-performance codebase for cross-modal analytics(e.g., image captioning, video captioning, vision-language pre-training, visual question answering, visual commonsense reasoning, and cross-modal retrieval).
copyisallyouneed
Code for our ACL2021 paper Neural Machine Translation with Monolingual Translation Memory
kg-2019
2019年百度的三元组抽取比赛,“科学空间队”源码