Repositories of a user who does not exist or has deactivated their account
ALCE
[EMNLP 2023] Enabling Large Language Models to Generate Text with Citations. Paper: https://arxiv.org/abs/2305.14627
Awesome-Captioning
A curated list of multimodal captioning research (including image captioning, video captioning, and text captioning)
Awesome-LLM-Uncertainty-Reliability-Robustness
Awesome-LLM-Robustness: a curated list of Uncertainty, Reliability and Robustness in Large Language Models
awesome-multimodal-ml
Reading list for research topics in multimodal machine learning
awesome-trustworthy-deep-learning
A curated list of trustworthy deep learning papers, updated daily.
awesome-uncertainty-deeplearning
This repository contains a collection of surveys, datasets, papers, and codes, for predictive uncertainty estimation in deep learning models.
CapDec
CapDec: SOTA Zero Shot Image Captioning Using CLIP and GPT2, EMNLP 2022 (findings)
CLIP-ViL
[ICLR 2022] code for "How Much Can CLIP Benefit Vision-and-Language Tasks?" https://arxiv.org/abs/2107.06383
CLIP_prefix_caption
Simple image captioning model
ClipCap-Chinese
An image captioning model based on ClipCap
CLIP
CLIP (Contrastive Language-Image Pretraining): predicts the most relevant text snippet given an image
Controllable_Region_Pointer_Advancement
PyTorch implementation of a Controllable Image Captioning model with a language-driven mechanism for advancing the region pointer state that keeps it in sync with the state of the language model.
D3
The implementation for an ACL 2022 paper
denoising-diffusion-pytorch
Implementation of Denoising Diffusion Probabilistic Model in Pytorch
DPR
Dense Passage Retriever (DPR) is a set of tools and models for open-domain Q&A tasks.
DRL
Deep Reinforcement Learning
EMNLP-2023-Papers
EMNLP 2023 Papers: Explore cutting-edge research from EMNLP 2023, the premier conference for advancing empirical methods in natural language processing. Stay updated on the latest in machine learning, deep learning, and natural language processing with code included. :star: support NLP!
ER-SAN
Implementation of our IJCAI2022 oral paper, ER-SAN: Enhanced-Adaptive Relation Self-Attention Network for Image Captioning.
ImageCaptioning.pytorch
I decided to sync up this repo with self-critical.pytorch. (The old master is kept in the old master branch as an archive.)
LaTeX-OCR
pix2tex: Using a ViT to convert images of equations into LaTeX code.
MLAT
Official PyTorch implementation of the paper "Remote Sensing Image Captioning Based on Multi-Layer Aggregated Transformer"
mynote
Stores pictures for my notes
Paper-Reading
📖 Paper reading list in dialogue systems and natural language generation (constantly updating 🤗).
region-hierarchical-pytorch
Implementation of a baseline method for image paragraph captioning
RSTNet
RSTNet: Captioning with Adaptive Attention on Visual and Non-Visual Words (CVPR2021)
TAADpapers
Must-read Papers on Textual Adversarial Attack and Defense
visualization
A collection of visualization functions
WordSent
This is the source code of "Word-Sentence Framework for Remote Sensing Image Captioning, TGRS2020".
xmodaler
X-modaler is a versatile and high-performance codebase for cross-modal analytics (e.g., image captioning, video captioning, vision-language pre-training, visual question answering, visual commonsense reasoning, and cross-modal retrieval).