This user does not exist or has been deactivated (amazingYX)

amazingYX

Geek Repo

Company: xidian


Repositories of this user (account does not exist or has been deactivated)

ALCE

[EMNLP 2023] Enabling Large Language Models to Generate Text with Citations. Paper: https://arxiv.org/abs/2305.14627

Language: Python | License: MIT | Stargazers: 0 | Issues: 0

Awesome-Captioning

A curated list of multimodal captioning research (including image captioning, video captioning, and text captioning)

Stargazers: 0 | Issues: 0

Awesome-LLM-Uncertainty-Reliability-Robustness

Awesome-LLM-Robustness: a curated list of Uncertainty, Reliability and Robustness in Large Language Models

License: MIT | Stargazers: 0 | Issues: 0

awesome-multimodal-ml

Reading list for research topics in multimodal machine learning

License: MIT | Stargazers: 0 | Issues: 0

awesome-trustworthy-deep-learning

A curated list of trustworthy deep learning papers. Daily updating...

License: MIT | Stargazers: 0 | Issues: 0

awesome-uncertainty-deeplearning

This repository contains a collection of surveys, datasets, papers, and code for predictive uncertainty estimation in deep learning models.

License: MIT | Stargazers: 0 | Issues: 0

CapDec

CapDec: SOTA Zero Shot Image Captioning Using CLIP and GPT2, EMNLP 2022 (findings)

Language: Python | License: MIT | Stargazers: 0 | Issues: 0

CLIP-ViL

[ICLR 2022] code for "How Much Can CLIP Benefit Vision-and-Language Tasks?" https://arxiv.org/abs/2107.06383

Language: Python | License: MIT | Stargazers: 0 | Issues: 0

CLIP_prefix_caption

Simple image captioning model

Language: Jupyter Notebook | License: MIT | Stargazers: 0 | Issues: 0

ClipCap-Chinese

An image captioning model based on ClipCap

Language: Python | Stargazers: 0 | Issues: 0

CLIP

CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image

License: MIT | Stargazers: 0 | Issues: 0

Controllable_Region_Pointer_Advancement

PyTorch implementation of a Controllable Image Captioning model with a language-driven mechanism for advancing the region pointer state that keeps it in sync with the state of the language model.

License: BSD-2-Clause | Stargazers: 0 | Issues: 0

D3

The implementation for ACL 2022 paper

Language: Python | Stargazers: 0 | Issues: 0

denoising-diffusion-pytorch

Implementation of Denoising Diffusion Probabilistic Model in Pytorch

Language: Python | License: MIT | Stargazers: 0 | Issues: 0
Language: Python | License: BSD-3-Clause | Stargazers: 0 | Issues: 0

DPR

Dense Passage Retriever (DPR) is a set of tools and models for open-domain Q&A tasks.

Language: Python | License: NOASSERTION | Stargazers: 0 | Issues: 0

DRL

Deep Reinforcement Learning

License: NOASSERTION | Stargazers: 0 | Issues: 0

EMNLP-2023-Papers

EMNLP 2023 Papers: Explore cutting-edge research from EMNLP 2023, the premier conference for advancing empirical methods in natural language processing. Stay updated on the latest in machine learning, deep learning, and natural language processing with code included. :star: support NLP!

License: MIT | Stargazers: 0 | Issues: 0

ER-SAN

Implementation of our IJCAI2022 oral paper, ER-SAN: Enhanced-Adaptive Relation Self-Attention Network for Image Captioning.

Language: Python | Stargazers: 0 | Issues: 0

ImageCaptioning.pytorch

I decided to sync up this repo and self-critical.pytorch. (The old master is kept in the old master branch as an archive.)

Language: Python | License: MIT | Stargazers: 0 | Issues: 0

LaTeX-OCR

pix2tex: Using a ViT to convert images of equations into LaTeX code.

License: MIT | Stargazers: 0 | Issues: 0

MLAT

Official PyTorch implementation of the paper "Remote Sensing Image Captioning Based on Multi-Layer Aggregated Transformer"

Stargazers: 0 | Issues: 0

mynote

Stores pictures used in my notes

Stargazers: 0 | Issues: 0

Paper-Reading

📖 Paper reading list in dialogue systems and natural language generation (constantly updating 🤗).

Stargazers: 0 | Issues: 0

region-hierarchical-pytorch

Implementation of a baseline method for image paragraph captioning

Language: Python | Stargazers: 0 | Issues: 0

RSTNet

RSTNet: Captioning with Adaptive Attention on Visual and Non-Visual Words (CVPR2021)

Language: Python | License: BSD-3-Clause | Stargazers: 0 | Issues: 0

TAADpapers

Must-read Papers on Textual Adversarial Attack and Defense

License: MIT | Stargazers: 0 | Issues: 0

visualization

A collection of visualization functions

License: MIT | Stargazers: 0 | Issues: 0

WordSent

This is the source code of "Word-Sentence Framework for Remote Sensing Image Captioning, TGRS2020".

Stargazers: 0 | Issues: 0

xmodaler

X-modaler is a versatile and high-performance codebase for cross-modal analytics (e.g., image captioning, video captioning, vision-language pre-training, visual question answering, visual commonsense reasoning, and cross-modal retrieval).

Language: Python | License: NOASSERTION | Stargazers: 0 | Issues: 0