hcwei's repositories
CLIP
CLIP (Contrastive Language-Image Pretraining): predict the most relevant text snippet given an image
d2l-zh
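Dive into Deep Learning (《动手学深度学习》): written for Chinese readers, runnable, and open to discussion. The Chinese and English editions are used for teaching at 300 universities in 55 countries.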
Deformable-DETR
Deformable DETR: Deformable Transformers for End-to-End Object Detection.
detectron2
Detectron2 is FAIR's next-generation platform for object detection, segmentation and other visual recognition tasks.
energy-based-scene-graph
Code release for Energy-Based Learning for Scene Graph Generation
fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
image-caption-metrics
A Python 3 library for NLP and image-captioning metrics: BLEU, METEOR, CIDEr, ROUGE, SPICE, WMD
image-captioning-DLCT
Official PyTorch implementation of the paper "Dual-Level Collaborative Transformer for Image Captioning" (AAAI 2021).
ImageCaptioning.pytorch
I decided to sync this repo with self-critical.pytorch. (The old master is kept in the old master branch as an archive.)
meshed-memory-transformer
Meshed-Memory Transformer for Image Captioning. CVPR 2020
mmf
A modular framework for vision & language multimodal research from Facebook AI Research (FAIR)
models
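Pre-trained and Reproduced Deep Learning Models (the official PaddlePaddle model zoo, containing a variety of deep learning models from cutting-edge academic research and validated in industrial scenarios)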
PaddleHub
Awesome pre-trained models toolkit based on PaddlePaddle. (300+ models including Image, Text, Audio and Video with Easy Inference & Serving deployment)
PaddleMM
Multi-Modal learning toolkit based on PaddlePaddle and PyTorch, supporting multiple applications such as multi-modal classification, cross-modal retrieval and image captioning.
pytorch-cifar100
Practice on CIFAR-100 (ResNet, DenseNet, VGG, GoogleNet, InceptionV3, InceptionV4, Inception-ResNetv2, Xception, Resnet In Resnet, ResNext, ShuffleNet, ShuffleNetv2, MobileNet, MobileNetv2, SqueezeNet, NasNet, Residual Attention Network, SENet, WideResNet)
pytorch-CycleGAN-and-pix2pix
Image-to-Image Translation in PyTorch
PyTorch-Networks
PyTorch implementations of CNN networks
pytorch-sentiment-analysis
Tutorials on getting started with PyTorch and TorchText for sentiment analysis.
scene_graph_benchmark
image scene graph generation benchmark
self-critical.pytorch
Unofficial PyTorch implementation of Self-critical Sequence Training for Image Captioning, and others.
SID-Paddle
This is a Paddle version of Learning to See in the Dark, CVPR 2018.
SparseR-CNN
End-to-End Object Detection with Learnable Proposal, CVPR 2021
Swin-ImageCaption
Implementation of 'End-to-End Transformer Based Model for Image Captioning' [AAAI 2022]
vscode-git-Docker-Remote-
How to use Git, Docker, and Remote in VS Code
xmodaler
X-modaler is a versatile and high-performance codebase for cross-modal analytics (e.g., image captioning, video captioning, vision-language pre-training, visual question answering, visual commonsense reasoning, and cross-modal retrieval).