hcwei13

followers

following

stars

hcwei's repositories

C4_Ysneaker

Language:Python000

CLIP

CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image

Language:Jupyter NotebookMIT000

CMML-Paddle

Language:PythonNOASSERTION010

d2l-zh

《动手学深度学习》：面向中文读者、能运行、可讨论。中英文版被55个国家的300所大学用于教学。

Language:PythonApache-2.0000

DeepLearning-Note

010

Deformable-DETR

Deformable DETR: Deformable Transformers for End-to-End Object Detection.

Language:PythonApache-2.0000

detectron2

Detectron2 is FAIR's next-generation platform for object detection, segmentation and other visual recognition tasks.

Language:PythonApache-2.0000

energy-based-scene-graph

Code release for Energy-Based Learning for Scene Graph Genertaion

Language:Jupyter NotebookNOASSERTION000

fairseq

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Language:PythonMIT000

hcwei13.github.io

Language:HTML010

image-caption-metrics

a py3 lib for NLP & image-caption metrics : BLEU METEOR CIDEr ROUGE SPICE WMD

Language:Python000

image-captioning-DLCT

Official pytorch implementation of paper "Duel-Level Collaborative Transformer for Image Captioning" (AAAI 2021).

Language:Jupyter NotebookBSD-3-Clause000

ImageCaptioning.pytorch

I decide to sync up this repo and self-critical.pytorch. (The old master is in old master branch for archive)

Language:PythonMIT000

learngit

Git使用记录

010

meshed-memory-transformer

Meshed-Memory Transformer for Image Captioning. CVPR 2020

Language:PythonBSD-3-Clause000

mmf

A modular framework for vision & language multimodal research from Facebook AI Research (FAIR)

Language:PythonNOASSERTION000

models

Pre-trained and Reproduced Deep Learning Models （『飞桨』官方模型库，包含多种学术前沿和工业场景验证的深度学习模型）

Language:PythonApache-2.0000

PaddleHub

Awesome pre-trained models toolkit based on PaddlePaddle.(300+ models including Image, Text, Audio and Video with Easy Inference & Serving deployment)

Language:PythonApache-2.0000

PaddleMM

Multi-Modal learning toolkit based on PaddlePaddle and PyTorch, supporting multiple applications such as multi-modal classification, cross-modal retrieval and image caption.

Language:PythonApache-2.0000

pytorch-cifar100

Practice on cifar100(ResNet, DenseNet, VGG, GoogleNet, InceptionV3, InceptionV4, Inception-ResNetv2, Xception, Resnet In Resnet, ResNext,ShuffleNet, ShuffleNetv2, MobileNet, MobileNetv2, SqueezeNet, NasNet, Residual Attention Network, SENet, WideResNet)

Language:Python000

pytorch-CycleGAN-and-pix2pix

Image-to-Image Translation in PyTorch

Language:PythonNOASSERTION000

PyTorch-Networks

Pytorch implementation of cnn network

Language:Python000

pytorch-sentiment-analysis

Tutorials on getting started with PyTorch and TorchText for sentiment analysis.

Language:Jupyter NotebookMIT000

scene_graph_benchmark

image scene graph generation benchmark

Language:PythonMIT000

self-critical.pytorch

Unofficial pytorch implementation for Self-critical Sequence Training for Image Captioning. and others.

Language:PythonMIT000

SID-Paddle

This is a Paddle version of Learning to See in the Dark, CVPR 2018.

Language:Python000

SparseR-CNN

End-to-End Object Detection with Learnable Proposal, CVPR2021

Language:PythonMIT000

Swin-ImageCaption

Implementation of 'End-to-End Transformer Based Model for Image Captioning' [AAAI 2022]

Language:Jupyter Notebook000

vscode-git-Docker-Remote-

vscode上git、Docker和Remote的使用方法

010

xmodaler

X-modaler is a versatile and high-performance codebase for cross-modal analytics(e.g., image captioning, video captioning, vision-language pre-training, visual question answering, visual commonsense reasoning, and cross-modal retrieval).

Language:PythonNOASSERTION000