hcwei's repositories

Language:PythonStargazers:0Issues:0Issues:0

CLIP

CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image

Language:Jupyter NotebookLicense:MITStargazers:0Issues:0Issues:0
Language:PythonLicense:NOASSERTIONStargazers:0Issues:1Issues:0

d2l-zh

《动手学深度学习》:面向中文读者、能运行、可讨论。中英文版被55个国家的300所大学用于教学。

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0
Stargazers:0Issues:1Issues:0

Deformable-DETR

Deformable DETR: Deformable Transformers for End-to-End Object Detection.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

detectron2

Detectron2 is FAIR's next-generation platform for object detection, segmentation and other visual recognition tasks.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

energy-based-scene-graph

Code release for Energy-Based Learning for Scene Graph Genertaion

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:0Issues:0Issues:0

fairseq

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0
Language:HTMLStargazers:0Issues:1Issues:0

image-caption-metrics

a py3 lib for NLP & image-caption metrics : BLEU METEOR CIDEr ROUGE SPICE WMD

Language:PythonStargazers:0Issues:0Issues:0

image-captioning-DLCT

Official pytorch implementation of paper "Duel-Level Collaborative Transformer for Image Captioning" (AAAI 2021).

Language:Jupyter NotebookLicense:BSD-3-ClauseStargazers:0Issues:0Issues:0

ImageCaptioning.pytorch

I decide to sync up this repo and self-critical.pytorch. (The old master is in old master branch for archive)

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

learngit

Git使用记录

Stargazers:0Issues:1Issues:0

meshed-memory-transformer

Meshed-Memory Transformer for Image Captioning. CVPR 2020

Language:PythonLicense:BSD-3-ClauseStargazers:0Issues:0Issues:0

mmf

A modular framework for vision & language multimodal research from Facebook AI Research (FAIR)

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

models

Pre-trained and Reproduced Deep Learning Models (『飞桨』官方模型库,包含多种学术前沿和工业场景验证的深度学习模型)

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

PaddleHub

Awesome pre-trained models toolkit based on PaddlePaddle.(300+ models including Image, Text, Audio and Video with Easy Inference & Serving deployment)

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

PaddleMM

Multi-Modal learning toolkit based on PaddlePaddle and PyTorch, supporting multiple applications such as multi-modal classification, cross-modal retrieval and image caption.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

pytorch-cifar100

Practice on cifar100(ResNet, DenseNet, VGG, GoogleNet, InceptionV3, InceptionV4, Inception-ResNetv2, Xception, Resnet In Resnet, ResNext,ShuffleNet, ShuffleNetv2, MobileNet, MobileNetv2, SqueezeNet, NasNet, Residual Attention Network, SENet, WideResNet)

Language:PythonStargazers:0Issues:0Issues:0

pytorch-CycleGAN-and-pix2pix

Image-to-Image Translation in PyTorch

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

PyTorch-Networks

Pytorch implementation of cnn network

Language:PythonStargazers:0Issues:0Issues:0

pytorch-sentiment-analysis

Tutorials on getting started with PyTorch and TorchText for sentiment analysis.

Language:Jupyter NotebookLicense:MITStargazers:0Issues:0Issues:0

scene_graph_benchmark

image scene graph generation benchmark

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

self-critical.pytorch

Unofficial pytorch implementation for Self-critical Sequence Training for Image Captioning. and others.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

SID-Paddle

This is a Paddle version of Learning to See in the Dark, CVPR 2018.

Language:PythonStargazers:0Issues:0Issues:0

SparseR-CNN

End-to-End Object Detection with Learnable Proposal, CVPR2021

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

Swin-ImageCaption

Implementation of 'End-to-End Transformer Based Model for Image Captioning' [AAAI 2022]

Language:Jupyter NotebookStargazers:0Issues:0Issues:0

vscode-git-Docker-Remote-

vscode上git、Docker和Remote的使用方法

Stargazers:0Issues:1Issues:0

xmodaler

X-modaler is a versatile and high-performance codebase for cross-modal analytics(e.g., image captioning, video captioning, vision-language pre-training, visual question answering, visual commonsense reasoning, and cross-modal retrieval).

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0