mscoco

There are 3 repositories under mscoco topic.

microsoft / Swin-Transformer
This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".
ade20k image-classification imagenet mask-rcnn mscoco object-detection semantic-segmentation swin-transformer
Language:Python 13668
sgrvinod / a-PyTorch-Tutorial-to-Image-Captioning
Show, Attend, and Tell | a PyTorch Tutorial to Image Captioning
pytorch pytorch-tutorial show-attend-and-tell image-captioning encoder-decoder attention-mechanism computer-vision mscoco
Language:Python 2750
SwinTransformer / Swin-Transformer-Object-Detection
This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows" on Object Detection and Instance Segmentation.
mscoco swin-transformer cascade mask-rcnn object-detection reppoints swin
Language:Python 1786
apple / ml-cvnets
CVNets: A library for training computer vision networks
ade20k classification computer-vision deep-learning detection imagenet machine-learning mscoco pascal-voc pytorch segmentation
Language:Python 1772
peteanderson80 / bottom-up-attention
Bottom-up attention model for image captioning and VQA, based on Faster R-CNN and Visual Genome
caffe captioning-images faster-rcnn image-captioning mscoco mscoco-dataset visual-question-answering vqa
Language:Jupyter Notebook 1426
HRNet / HRNet-Object-Detection
Object detection with multi-level representations generated from deep high-resolution representation learning (HRNetV2h). This is an official implementation for our TPAMI paper "Deep High-Resolution Representation Learning for Visual Recognition". https://arxiv.org/abs/1908.07919
mmdetection object-detection faster-rcnn cascade-rcnn mscoco hrnets
Language:Python 644
JDAI-CV / CoTNet
This is an official implementation for "Contextual Transformer Networks for Visual Recognition".
contextual-transformer cotnet image-classification imagenet instance-segmentation mask-rcnn mscoco object-detection semantic-segmentation vision-transformer
Language:Python 514
sacmehta / EdgeNets
This repository contains the source code of our work on designing efficient CNNs for computer vision
cnn cnn-classification object-detection semantic-segmentation mscoco cityscapes pascal-voc imagenet-dataset imagenet-classifier shufflenetv2 espnetv2 dicenet pytorch
Language:Python 412
hyz-xmaster / VarifocalNet
VarifocalNet: An IoU-aware Dense Object Detector
object-detection dense-object-detection varifocal-loss focal-loss mscoco varifocalnet
Language:Python 348
ViTAE-Transformer / ViTAE-Transformer
The official repo for [NeurIPS'21] "ViTAE: Vision Transformer Advanced by Exploring Intrinsic Inductive Bias" and [IJCV'22] "ViTAEv2: Vision Transformer Advanced by Exploring Inductive Bias for Image Recognition and Beyond"
deep-learning vision-transformer imagenet ade20k imagenet-classification mscoco object-detection semantic-segmentation vitae-transformer
Language:Python 249
hyz-xmaster / swa_object_detection
SWA Object Detection
object-detection instance-segmentation mscoco mmdetection deep-neural-networks
Language:Python 248
MichiganCOG / ViP
Video Platform for Action Recognition and Object Detection in Pytorch
pytorch action-recognition object-detection c3d neural-networks deep-learning ssd imagenetvid mscoco resnet hmdb51 ucf101 i3d ycbb dhf1k youcook dvsa detection video-platform video-saliency
Language:Python 218
YehLi / ImageNetModel
Official ImageNet Model repository
contextual-transformer cotnet dual-vit image-classification imagenet instance-segmentation mscoco object-detection semantic-segmentation vision-transformer wave-vit
Language:Jupyter Notebook 214
hustvl / BMaskR-CNN
[ECCV 2020] Boundary-preserving Mask R-CNN
instance-segmentation mask-rcnn maskrcnn mscoco object-detection boundary-detection faster-rcnn detectron2 detectron
Language:Python 190
peteanderson80 / SPICE
Semantic Propositional Image Caption Evaluation
image-captioning captioning-images mscoco
Language:Java 135
HRNet / HRNet-FCOS
High-resolution Networks for the Fully Convolutional One-Stage Object Detection (FCOS) algorithm
hrnets object-detection mscoco fcos
Language:Python 125
ntrang086 / image_captioning
generate captions for images using a CNN-RNN model that is trained on the Microsoft Common Objects in COntext (MS COCO) dataset
image-captioning rnn-model cnn mscoco pytorch nlp computer-vision encoder-decoder
Language:Python 71
610265158 / mobilenetv3_centernet
A tensorflow implement mobilenetv3 centernet, which can be easily deployeed on android(MNN) and ios(CoreML).
centernet tensorflow mnn coreml mscoco mobilenetv3-centernet
Language:Python 70
Weed-AI / Weed-AI
A repository and interchange format for weed identification annotation
weed-recognition computer-vision datasets data-formats mscoco
Language:Python 53
labelformat
lightly-ai / labelformat
A tool for converting computer vision label formats.
annotation bounding-boxes labels mscoco object-detection pascal-voc yolo kitti yolov8
Language:Python 51
peteanderson80 / coco-caption
Adds SPICE metric to coco-caption evaluation server codes
mscoco mscoco-image-dataset mscoco-dataset image-captioning captioning-images spice
Language:Jupyter Notebook 50
oswaldoludwig / visually-informed-embedding-of-word-VIEW-
Visually informed embedding of word (VIEW) is a tool for transferring multimodal background knowledge to NLP algorithms.
embeddings word2vec mscoco nlp deep-learning multimodal-learning
Language:Python 30
consistency
utahnlp / consistency
Implementation of models in our EMNLP 2019 paper: A Logic-Driven Framework for Consistency of Neural Models
emnlp2019 nli snli mnli consistency pytorch bert mscoco logic first-order-logic loss-functions regularization
Language:Python 30
ayansengupta17 / GAN
We aim to generate realistic images from text descriptions using GAN architecture. The network that we have designed is used for image generation for two datasets: MSCOCO and CUBS.
machine-learning deep-learning python tensrorflow generative-adversarial-network gan mscoco-dataset cubs batch-normalization skip-thought-vectors deep-neural-networks text-encodings discriminator sentence mscoco loss-functions stack-gan dcgan dcgan-tensorflow
Language:HTML 20
gautamchitnis / cocoapi
Clone of COCO API - Dataset @ http://cocodataset.org/ - with changes to support Windows build and python3
mscoco mscoco-dataset cocodataset pycocotools
Language:Jupyter Notebook 18
deepplants / ViT-PCM
Official implementation of "Max Pooling with Vision Transformers reconciles class and shape in weakly supervised semantic segmentation"
mscoco pascal-voc vision-transformer weakly-supervised-segmentation
Language:Python 16
leftthomas / DeepMask
A Keras implementation of DeepMask based on NIPS 2015 paper "Learning to Segment Object Candidates"
keras python3 opencv3 mscoco
Language:Python 15
howardyclo / ImageNet2COCO
A demo for mapping class labels from ImageNet to COCO.
imagenet imagenet-dataset mscoco mscoco-dataset deep-learning zero-shot-learning one-shot-learning few-shot-learning detection
Language:Jupyter Notebook 10
CLT29 / semantic_neighborhoods
Preserving Semantic Neighborhoods for Robust Cross-modal Retrieval [ECCV 2020]
eccv2020 retrieval computer-vision cross-modal-retrieval cross-modal visual-semantic-embedding code goodnews politics mscoco-dataset mscoco coco conceptual-captions doc2vec
Language:Python 9
jakarto3d / jakarnotator
The Jakarnotator is an annotation tool to create your own database for instance segmentation problem.
annotations mscoco detectron deep-learning data training-data database instance-segmentation computer-vision
Language:JavaScript 7
nayeem8527 / Chitra-VarNan
Hindi Image Captioning
image-captioning tensorflow hindi lstm mscoco
Language:Python 7
canesee-project / Arabic-COCO
MS COCO captions in Arabic
mscoco mscoco-dataset mscoco-image-dataset image-captioning scene-recognition arabic arabic-language captions
6
VladimirSinitsin / labelme_converter
LabelMe to MsCOCO, PascalVOC, Yolo
labelme2coco labelme2voc labelme2yolo labelme mscoco pascalvoc yolo
Language:Python 6
Lukeasargen / Show-Attend-and-Tell-Pytorch-Lightning
Encoder-Decoder CNN-LSTM Model with an attention mechanism for image captioning. Trained using the Microsoft COCO Dataset.
pytorch pytorch-lightning mscoco image-captioning show-attend-and-tell attention-mechanism attention-visualization lstm text-generation encoder-decoder multimodal-learning
Language:Jupyter Notebook 5
biyoml / PyTorch-SSD
PyTorch implementation of SSD: Single Shot MultiBox Detector.
ssd pytorch ssdlite object-detection pascal-voc mscoco
Language:Python 4
shunk031 / huggingface-datasets_COCOA
COCOA: Semantic Amodal Segmentation for huggingface datasets
bsds cocoa huggingface huggingface-datasets mscoco semantic-segmentation
Language:Python 4

mscoco

microsoft / Swin-Transformer

sgrvinod / a-PyTorch-Tutorial-to-Image-Captioning

SwinTransformer / Swin-Transformer-Object-Detection

apple / ml-cvnets

peteanderson80 / bottom-up-attention

HRNet / HRNet-Object-Detection

JDAI-CV / CoTNet

sacmehta / EdgeNets

hyz-xmaster / VarifocalNet

ViTAE-Transformer / ViTAE-Transformer

hyz-xmaster / swa_object_detection

MichiganCOG / ViP

YehLi / ImageNetModel

hustvl / BMaskR-CNN

peteanderson80 / SPICE

HRNet / HRNet-FCOS

ntrang086 / image_captioning

610265158 / mobilenetv3_centernet

Weed-AI / Weed-AI

lightly-ai / labelformat

peteanderson80 / coco-caption

oswaldoludwig / visually-informed-embedding-of-word-VIEW-

utahnlp / consistency

ayansengupta17 / GAN

gautamchitnis / cocoapi

deepplants / ViT-PCM

leftthomas / DeepMask

howardyclo / ImageNet2COCO

CLT29 / semantic_neighborhoods

jakarto3d / jakarnotator

nayeem8527 / Chitra-VarNan

canesee-project / Arabic-COCO

VladimirSinitsin / labelme_converter

Lukeasargen / Show-Attend-and-Tell-Pytorch-Lightning

biyoml / PyTorch-SSD

shunk031 / huggingface-datasets_COCOA