xxayt

A PyTorch implementation of "MetaFormer: A Unified Meta Framework for Fine-Grained Recognition". A reference PyTorch implementation of "CoAtNet: Marrying Convolution and Attention for All Data Sizes".

Language: Python | License: MIT

scnni

Simple Convolution Neural Network Inference Framework

Language: C++

ViLBERT

Language: Jupyter Notebook

ViLT

Code for the ICML 2021 (long talk) paper: "ViLT: Vision-and-Language Transformer Without Convolution or Region Supervision"

Language: Python | License: Apache-2.0

ConvNeXt

Code release for ConvNeXt model

Language: Python | License: MIT

drawio


MCAN

Deep Modular Co-Attention Networks for Visual Question Answering (VQA)

Language: Python | License: Apache-2.0

NLP-Interview-Notes

This repository mainly collects interview questions for NLP algorithm engineers.


nothing


PromptSwitch

Language: Python

pytorch-image-models

PyTorch image models, scripts, pretrained weights -- ResNet, ResNeXT, EfficientNet, EfficientNetV2, NFNet, Vision Transformer, MixNet, MobileNet-V3/V2, RegNet, DPN, CSPNet, and more

Language: Python | License: Apache-2.0

Swin-Transformer-Object-Detection

This is an official implementation of Swin Transformer for Object Detection and Instance Segmentation. In addition, xzj added the ConvNeXt model.

Language: Python | License: Apache-2.0

TeachCLIP

Source code for our CVPR 2024 paper, TeachCLIP, for Text-to-Video Retrieval


UT-CMVMR

License: NOASSERTION

ViLBERT-Multi-Task

Multi Task Vision and Language

Language: Jupyter Notebook | License: MIT

xxayt.github.io

Language: HTML | License: MIT