qhfan's repositories

RMT

(CVPR2024)RMT: Retentive Networks Meet Vision Transformer

CloFormer

The official code of "Rethinking Local Perception in Lightweight Vision Transformer"

Language:PythonLicense:MITStargazers:85Issues:4Issues:3

FAT

[NeurIPS2023]Lightweight Vision Transformer with Bidirectional Interaction

ChatGPT

đź”® ChatGPT Desktop Application (Mac, Windows and Linux)

Language:RustLicense:AGPL-3.0Stargazers:1Issues:0Issues:0

segment-anything

The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:1Issues:0Issues:0

SMPConv

[CVPR2023] "SMPConv: Self-moving Point Representations for Continuous Convolution"

Language:HTMLLicense:MITStargazers:1Issues:0Issues:0

SMT

This is an official implementation for "Scale-Aware Modulation Meet Transformer".

Language:PythonLicense:MITStargazers:1Issues:0Issues:0

apex

A PyTorch Extension: Tools for easy mixed precision and distributed training in Pytorch

Language:PythonLicense:BSD-3-ClauseStargazers:0Issues:0Issues:0

argoverse-api

Official GitHub repository for Argoverse dataset

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

BiFormer

[CVPR 2023] Official code release of our paper "BiFormer: Vision Transformer with Bi-Level Routing Attention"

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

bleurt

BLEURT is a metric for Natural Language Generation based on transfer learning.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

deit

Official DeiT repository

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

EdgeNeXt

[CADL'22, ECCVW] Official repository of paper titled "EdgeNeXt: Efficiently Amalgamated CNN-Transformer Architecture for Mobile Vision Applications".

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

Efficient-AI-Backbones

Efficient AI Backbones including GhostNet, TNT and MLP, developed by Huawei Noah's Ark Lab.

Language:PythonStargazers:0Issues:0Issues:0

FocalNet

[NeurIPS 2022] Official code for "Focal Modulation Networks"

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

fourier_neural_operator

Use Fourier transform to learn operators in differential equations.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

FSQ-pytorch

A Pytorch Implementation of Finite Scalar Quantization

Language:PythonStargazers:0Issues:0Issues:0

insightface

State-of-the-art 2D and 3D Face Analysis Project

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

LITv2

[NeurIPS 2022 Spotlight] This is the official PyTorch implementation of "Fast Vision Transformers with HiLo Attention"

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

MIRL

[NeurIPS 2023] Masked Image Residual Learning for Scaling Deeper Vision Transformers

Language:PythonStargazers:0Issues:0Issues:0

ml-cvnets

CVNets: A library for training computer vision networks

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

Neighborhood-Attention-Transformer

Official NAT and DiNAT repository.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

open_clip

An open source implementation of CLIP.

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:0Issues:0Issues:0

openmixup

CAIRI Supervised, Semi- and Self-Supervised Visual Representation Learning Toolbox and Benchmark

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

pytorch-image-models

PyTorch image models, scripts, pretrained weights -- ResNet, ResNeXT, EfficientNet, EfficientNetV2, NFNet, Vision Transformer, MixNet, MobileNet-V3/V2, RegNet, DPN, CSPNet, and more

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0
Language:TypeScriptStargazers:0Issues:1Issues:0
Language:PythonStargazers:0Issues:0Issues:0

U-ViT

A PyTorch implementation of the paper "All are Worth Words: A ViT Backbone for Diffusion Models".

Language:Jupyter NotebookLicense:MITStargazers:0Issues:0Issues:0
Language:Jupyter NotebookStargazers:0Issues:0Issues:0

ViTAE-VSA

This is an official implementation for "VSA: Learning Varied-Size Window Attention in Vision Transformers"

Language:PythonStargazers:0Issues:0Issues:0