He Huang's starred repositories

DeepFaceLab

DeepFaceLab is the leading software for creating deepfakes.

Language:PythonLicense:GPL-3.0Stargazers:45251Issues:1124Issues:1342

pytorch-image-models

PyTorch image models, scripts, pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (ViT), MobileNet-V3/V2, RegNet, DPN, CSPNet, Swin Transformer, MaxViT, CoAtNet, ConvNeXt, and more

Language:PythonLicense:Apache-2.0Stargazers:29672Issues:304Issues:862

CLIP

CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image

Language:Jupyter NotebookLicense:MITStargazers:22017Issues:310Issues:371

PlotNeuralNet

Latex code for making neural networks diagrams

unilm

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Language:PythonLicense:MITStargazers:18275Issues:294Issues:1274

Swin-Transformer

This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".

Language:PythonLicense:MITStargazers:12905Issues:126Issues:296

detr

End-to-End Object Detection with Transformers

Language:PythonLicense:Apache-2.0Stargazers:12800Issues:149Issues:526

NeMo

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Language:PythonLicense:Apache-2.0Stargazers:10000Issues:184Issues:2044

espnet

End-to-End Speech Processing Toolkit

Language:PythonLicense:Apache-2.0Stargazers:7858Issues:177Issues:2299

gans-awesome-applications

Curated list of awesome GAN applications and demo

Deformable-DETR

Deformable DETR: Deformable Transformers for End-to-End Object Detection.

Language:PythonLicense:Apache-2.0Stargazers:2908Issues:31Issues:215

FSL-Mate

FSL-Mate: A collection of resources for few-shot learning (FSL).

pytorch_GAN_zoo

A mix of GAN implementations including progressive growing

Language:PythonLicense:BSD-3-ClauseStargazers:1594Issues:33Issues:65

awesome-contrastive-self-supervised-learning

A comprehensive list of awesome contrastive self-supervised learning papers.

VMZ

VMZ: Model Zoo for Video Modeling

Language:PythonLicense:Apache-2.0Stargazers:1033Issues:110Issues:123

Zooming-Slow-Mo-CVPR-2020

Fast and Accurate One-Stage Space-Time Video Super-Resolution (accepted in CVPR 2020)

Language:PythonLicense:GPL-3.0Stargazers:894Issues:31Issues:68

TimeSformer-pytorch

Implementation of TimeSformer from Facebook AI, a pure attention-based solution for video classification

Language:PythonLicense:MITStargazers:665Issues:17Issues:18

Transformer-SSL

This is an official implementation for "Self-Supervised Learning with Swin Transformers".

Language:PythonLicense:MITStargazers:597Issues:6Issues:19

few-shot-classification-leaderboard

Leaderboards for few-shot image classification on miniImageNet, tieredImageNet, FC100, and CIFAR-FS.

Language:HTMLLicense:CC0-1.0Stargazers:373Issues:15Issues:7

CVPR-2023-Papers

CVPR 2023 Papers: Dive into advanced research presented at the leading computer vision conference. Keep up to date with the latest developments in computer vision and deep learning. Code included. ⭐ support visual intelligence development!

Language:PythonLicense:MITStargazers:244Issues:5Issues:0

STAM

Official implementation of "An Image is Worth 16x16 Words, What is a Video Worth?" (2021 paper)

Language:PythonLicense:Apache-2.0Stargazers:219Issues:11Issues:20

taskmodularnets

A set of neural network modules, which are small fully connected layers operating in semantic concept space. These modules are configured through a gating function conditioned on the task to produce features representing the compatibility between the input image and the concept under consideration.

Language:PythonLicense:NOASSERTIONStargazers:58Issues:11Issues:5

jsalt2020_simulate

Training data simulation

Language:PythonLicense:Apache-2.0Stargazers:32Issues:10Issues:6

video-transformers

Implementations of Transformers for Video

Language:PythonLicense:Apache-2.0Stargazers:23Issues:4Issues:0

eccv2020-limited-labels-data-tutorial

ECCV 2020 Tutorial: New Frontiers for Learning with Limited Labels or Data

Language:CSSLicense:MITStargazers:8Issues:11Issues:0
Language:Jupyter NotebookLicense:MITStargazers:6Issues:1Issues:0

multi_label_zsl

Official implementation of the paper "Multi-label Zero-shot Classification by Learning to Transfer from External Knowledge" (BMVC'20, oral).

Language:PythonLicense:MITStargazers:5Issues:1Issues:0