huangjun12's repositories
Awesome-Knowledge-Distillation
Awesome Knowledge-Distillation. A categorized collection of knowledge distillation papers (2014-2021).
Awesome-Multimodal-LLM
Reading list for Multimodal Large Language Models
CLIP
CLIP (Contrastive Language-Image Pretraining): predicts the most relevant text snippet given an image
ControlNet
Let us control diffusion models!
DAVAR-Lab-OCR
OCR toolbox from Davar-Lab
External-Attention-pytorch
🍀 PyTorch implementations of various attention mechanisms, MLP, re-parameterization, and convolution modules, helpful for further understanding the papers.⭐⭐⭐
LAVIS
LAVIS - A One-stop Library for Language-Vision Intelligence
LLMSurvey
The official GitHub page for the survey paper "A Survey of Large Language Models".
mmdetection
OpenMMLab Detection Toolbox and Benchmark
open_clip
An open source implementation of CLIP.
PaddleDetection
Object Detection toolkit based on PaddlePaddle. It supports object detection, instance segmentation, multiple object tracking and real-time multi-person keypoint detection.
pytorchvideo
A deep learning library for video understanding research.
stable-diffusion-webui
Stable Diffusion web UI
temporal-shift-module
[ICCV 2019] TSM: Temporal Shift Module for Efficient Video Understanding
X3D-Multigrid
PyTorch implementation of X3D models with Multigrid training.