xxbbml's starred repositories
PixArt-sigma
PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation
improved-aesthetic-predictor
CLIP+MLP Aesthetic Score Predictor
Audio-driven-TalkingFace-HeadPose
Code for "Audio-driven Talking Face Video Generation with Learning-based Personalized Head Pose" (Arxiv 2020) and "Predicting Personalized Head Movement From Short Video and Speech Signal" (TMM 2022)
AutoTransition
[ECCV 2022] AutoTransition: Learning to Recommend Video Transition Effects
vit-pytorch
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch
action_recognition
Solving UCF-101 with fastai2
mmdetection
OpenMMLab Detection Toolbox and Benchmark
pytorch-image-models
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (ViT), MobileNetV4, MobileNet-V3 & V2, RegNet, DPN, CSPNet, Swin Transformer, MaxViT, CoAtNet, ConvNeXt, and more
PoseEstimationForMobile
:dancer: Real-time single person pose estimation for Android and iOS.
lightweight-human-pose-estimation.pytorch
Fast and accurate human pose estimation in PyTorch. Contains implementation of "Real-time 2D Multi-Person Pose Estimation on CPU: Lightweight OpenPose" paper.
Yet-Another-EfficientDet-Pytorch
The pytorch re-implement of the official efficientdet with SOTA performance in real time and pretrained weights.
video-nonlocal-net
Non-local Neural Networks for Video Classification
tsn-pytorch
Temporal Segment Networks (TSN) in PyTorch
AI_Challenger_2018
AI Challenger, a platform for open datasets and programming competitions to artificial intelligence (AI) talents around the world. https://challenger.ai/
youtube-8m
Starter code for working with the YouTube-8M dataset.
PyTorch-YOLOv3
Minimal PyTorch implementation of YOLOv3
Awesome-Crowd-Counting
Awesome Crowd Counting
Pixel2Mesh
Pixel2Mesh: Generating 3D Mesh Models from Single RGB Images. In ECCV2018.