Beast code in Giters

xxbbml's starred repositories

PixArt-sigma

PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation

Language:PythonAGPL-3.0148800

improved-aesthetic-predictor

CLIP+MLP Aesthetic Score Predictor

Language:PythonApache-2.080100

Audio-driven-TalkingFace-HeadPose

Code for "Audio-driven Talking Face Video Generation with Learning-based Personalized Head Pose" (Arxiv 2020) and "Predicting Personalized Head Movement From Short Video and Speech Signal" (TMM 2022)

Language:Python71400

roop

one-click face swap

Language:PythonGPL-3.02576700

AutoTransition

[ECCV 2022] AutoTransition: Learning to Recommend Video Transition Effects

Language:Python5100

vit-pytorch

Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch

Language:PythonMIT1894800

mmaction2

OpenMMLab's Next Generation Video Understanding Toolbox and Benchmark

Language:PythonApache-2.0406100

Open Source Image and Video Restoration Toolbox for Super-resolution, Denoise, Deblurring, etc. Currently, it includes EDSR, RCAN, SRResNet, SRGAN, ESRGAN, EDVR, BasicVSR, SwinIR, ECBSR, etc. Also support StyleGAN2, DFDNet.

Language:PythonApache-2.0652100

action_recognition

Solving UCF-101 with fastai2

Language:Jupyter NotebookApache-2.02800

mmdetection

OpenMMLab Detection Toolbox and Benchmark

Language:PythonApache-2.02865700

DAIN

Depth-Aware Video Frame Interpolation (CVPR 2019)

Language:PythonMIT815700

stagesepx

detect stages in video automatically

Language:PythonMIT42800

pytorch-image-models

The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (ViT), MobileNetV4, MobileNet-V3 & V2, RegNet, DPN, CSPNet, Swin Transformer, MaxViT, CoAtNet, ConvNeXt, and more

Language:PythonApache-2.03084300

SRFlow

Official SRFlow training code: Super-Resolution using Normalizing Flow in PyTorch

Language:Jupyter NotebookNOASSERTION82400

fastpose

Open source library developed under python to estimate the 2D and 3D pose of people present on a video stream thourgh deep networks of convolutions.

Language:Jupyter Notebook3700

PoseEstimationForMobile

:dancer: Real-time single person pose estimation for Android and iOS.

Language:C++Apache-2.0100000

lightweight-human-pose-estimation.pytorch

Fast and accurate human pose estimation in PyTorch. Contains implementation of "Real-time 2D Multi-Person Pose Estimation on CPU: Lightweight OpenPose" paper.

Language:PythonApache-2.0205700

Yet-Another-EfficientDet-Pytorch

The pytorch re-implement of the official efficientdet with SOTA performance in real time and pretrained weights.

Language:Jupyter NotebookLGPL-3.0520000

video-nonlocal-net

Non-local Neural Networks for Video Classification

Language:PythonNOASSERTION196500

tsn-pytorch

Temporal Segment Networks (TSN) in PyTorch

Language:PythonBSD-2-Clause106000

MARS

MARS: Motion-Augmented RGB Stream for Action Recognition

Language:PythonMIT16100

AI_Challenger_2018

AI Challenger, a platform for open datasets and programming competitions to artificial intelligence (AI) talents around the world. https://challenger.ai/

Language:Python68000

youtube-8m

Starter code for working with the YouTube-8M dataset.

Language:PythonApache-2.0229600

PyTorch-YOLOv3

Minimal PyTorch implementation of YOLOv3

Language:PythonGPL-3.0728900

Awesome-Crowd-Counting

Awesome Crowd Counting

233900

Pixel2Mesh

Pixel2Mesh: Generating 3D Mesh Models from Single RGB Images. In ECCV2018.

Language:PythonApache-2.0162800

xxbbml