MansourGu's repositories
aasist
Official PyTorch implementation of "AASIST: Audio Anti-Spoofing using Integrated Spectro-Temporal Graph Attention Networks"
ast
Code for the Interspeech 2021 paper "AST: Audio Spectrogram Transformer".
CrowdCounting-P2PNet
The official code for the ICCV 2021 oral presentation "Rethinking Counting and Localization in Crowds: A Purely Point-Based Framework"
PaperCode
Reproductions of code from papers
DAVAR-Lab-OCR
OCR toolbox from Davar-Lab
DSFD-Pytorch-Inference
A high-performance PyTorch implementation of face detection models, including RetinaFace and DSFD
GVAED
Generalized Video Anomaly Event Detection: Systematic Taxonomy and Comparison of Deep Models
how_attentive_are_gats
Code for the paper "How Attentive are Graph Attention Networks?"
Mask_RCNN
Mask R-CNN for object detection and instance segmentation on Keras and TensorFlow
MIST_VAD
Official code for the CVPR 2021 paper "MIST: Multiple Instance Self-Training Framework for Video Anomaly Detection"
MoViNet
MoViNets PyTorch implementation: Mobile Video Networks for Efficient Video Recognition
OpenLabeling
Label images and video for Computer Vision applications
PaddleOCR
Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and deployment among server, mobile, embedded and IoT devices)
paper-reading
Paragraph-by-paragraph close readings of classic and new deep learning papers
pydensecrf
Python wrapper to Philipp Krähenbühl's dense (fully connected) CRFs with gaussian edge potentials.
pytorch-resnet3d
I3D Nonlocal ResNets in PyTorch
QMagFace
QMagFace: Simple and Accurate Quality-Aware Face Recognition (WACV 2023)
RTFM
Official code for 'Weakly-supervised Video Anomaly Detection with Robust Temporal Feature Magnitude Learning' [ICCV 2021]
S3R
video anomaly detection
Speech-Resources
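Speech-related labs, companies, resources, internships, and more; recommendations and self-nominations are welcome (listed in no particular order)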
ssast
Code for the AAAI 2022 paper "SSAST: Self-Supervised Audio Spectrogram Transformer".
TableMASTER-mmocr
2nd-place solution of the ICDAR 2021 Competition on Scientific Literature Parsing, Task B.
TwoStreamSepConvLSTM_ViolenceDetection
Code for the paper: "Efficient Two-Stream Network for Violence Detection Using Separable Convolutional LSTM"
u-net-aerial-imagery-segmentation
Semantic Segmentation of MBRSC Aerial Imagery of Dubai Using a TensorFlow U-Net Model in Python
Variations-of-SFANet-for-Crowd-Counting
The official implementation of "Encoder-Decoder Based Convolutional Neural Networks with Multi-Scale-Aware Modules for Crowd Counting"
yoloface
Yolov5 Face Detection
yolov5-face
YOLO5Face: Why Reinventing a Face Detector (https://arxiv.org/abs/2105.12931, ECCV Workshops 2022)
yolov7
Implementation of the paper "YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors"
yolov7-face
YOLOv7 face detection with landmarks