Eric Ma's repositories
2021-CV-Surveys
2021 年,计算机视觉相关综述。包括目标检测、跟踪........
ByteTrack
[ECCV 2022] ByteTrack: Multi-Object Tracking by Associating Every Detection Box
CV
本仓库将使用Pytorch框架实现经典的图像分类网络、目标检测网络、图像分割网络,图像生成网络等,并会持续更新!!!
CvT
This is an official implementation of CvT: Introducing Convolutions to Vision Transformers.
detr
End-to-End Object Detection with Transformers
fastText
Library for fast text representation and classification.
insightface
State-of-the-art 2D and 3D Face Analysis Project
lite.ai.toolkit
🛠 A lite C++ toolkit of awesome AI models with ONNXRuntime, NCNN, MNN and TNN. YOLOX, YOLOP, MODNet, YOLOR, NanoDet, YOLOX, SCRFD, YOLOX . MNN, NCNN, TNN, ONNXRuntime, CPU/GPU.
NLP
NLP
nlp-tutorial
Natural Language Processing Tutorial for Deep Learning Researchers
OpenGait
A flexible and extensible framework for gait recognition. You can focus on designing your own models and comparing with state-of-the-arts easily with the help of OpenGait.
PaddleSpeech
Easy-to-use Speech Toolkit including SOTA ASR pipeline, influential TTS with text frontend and End-to-End Speech Simultaneous Translation.
PIPNet
Efficient facial landmark detector
PPASR
基于PaddlePaddle2实现端到端中文语音识别,从入门到实战,超简单的入门案例,超实用的企业项目。
Pytorch-UNet
PyTorch implementation of the U-Net for image semantic segmentation with high quality images
StyTR-2
StyTr2 : Image Style Transfer with Transformers
Talk-to-Edit
Code for Talk-to-Edit (ICCV2021). Paper: Talk-to-Edit: Fine-Grained Facial Editing via Dialog.
tfjs-models
Pretrained models for TensorFlow.js
vit-pytorch
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch
X2Paddle
Deep learning model converter for PaddlePaddle. (『飞桨』深度学习模型转换工具)
YOLOS
You Only Look at One Sequence (NeurIPS 2021)
Yolov4_DeepSocial
基于Yolov4的行人检测、行人距离估计、多目标跟踪系统
Yolov5-deepsort-inference
Yolov5 deepsort inference,使用YOLOv5+Deepsort实现车辆行人追踪和计数,代码封装成一个Detector类,更容易嵌入到自己的项目中
YOLOX
YOLOX is a high-performance anchor-free YOLO, exceeding yolov3~v5 with MegEngine, ONNX, TensorRT, ncnn, and OpenVINO supported. Documentation: https://yolox.readthedocs.io/
YOWO
You Only Watch Once: A Unified CNN Architecture for Real-Time Spatiotemporal Action Localization