yanqi1811's repositories

image-to-latex

Convert images of LaTeX math equations into LaTeX code.

Language: Python | License: MIT | Stargazers: 1 | Issues: 1 | Issues: 0

3D-Multi-Person-Pose

3D Multi-Person Pose Estimation by Integrating Top-Down and Bottom-Up Networks

Language: Python | License: MIT | Stargazers: 0 | Issues: 1 | Issues: 0

3DCrowdNet_RELEASE

Official PyTorch implementation of "Learning to Estimate Robust 3D Human Mesh from In-the-Wild Crowded Scenes", CVPR 2022

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

AdaptivePose

This is an official implementation of our AAAI 2022 paper AdaptivePose and arXiv paper AdaptivePose++

Language: Jupyter Notebook | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

awesome-hand-pose-estimation

Awesome work on hand pose estimation/tracking

Language: Python | Stargazers: 0 | Issues: 1 | Issues: 0

deep-learning-for-image-processing

Deep learning for image processing, including classification, object detection, etc.

Language: Python | Stargazers: 0 | Issues: 1 | Issues: 0

DocTr

The official code for “DocTr: Document Image Transformer for Geometric Unwarping and Illumination Correction”, ACM MM, Oral Paper, 2021.

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

EasyMocap

Make human motion capture easier.

Language: Python | License: NOASSERTION | Stargazers: 0 | Issues: 1 | Issues: 0

FaceFormer

[CVPR 2022] FaceFormer: Speech-Driven 3D Facial Animation with Transformers

Language: Python | License: MIT | Stargazers: 0 | Issues: 1 | Issues: 0

Faster-VoxelPose

Official implementation of Faster VoxelPose: Real-time 3D Human Pose Estimation by Orthographic Projection

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

frankmocap

A Strong and Easy-to-use Single View 3D Hand+Body Pose Estimator

Language: Python | License: NOASSERTION | Stargazers: 0 | Issues: 0 | Issues: 0

leetcode-1

LeetCode Solutions: A Record of My Problem-Solving Journey.

Language: JavaScript | License: NOASSERTION | Stargazers: 0 | Issues: 0 | Issues: 0

mediapipe

Cross-platform, customizable ML solutions for live and streaming media.

Language: C++ | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

metrabs

Estimate absolute 3D human poses from RGB images.

Language: Python | License: MIT | Stargazers: 0 | Issues: 1 | Issues: 0

mmhuman3d

OpenMMLab 3D Human Parametric Model Toolbox and Benchmark

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 1 | Issues: 0

MotionBERT

MotionBERT: Unified Pretraining for Human Motion Analysis

Language: Python | Stargazers: 0 | Issues: 0 | Issues: 0

Offline-Chinese-Handwriting-Text-Page-Spotter-with-Text-Kernel

Robust End-to-End Offline Chinese Handwriting Text Page Spotter with Text Kernel

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

PaddleBoBo

A virtual streamer built with PaddlePaddle

Language: Python | License: GPL-3.0 | Stargazers: 0 | Issues: 0 | Issues: 0

pupil

Open source eye tracking

Language: Python | License: LGPL-3.0 | Stargazers: 0 | Issues: 1 | Issues: 0

ROMP

Monocular, One-stage, Regression of Multiple 3D People: ROMP [ICCV21], BEV [CVPR22]

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

Snipper

A re-implementation of the paper "Snipper: A Spatiotemporal Transformer for Simultaneous Multi-Person 3D Pose Estimation Tracking and Forecasting on a Video Snippet"

Language: Python | Stargazers: 0 | Issues: 0 | Issues: 0

StridedTransformer-Pose3D

[TMM 2022] Exploiting Temporal Contexts with Strided Transformer for 3D Human Pose Estimation

Language: Python | License: MIT | Stargazers: 0 | Issues: 1 | Issues: 0

SysMocap

A real-time motion capture system for animating 3D virtual characters.

Language: JavaScript | License: MPL-2.0 | Stargazers: 0 | Issues: 1 | Issues: 0

TCFormer

The code for TCFormer from the paper "Not All Tokens Are Equal: Human-centric Visual Analysis via Token Clustering Transformer"

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 1 | Issues: 0

unilm

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Language: Python | License: MIT | Stargazers: 0 | Issues: 1 | Issues: 0

voxelpose-pytorch

Official implementation of "VoxelPose: Towards Multi-Camera 3D Human Pose Estimation in Wild Environment"

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

xmodaler

X-modaler is a versatile and high-performance codebase for cross-modal analytics (e.g., image captioning, video captioning, vision-language pre-training, visual question answering, visual commonsense reasoning, and cross-modal retrieval).

Language: Python | License: NOASSERTION | Stargazers: 0 | Issues: 1 | Issues: 0

yoloair

🔥🔥🔥YOLOv5, YOLOv6, YOLOv7, PPYOLOE, YOLOX, YOLOR, YOLOv4, YOLOv3, PPYOLO, PPYOLOv2, Transformer, Attention, TOOD and Improved-YOLOv5-YOLOv7... Support to improve backbone, neck, head, loss, IoU, NMS and other modules🚀

Language: Python | License: GPL-3.0 | Stargazers: 0 | Issues: 0 | Issues: 0

Yolov7-tracker

YOLOv7 and several multi-object trackers (SORT, DeepSORT, ByteTrack, BoT-SORT, etc.) on the VisDrone2019 dataset. It uses a unified style and an integrated tracker for easy embedding in your own projects.

Language: Python | License: GPL-3.0 | Stargazers: 0 | Issues: 0 | Issues: 0