FangSen9000

We present MocapNET, a real-time method that estimates the 3D human pose directly in the popular Bio Vision Hierarchy (BVH) format, given estimations of the 2D body joints originating from monocular color images. Our contributions include: (a) A novel and compact 2D pose NSRM representation. (b) A human body orientation classifier and an ensemble of orientation-tuned neural networks that regress the 3D human pose by also allowing for the decomposition of the body to an upper and lower kinematic hierarchy. This permits the recovery of the human pose even in the case of significant occlusions. (c) An efficient Inverse Kinematics solver that refines the neural-network-based solution providing 3D human pose estimations that are consistent with the limb sizes of a target person (if known). All the above yield a 33% accuracy improvement on the Human 3.6 Million (H3.6M) dataset compared to the baseline method (MocapNET) while maintaining real-time performance

Language:C++NOASSERTION846 36 121

translate

Effortless Real-Time Sign Language Translation

Language:TypeScriptNOASSERTION465 20 135

ControlNet_Plus_Plus

Official PyTorch implementation of ECCV 2024 Paper: ControlNet++: Improving Conditional Controls with Efficient Consistency Feedback.

Language:PythonApache-2.0380 11 11

Comfyui-MusePose

Language:PythonNOASSERTION349 2 53

DatasetDM

[NeurIPS2023] DatasetDM:Synthesizing Data with Perception Annotations Using Diffusion Models

Language:Python298 16 31

world-models

Extracting spatial and temporal world models from LLMs

Language:Jupyter NotebookMIT233 6 4

UniHSI

[ICLR 2024 Spotlight] Unified Human-Scene Interaction via Prompted Chain-of-Contacts

Language:Python153 10 14

UniMoCap

[Open-source Project] UniMoCap: community implementation to unify the text-motion datasets (HumanML3D, KIT-ML, and BABEL) and whole-body motion dataset (Motion-X).

Language:PythonNOASSERTION146 5 3

puppeteer

Code for "Hierarchical World Models as Visual Whole-Body Humanoid Controllers"

Language:PythonMIT143 5 2

FineGym

All about FineGym (CVPR 2020 Oral): models, features, data, and more... keep starring and stay tuned!

127 6 13

word-embeddings-for-nmt

Supplementary material for "When and Why Are Pre-trained Word Embeddings Useful for Neural Machine Translation?" at NAACL 2018

Language:Python120 9 5

sign-language-processing.github.io

Documentation and background of sign language processing

Language:TeX111 5 50

datasets

TFDS data loaders for sign language datasets.

Language:Python81 6 47

Sign-Language-Translator

Sign Language Translator enables the hearing impaired user to communicate efficiently in sign language, and the application will translate the same into text/speech. The user has to train the model, by recording its own sign language gestures. Internally it uses MobileNet and KNN classifier to classify the gestures.

Language:HTMLMIT34 2 1

FangSen9000

房森's starred repositories

AnimateAnyone

magic-animate

Moore-AnimateAnyone

trimesh

Open-AnimateAnyone

MusePose

smplify-x

minimal

MotionGPT

SMPLer-X

MocapNET

translate

ControlNet_Plus_Plus

Comfyui-MusePose

DatasetDM

world-models

UniHSI

UniMoCap

puppeteer

FineGym

word-embeddings-for-nmt

sign-language-processing.github.io

datasets

Sign-Language-Translator

clash-dashboard

2D-Keypoints-based-Pose-Classifier

Sign-Language-Mocap-Archive

pose-pipelines

lsedataset

yanjunhan2021.github.io