房森's starred repositories
AnimateAnyone
Animate Anyone: Consistent and Controllable Image-to-Video Synthesis for Character Animation
Moore-AnimateAnyone
Character Animation (AnimateAnyone, Face Reenactment)
Open-AnimateAnyone
Unofficial Implementation of Animate Anyone
MocapNET
We present MocapNET, a real-time method that estimates the 3D human pose directly in the popular Bio Vision Hierarchy (BVH) format, given estimations of the 2D body joints originating from monocular color images. Our contributions include: (a) A novel and compact 2D pose NSRM representation. (b) A human body orientation classifier and an ensemble of orientation-tuned neural networks that regress the 3D human pose by also allowing for the decomposition of the body to an upper and lower kinematic hierarchy. This permits the recovery of the human pose even in the case of significant occlusions. (c) An efficient Inverse Kinematics solver that refines the neural-network-based solution providing 3D human pose estimations that are consistent with the limb sizes of a target person (if known). All the above yield a 33% accuracy improvement on the Human 3.6 Million (H3.6M) dataset compared to the baseline method (MocapNET) while maintaining real-time performance
world-models
Extracting spatial and temporal world models from LLMs
word-embeddings-for-nmt
Supplementary material for "When and Why Are Pre-trained Word Embeddings Useful for Neural Machine Translation?" at NAACL 2018
sign-language-processing.github.io
Documentation and background of sign language processing
transcription
Text to pose model for sign language pose generation from a text sequence
slt_how2sign_wicv2023
Sign Language Translation for Instructional Videos - CVPR WiCV 2023
Sign-Language-Translator
Sign Language Translator enables the hearing impaired user to communicate efficiently in sign language, and the application will translate the same into text/speech. The user has to train the model, by recording its own sign language gestures. Internally it uses MobileNet and KNN classifier to classify the gestures.
MMD_3D_POSE_Converter
Convert 3D Human Pose to VMD file
jiyanggao.github.io
Minimal is a Jekyll theme for GitHub Pages
pose-pipelines
Pipelines to process (crop, mask, and estimate poses) sign language videos like the way described in WMT-SLT 22
lsedataset
Spanish Sign Language dataset