房森's starred repositories
AnimateAnyone
Animate Anyone: Consistent and Controllable Image-to-Video Synthesis for Character Animation
magic-animate
[CVPR 2024] MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model
Moore-AnimateAnyone
Character Animation (AnimateAnyone, Face Reenactment)
Open-AnimateAnyone
Unofficial Implementation of Animate Anyone
MocapNET
We present MocapNET, a real-time method that estimates the 3D human pose directly in the popular Bio Vision Hierarchy (BVH) format, given estimations of the 2D body joints originating from monocular color images. Our contributions include: (a) A novel and compact 2D pose NSRM representation. (b) A human body orientation classifier and an ensemble of orientation-tuned neural networks that regress the 3D human pose while also allowing the body to be decomposed into an upper and a lower kinematic hierarchy. This permits the recovery of the human pose even in the case of significant occlusions. (c) An efficient Inverse Kinematics solver that refines the neural-network-based solution, providing 3D human pose estimations that are consistent with the limb sizes of a target person (if known). All of the above yield a 33% accuracy improvement on the Human 3.6 Million (H3.6M) dataset compared to the baseline method (MocapNET) while maintaining real-time performance.
ControlNet_Plus_Plus
Official PyTorch implementation of ECCV 2024 Paper: ControlNet++: Improving Conditional Controls with Efficient Consistency Feedback.
world-models
Extracting spatial and temporal world models from LLMs
word-embeddings-for-nmt
Supplementary material for "When and Why Are Pre-trained Word Embeddings Useful for Neural Machine Translation?" at NAACL 2018
sign-language-processing.github.io
Documentation and background of sign language processing
Sign-Language-Translator
Sign Language Translator enables hearing-impaired users to communicate efficiently in sign language; the application translates the signs into text or speech. Users train the model by recording their own sign language gestures. Internally it uses MobileNet and a KNN classifier to classify the gestures.
clash-dashboard
Latest backup of clash-dashboard, cloned the day before the original repository was deleted; it contains most of the commit history.
Sign-Language-Mocap-Archive
Collected Sign Language Motion Capture
pose-pipelines
Pipelines to process (crop, mask, and estimate poses for) sign language videos in the manner described in WMT-SLT 22
lsedataset
Spanish Sign Language dataset