Beast code in Giters

chenchen's starred repositories

stats

macOS system monitor in your menu bar

Language:SwiftMIT2589800

EasyOCR

Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.

Language:PythonApache-2.02451900

nlp-papers-with-arxiv

Statistics and accepted paper list of NLP conferences with arXiv link

Language:Jupyter NotebookMIT43000

CVPR21Chal-SLR

This repo contains the official code of our work SAM-SLR which won the CVPR 2021 Challenge on Large Scale Signer Independent Isolated Sign Language Recognition.

Language:PythonCC0-1.020900

VAC_CSLR

Visual Alignment Constraint for Continuous Sign Language Recognition. ( ICCV 2021)

Language:PythonApache-2.012200

HandyFigure

HandyFigure provides the sources file (ususally PPT files) for paper figures

Language:JavaScriptMIT16100

MABe2022

Solution for Multi Agent Behavior Challenge 2022

Language:Python100

We present MocapNET, a real-time method that estimates the 3D human pose directly in the popular Bio Vision Hierarchy (BVH) format, given estimations of the 2D body joints originating from monocular color images. Our contributions include: (a) A novel and compact 2D pose NSRM representation. (b) A human body orientation classifier and an ensemble of orientation-tuned neural networks that regress the 3D human pose by also allowing for the decomposition of the body to an upper and lower kinematic hierarchy. This permits the recovery of the human pose even in the case of significant occlusions. (c) An efficient Inverse Kinematics solver that refines the neural-network-based solution providing 3D human pose estimations that are consistent with the limb sizes of a target person (if known). All the above yield a 33% accuracy improvement on the Human 3.6 Million (H3.6M) dataset compared to the baseline method (MocapNET) while maintaining real-time performance

Language:C++NOASSERTION85800

video2bvh

Extracts human motion in video and save it as bvh mocap file.

Language:PythonMIT57700

Attention-A-Lightweight-2D-Hand-Pose-Estimation-Approach-Pytorch

Language:Python1900

CVPR-2022-Papers

63600

motion-transformer

A Spatio-temporal Transformer for 3D Human Motion Prediction

Language:PythonGPL-3.010500

A-unified-3d-human-motion-synthesis-model-via-conditional-variational-auto-encoder

Language:Python3700

docker-pytorch

A Docker image for PyTorch

Language:DockerfileMIT97300

slt

Sign Language Transformers (CVPR'20)

Language:PythonApache-2.023900

awesome-causality-algorithms

An index of algorithms for learning causality with data

MIT295800

awesome-multimodal-ml

Reading list for research topics in multimodal machine learning

MIT609200

Awesome-SLP

A curated list of awesome work on Sign Language Production

5900

ProgressiveTransformersSLP

Source code for "Progressive Transformers for End-to-End Sign Language Production" (ECCV 2020)

Language:PythonNOASSERTION10500

NUWA

A unified 3D Transformer Pipeline for visual synthesis

281000

image-gpt

PyTorch Implementation of OpenAI's Image GPT

Language:PythonApache-2.025400

awesome-grounding

awesome grounding: A curated list of research papers in visual grounding

MIT102800

awesome-vision-language-pretraining-papers

Recent Advances in Vision and Language PreTrained Models (VL-PTMs)

113900

vit-pytorch

Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch

Language:PythonMIT2061600

CLIP

CLIP: Connecting Text and Image (Learning Transferable Visual Models From Natural Language Supervision)

Language:PythonMIT7600

sshfs-win

SSHFS For Windows

Language:CNOASSERTION518200

xmodaler

X-modaler is a versatile and high-performance codebase for cross-modal analytics(e.g., image captioning, video captioning, vision-language pre-training, visual question answering, visual commonsense reasoning, and cross-modal retrieval).

Language:PythonNOASSERTION103000

hanchenchen