chenchen's starred repositories

stats

macOS system monitor in your menu bar

Language:SwiftLicense:MITStargazers:25898Issues:0Issues:0

Developer-Books

编程开发相关书单列表整理

Stargazers:3924Issues:0Issues:0
Language:PythonStargazers:9Issues:0Issues:0

EasyOCR

Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.

Language:PythonLicense:Apache-2.0Stargazers:24519Issues:0Issues:0

nlp-papers-with-arxiv

Statistics and accepted paper list of NLP conferences with arXiv link

Language:Jupyter NotebookLicense:MITStargazers:430Issues:0Issues:0

CVPR21Chal-SLR

This repo contains the official code of our work SAM-SLR which won the CVPR 2021 Challenge on Large Scale Signer Independent Isolated Sign Language Recognition.

Language:PythonLicense:CC0-1.0Stargazers:209Issues:0Issues:0

VAC_CSLR

Visual Alignment Constraint for Continuous Sign Language Recognition. ( ICCV 2021)

Language:PythonLicense:Apache-2.0Stargazers:122Issues:0Issues:0

HandyFigure

HandyFigure provides the sources file (ususally PPT files) for paper figures

Language:JavaScriptLicense:MITStargazers:161Issues:0Issues:0

MABe2022

Solution for Multi Agent Behavior Challenge 2022

Language:PythonStargazers:1Issues:0Issues:0
Language:PythonLicense:MITStargazers:8Issues:0Issues:0

MocapNET

We present MocapNET, a real-time method that estimates the 3D human pose directly in the popular Bio Vision Hierarchy (BVH) format, given estimations of the 2D body joints originating from monocular color images. Our contributions include: (a) A novel and compact 2D pose NSRM representation. (b) A human body orientation classifier and an ensemble of orientation-tuned neural networks that regress the 3D human pose by also allowing for the decomposition of the body to an upper and lower kinematic hierarchy. This permits the recovery of the human pose even in the case of significant occlusions. (c) An efficient Inverse Kinematics solver that refines the neural-network-based solution providing 3D human pose estimations that are consistent with the limb sizes of a target person (if known). All the above yield a 33% accuracy improvement on the Human 3.6 Million (H3.6M) dataset compared to the baseline method (MocapNET) while maintaining real-time performance

Language:C++License:NOASSERTIONStargazers:858Issues:0Issues:0

video2bvh

Extracts human motion in video and save it as bvh mocap file.

Language:PythonLicense:MITStargazers:577Issues:0Issues:0

motion-transformer

A Spatio-temporal Transformer for 3D Human Motion Prediction

Language:PythonLicense:GPL-3.0Stargazers:105Issues:0Issues:0

docker-pytorch

A Docker image for PyTorch

Language:DockerfileLicense:MITStargazers:973Issues:0Issues:0

slt

Sign Language Transformers (CVPR'20)

Language:PythonLicense:Apache-2.0Stargazers:239Issues:0Issues:0

awesome-causality-algorithms

An index of algorithms for learning causality with data

License:MITStargazers:2958Issues:0Issues:0

awesome-multimodal-ml

Reading list for research topics in multimodal machine learning

License:MITStargazers:6092Issues:0Issues:0

Awesome-SLP

A curated list of awesome work on Sign Language Production

Stargazers:59Issues:0Issues:0

ProgressiveTransformersSLP

Source code for "Progressive Transformers for End-to-End Sign Language Production" (ECCV 2020)

Language:PythonLicense:NOASSERTIONStargazers:105Issues:0Issues:0

NUWA

A unified 3D Transformer Pipeline for visual synthesis

Stargazers:2810Issues:0Issues:0

image-gpt

PyTorch Implementation of OpenAI's Image GPT

Language:PythonLicense:Apache-2.0Stargazers:254Issues:0Issues:0

awesome-grounding

awesome grounding: A curated list of research papers in visual grounding

License:MITStargazers:1028Issues:0Issues:0

awesome-vision-language-pretraining-papers

Recent Advances in Vision and Language PreTrained Models (VL-PTMs)

Stargazers:1139Issues:0Issues:0

vit-pytorch

Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch

Language:PythonLicense:MITStargazers:20616Issues:0Issues:0

CLIP

CLIP: Connecting Text and Image (Learning Transferable Visual Models From Natural Language Supervision)

Language:PythonLicense:MITStargazers:76Issues:0Issues:0

sshfs-win

SSHFS For Windows

Language:CLicense:NOASSERTIONStargazers:5182Issues:0Issues:0

xmodaler

X-modaler is a versatile and high-performance codebase for cross-modal analytics(e.g., image captioning, video captioning, vision-language pre-training, visual question answering, visual commonsense reasoning, and cross-modal retrieval).

Language:PythonLicense:NOASSERTIONStargazers:1030Issues:0Issues:0