WangJYao

WangJYao

Geek Repo

Github PK Tool:Github PK Tool

WangJYao's starred repositories

Language:Jupyter NotebookStargazers:25Issues:0Issues:0

CVTE_chain_model_finetune

finetune the chain model based on cvte open source model without traing any GMM for frame alignment

Language:ShellStargazers:12Issues:0Issues:0

gop-ft

Transfer learning approach to pronunciation scoring

Language:Jupyter NotebookStargazers:9Issues:0Issues:0

SRILM

Mirror of srilm source code :-)

Stargazers:7Issues:0Issues:0
Language:PythonStargazers:85Issues:0Issues:0

Montreal-Forced-Aligner

Command line utility for forced alignment using Kaldi

Language:PythonLicense:MITStargazers:1306Issues:0Issues:0

awesome-kaldi

This is a list of features, scripts, blogs and resources for better using Kaldi ( http://kaldi-asr.org/ )

License:MITStargazers:532Issues:0Issues:0

whisperX

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

Language:PythonLicense:BSD-2-ClauseStargazers:11697Issues:0Issues:0

Speech-Evaluation-System

This is a pronunciation evaluation system based on Mandarin Chinese characters and words. The evaluation core utilizes a multi-task scoring model built on top of wenet, which can decode the pronunciation into pinyin text and provide a pronunciation score.

Stargazers:1Issues:0Issues:0

E2E-R

Code for Fine-tuning Self-Supervised Learning Models for End-to-End Pronunciation Scoring

Language:PythonLicense:MITStargazers:17Issues:0Issues:0

gopt

Code for the ICASSP 2022 paper "Transformer-Based Multi-Aspect Multi-Granularity Non-native English Speaker Pronunciation Assessment".

Language:PythonLicense:BSD-3-ClauseStargazers:145Issues:0Issues:0

HiPAMA

This repository is the implementation of the HiPAMA architecture, introduced in the paper, Hierarchical Pronunciation Assessment with Multi-Aspect Attention (ICASSP 2023).

Language:PythonLicense:BSD-3-ClauseStargazers:27Issues:0Issues:0

comfy_controlnet_preprocessors

Add my own preprocessors

Language:PythonLicense:Apache-2.0Stargazers:4Issues:0Issues:0

Sensitive-lexicon

敏感词库旨在建立一个词汇集,用于识别和过滤文本内容中的不当或不适宜的语言,以保护用户免受有害信息的影响并维持沟通环境的健康。

License:MITStargazers:176Issues:0Issues:0

SensitiveWordRetrieval

前缀树实现的中文敏感词检索C++版本

Language:C++Stargazers:2Issues:0Issues:0

IVAC-P2L

IVAC-P2L: Leveraging Irregular Repetition Priors for Improving Video Action Counting

Language:PythonLicense:MITStargazers:23Issues:0Issues:0

PoseRAC

PoseRAC: Pose Saliency Transformer for Repetitive Action Counting

Language:PythonLicense:MITStargazers:13Issues:0Issues:0

GraMMaR

The official repo for "GraMMaR: Ground-aware Motion Model for 3D Human Motion Reconstruction"

Language:JavaScriptStargazers:32Issues:0Issues:0

RepCount-Using-Skeleton-Information

RepCount using skeleton and joint information

Language:PythonStargazers:2Issues:0Issues:0

EveryShotCounts

Codebase for "Every Shot Counts: Using Exemplars for Repetition Counting in Videos"

Language:PythonLicense:MITStargazers:19Issues:0Issues:0
Language:PythonLicense:MITStargazers:12Issues:0Issues:0
Language:PythonStargazers:13Issues:0Issues:0
Language:PythonStargazers:22Issues:0Issues:0

metrabs

Estimate absolute 3D human poses from RGB images.

Language:PythonLicense:MITStargazers:459Issues:0Issues:0

MCM

Official Implement MCM: Multi-condition Motion Synthesis Framework

Language:PythonStargazers:17Issues:0Issues:0

LOGO

Accepted by CVPR 2023

Language:PythonStargazers:34Issues:0Issues:0

SMPLer-X

Official Code for "SMPLer-X: Scaling Up Expressive Human Pose and Shape Estimation"

Language:PythonLicense:NOASSERTIONStargazers:976Issues:0Issues:0

smplify-x

Expressive Body Capture: 3D Hands, Face, and Body from a Single Image

Language:PythonLicense:NOASSERTIONStargazers:1742Issues:0Issues:0

4D-Humans

4DHumans: Reconstructing and Tracking Humans with Transformers

Language:PythonLicense:MITStargazers:1205Issues:0Issues:0

emdb

Toolkit for EMDB: The Electromagnetic Database of Global 3D Human Pose and Shape in the Wild

Language:PythonLicense:MITStargazers:106Issues:0Issues:0