jxzhanggg

Jing-Xuan Zhang's starred repositories

ctc_segmentation

Segment a given audio into utterances using a trained end-to-end ASR model.

Language:PythonApache-2.07300

AV-RelScore

Audio-Visual Corruption Modeling of our paper "Watch or Listen: Robust Audio-Visual Speech Recognition with Visual Corruption Modeling and Reliability Scoring" in CVPR23

Language:Python2800

Visual_Speech_Recognition_for_Multiple_Languages

Visual Speech Recognition for Multiple Languages

Language:PythonNOASSERTION31100

kenlm

KenLM: Faster and Smaller Language Model Queries

Language:C++NOASSERTION245900

Semi-supervised-learning

A Unified Semi-Supervised Learning Codebase (NeurIPS'22)

Language:PythonMIT128100

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.

Language:PythonApache-2.01072100

av_hubert

A self-supervised learning framework for audio-visual speech

Language:PythonNOASSERTION81900

hydra

Hydra is a framework for elegantly configuring complex applications

Language:PythonMIT846900

nonparaSeq2seqVC_code

Implementation code of non-parallel sequence-to-sequence VC

Language:PythonMIT24700

beaqlejs

*BeaqleJS* provides a framework to create browser based listening tests and is purely based on open web standards like HTML5 and Javascript.

Language:JavaScriptGPL-3.08600

Real-Time-Voice-Cloning

Clone a voice in 5 seconds to generate arbitrary speech in real-time

Language:PythonNOASSERTION5170500

Lip2Wav

This is the repository containing codes for our CVPR, 2020 paper titled "Learning Individual Speaking Styles for Accurate Lip to Speech Synthesis"

Language:PythonMIT69200

ParallelWaveGAN

Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with Pytorch

Language:Jupyter NotebookMIT153000

espnet

End-to-End Speech Processing Toolkit

Language:PythonApache-2.0817500

ultrasuite-tools

Tools to process the UltraSuite data

Language:Jupyter NotebookNOASSERTION1100

LipNet-PyTorch

The state-of-art PyTorch implementation of the method described in the paper "LipNet: End-to-End Sentence-level Lipreading" (https://arxiv.org/abs/1611.01599)

Language:Python20600

cluster-scripts

A collection of useful scripts, templates, and examples for clusters using SLURM https://slurm.schedmd.com/

Language:Shell9600

955.WLB

955 不加班的公司名单 - 工作 955，work–life balance (工作与生活的平衡)

3448400

996.ICU

Repo for counting stars and contributing. Press F to pay respect to glorious developers.

NOASSERTION26955800