Beast code in Giters

GRU's repositories

ASR---Word-Error-Rate-GUI

An interactive GUI where you can enter a ground-truth transcript and a hypothesis (ASR output) to compute the word error rate and view the evaluation.

asr-evaluation

Python module for evaluating ASR hypotheses (e.g. word error rate, word recognition rate).

Apache-2.0

asv-subtools

Open-source tools for speaker recognition

Apache-2.0

ChineseNLP

Datasets and SOTA results for every field of Chinese NLP

cocoapi

COCO API - Dataset @ http://cocodataset.org/

NOASSERTION

Conv-TasNet-1

A PyTorch implementation of Conv-TasNet described in "TasNet: Surpassing Ideal Time-Frequency Masking for Speech Separation" with Permutation Invariant Training (PIT).

MIT

Conv-TasNet-2

A PyTorch implementation of "Conv-TasNet: Surpassing Ideal Time-Frequency Magnitude Masking for Speech Separation"

DBFace

DBFace is a real-time, single-stage detector for face detection, with faster speed and higher accuracy

DCUNetTorchSound

Implementation of Phase-aware speech enhancement with deep complex U-Net

deep-sdm

deep-sdm is applied to face landmark detection.

delta

DELTA is a deep learning based natural language and speech processing platform.

Apache-2.0

dual-path-RNNs-DPRNNs-based-speech-separation

A PyTorch implementation of dual-path RNNs (DPRNNs) based speech separation described in "Dual-path RNN: efficient long sequence modeling for time-domain single-channel speech separation".

duckling

Language, engine, and tooling for expressing, testing, and evaluating composable language rules on input strings.

NOASSERTION

end-to-end-lipreading

PyTorch code for End-to-End Audiovisual Speech Recognition

FewShotTagging

Code for ACL2020 paper: Few-shot Slot Tagging with Collapsed Dependency Transfer and Label-enhanced Task-adaptive Projection Network

Interspeech-2020-Non-native-children-ASR

mediapipe

MediaPipe is the simplest way for researchers and developers to build world-class ML solutions and applications for mobile, edge, cloud and the web.

Apache-2.0

MicArrayBeamforming

Microphone Array Beamforming Toolbox

MIT

NLP-Models-Tensorflow

Gathers machine learning and TensorFlow deep learning models for NLP problems (1.13 < TensorFlow < 2.0)

MIT

NSNet

This is an implementation of NSNet in PyTorch and PyTorch Lightning. NSNet is a recurrent neural network for single-channel speech enhancement.

Online-Speech-Recognition

Working online speech recognition based on RNN Transducer. (Trained model to be released soon ...)

NOASSERTION

OpenAttack

An Open-Source Package for Textual Adversarial Attack.

OpenTransformer

A No-Recurrence Sequence-to-Sequence Model for Speech Recognition

MIT

pytorch_face_landmark

Fast and accurate face landmark detection library using PyTorch. Supports 68-point semi-frontal and 39-point profile landmark detection, and both coordinate-based and heatmap-based inference. Up to 100 FPS landmark inference on CPU.

re2

RE2 is a fast, safe, thread-friendly alternative to backtracking regular expression engines like those used in PCRE, Perl, and Python. It is a C++ library.

BSD-3-Clause

sound-source-localization-algorithm_DOA_estimation

Some traditional algorithms used for sound source localization (DOA estimation) of speech signals

SpeechAlgorithms

Code from my WeChat Official Account

Apache-2.0

speechmetrics

A wrapper around speech quality metrics MOSNet, BSSEval, STOI, PESQ, SRMR, SISDR

spokestack-android

Spokestack speech recognition pipeline for Android

Apache-2.0

VL-BERT

Code for ICLR 2020 paper "VL-BERT: Pre-training of Generic Visual-Linguistic Representations".

MIT

gaoyiyeah
