GRU (gaoyiyeah)

gaoyiyeah

Geek Repo

Location:New York

Github PK Tool:Github PK Tool

GRU's repositories

ASR---Word-Error-Rate-GUI

This is an interactive GUI where you can enter some ground truth and hypothesis/asr-output to compute the Word Error Rate. It shows the evaluation.

Stargazers:0Issues:0Issues:0

asr-evaluation

Python module for evaluating ASR hypotheses (e.g. word error rate, word recognition rate).

License:Apache-2.0Stargazers:0Issues:0Issues:0

asv-subtools

An Open Source Tools for Speaker Recognition

License:Apache-2.0Stargazers:0Issues:0Issues:0

ChineseNLP

Datasets, SOTA results of every fields of Chinese NLP

Stargazers:0Issues:0Issues:0

cocoapi

COCO API - Dataset @ http://cocodataset.org/

License:NOASSERTIONStargazers:0Issues:0Issues:0

Conv-TasNet-1

A PyTorch implementation of Conv-TasNet described in "TasNet: Surpassing Ideal Time-Frequency Masking for Speech Separation" with Permutation Invariant Training (PIT).

License:MITStargazers:0Issues:0Issues:0

Conv-TasNet-2

Conv-TasNet: Surpassing Ideal Time-Frequency Magnitude Masking for Speech Separation Pytorch's Implement

Stargazers:0Issues:0Issues:0

DBFace

DBFace is a real-time, single-stage detector for face detection, with faster speed and higher accuracy

Stargazers:0Issues:0Issues:0

DCUNetTorchSound

Implementation of Phase-aware speech enhancement with deep complex U-Net

Stargazers:0Issues:0Issues:0

deep-sdm

deep-sdm is appied for face landmark.

Stargazers:0Issues:0Issues:0

delta

DELTA is a deep learning based natural language and speech processing platform.

License:Apache-2.0Stargazers:0Issues:0Issues:0

dual-path-RNNs-DPRNNs-based-speech-separation

A PyTorch implementation of dual-path RNNs (DPRNNs) based speech separation described in "Dual-path RNN: efficient long sequence modeling for time-domain single-channel speech separation".

Stargazers:0Issues:0Issues:0

duckling

Language, engine, and tooling for expressing, testing, and evaluating composable language rules on input strings.

License:NOASSERTIONStargazers:0Issues:0Issues:0

end-to-end-lipreading

Pytorch code for End-to-End Audiovisual Speech Recognition

Stargazers:0Issues:0Issues:0

FewShotTagging

Code for ACL2020 paper: Few-shot Slot Tagging with Collapsed Dependency Transfer and Label-enhanced Task-adaptive Projection Network

Stargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0

mediapipe

MediaPipe is the simplest way for researchers and developers to build world-class ML solutions and applications for mobile, edge, cloud and the web.

License:Apache-2.0Stargazers:0Issues:0Issues:0

MicArrayBeamforming

Microphone Array Beamforming Toolbox

License:MITStargazers:0Issues:0Issues:0

NLP-Models-Tensorflow

Gathers machine learning and Tensorflow deep learning models for NLP problems, 1.13 < Tensorflow < 2.0

License:MITStargazers:0Issues:0Issues:0

NSNet

This in an implementation of NSNet in PyTorch and PyTorch Lightning. NSNet is a recurrent neural network for single channel speech enhancement.

Stargazers:0Issues:0Issues:0

Online-Speech-Recognition

Working online speech recognition based on RNN Transducer. ( Trained model release soon ... )

License:NOASSERTIONStargazers:0Issues:0Issues:0

OpenAttack

An Open-Source Package for Textual Adversarial Attack.

Stargazers:0Issues:0Issues:0

OpenTransformer

A No-Recurrence Sequence-to-Sequence Model for Speech Recognition

License:MITStargazers:0Issues:0Issues:0

pytorch_face_landmark

Fast and accurate face landmark detection library using PyTorch; Support 68-point semi-frontal and 39-point profile landmark detection; Support both coordinate-based and heatmap-based inference; Up to 100FPS landmark inference on CPU.

Stargazers:0Issues:0Issues:0

re2

RE2 is a fast, safe, thread-friendly alternative to backtracking regular expression engines like those used in PCRE, Perl, and Python. It is a C++ library.

License:BSD-3-ClauseStargazers:0Issues:0Issues:0

sound-source-localization-algorithm_DOA_estimation

关于语音信号声源定位DOA估计所用的一些传统算法

Stargazers:0Issues:0Issues:0

SpeechAlgorithms

Code of my WeChat Offical Account

License:Apache-2.0Stargazers:0Issues:0Issues:0

speechmetrics

A wrapper around speech quality metrics MOSNet, BSSEval, STOI, PESQ, SRMR, SISDR

Stargazers:0Issues:0Issues:0

spokestack-android

Spokestack speech recognition pipeline for Android

License:Apache-2.0Stargazers:0Issues:0Issues:0

VL-BERT

Code for ICLR 2020 paper "VL-BERT: Pre-training of Generic Visual-Linguistic Representations".

License:MITStargazers:0Issues:0Issues:0