KUN's repositories

CTPN

Detecting Text in Natural Image with Connectionist Text Proposal Network

Language: Jupyter Notebook | License: NOASSERTION | Stargazers: 1 | Issues: 0

IRM-based-Speech-Enhancement-using-LSTM

Ideal Ratio Mask (IRM) Estimation based Speech Enhancement using LSTM

Language: Python | License: MIT | Stargazers: 1 | Issues: 0

rnnoise

Recurrent neural network for audio noise reduction

Language: C | License: BSD-3-Clause | Stargazers: 1 | Issues: 0

sednn

Deep learning based speech enhancement using Keras or PyTorch, made easy to use

Stargazers: 1 | Issues: 0

segan

Speech Enhancement Generative Adversarial Network in TensorFlow

License: MIT | Stargazers: 1 | Issues: 0

SEGAN-1

A PyTorch implementation of SEGAN based on INTERSPEECH 2017 paper "SEGAN: Speech Enhancement Generative Adversarial Network"

Stargazers: 1 | Issues: 0

segan-pytorch

SEGAN PyTorch implementation (https://arxiv.org/abs/1703.09452)

License: GPL-3.0 | Stargazers: 1 | Issues: 0

segan_pytorch

Speech Enhancement Generative Adversarial Network in PyTorch

License: MIT | Stargazers: 1 | Issues: 0

text-detection-ctpn

Text detection mainly based on the CTPN (Connectionist Text Proposal Network) model in TensorFlow, including ID card detection

Language: Python | License: MIT | Stargazers: 1 | Issues: 0

Language: MATLAB | Stargazers: 0 | Issues: 0

AudioVerification

CCU DeepLearning Final Project

License: MIT | Stargazers: 0 | Issues: 0

Autoregressive-Predictive-Coding

Autoregressive Predictive Coding: An unsupervised autoregressive model for speech representation learning

Stargazers: 0 | Issues: 0

deepspeech.pytorch

Speech Recognition using DeepSpeech2.

License: MIT | Stargazers: 0 | Issues: 0

DNS-Challenge

This repo contains the scripts, models, and required files for the Deep Noise Suppression (DNS) Challenge.

License: CC-BY-4.0 | Stargazers: 0 | Issues: 0

End-to-end-ASR-Pytorch

An open-source project (formerly named Listen, Attend and Spell - PyTorch Implementation) for end-to-end ASR, implemented with PyTorch, the well-known deep learning toolkit.

License: MIT | Stargazers: 0 | Issues: 0

espnet

End-to-End Speech Processing Toolkit

License: Apache-2.0 | Stargazers: 0 | Issues: 0

fairseq

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

License: MIT | Stargazers: 0 | Issues: 0

License: GPL-3.0 | Stargazers: 0 | Issues: 0

LAS_Mandarin_PyTorch

Listen, Attend and Spell (LAS) model with a pretrained Mandarin Chinese ASR model

License: MIT | Stargazers: 0 | Issues: 0

Listen-Attend-Spell

A PyTorch implementation of Listen, Attend and Spell (LAS), an End-to-End ASR framework.

Stargazers: 0 | Issues: 0

LPS_extraction

A script that extracts log-power-spectrum features for speech enhancement and bandwidth extension.

Stargazers: 0 | Issues: 0

malaya-speech

Speech toolkit for Bahasa Malaysia, https://malaya-speech.readthedocs.io/

License: MIT | Stargazers: 0 | Issues: 0

ML2021-Spring

**Official** 李宏毅 (Hung-yi Lee) Machine Learning 2021 Spring

Stargazers: 0 | Issues: 0

Mockingjay-Speech-Representation

Official implementation of Mockingjay in PyTorch

License: MIT | Stargazers: 0 | Issues: 0

Model-Attacking-Defending

In this project, I implemented FGSM and the basic iterative method to attack a pre-trained model. I then tried to defend the model by randomizing the input images before feeding them to it.

Stargazers: 0 | Issues: 0

netron

Visualizer for deep learning and machine learning models

Language: JavaScript | License: MIT | Stargazers: 0 | Issues: 0

s3prl

Self-Supervised Speech Pre-training and Representation Learning Toolkit.

License: MIT | Stargazers: 0 | Issues: 0

self-supervised-speech-recognition

Speech-to-text with self-supervised learning, based on the wav2vec 2.0 framework

Stargazers: 0 | Issues: 0

softer-NMS

Softer-NMS: Rethinking Bounding Box Regression for Accurate Object Detection

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0

speech_feature_extractor

Some useful features for speech processing, such as MFCC, gammatone filterbank, GFCC, spectrum (power spectrum and log-power spectrum), and Amplitude Modulation Spectrum (AMS).

License: MIT | Stargazers: 0 | Issues: 0