ZhangZhaofeng's repositories
algo_tra
algo_tra
bark-with-voice-clone
🔊 Text-prompted Generative Audio Model - With the ability to clone voices
BeamformIt
BeamformIt acoustic beamforming software
covarep
A Cooperative Voice Analysis Repository for Speech Technologies
dist-keras
Distributed Deep Learning, with a focus on distributed training, using Keras and Apache Spark.
eesen
End-to-End Speech Recognition using Deep RNNs (Models), CTC (Training) and WFSTs (Decoding)
improved_wgan_training
Code for reproducing experiments in "Improved Training of Wasserstein GANs"
kaldi-ctc
Connectionist Temporal Classification (CTC) Automatic Speech Recognition
Fay_copy
Fay是一个完整的开源项目,包含Fay控制器及数字人模型,可灵活组合出不同的应用场景:虚拟主播、现场推销货、商品导购、语音助理、远程语音助理、数字人互动、数字人面试官及心理测评、贾维斯、Her。 开源项目,非产品试用!!!
keras
Deep Learning library for Python. Runs on TensorFlow, Theano, or CNTK.
keras-kaldi
Keras Interface for Kaldi ASR
segan
Speech Enhancement Generative Adversarial Network in TensorFlow
setk
Tools for Speech Enhancement integrated with Kaldi
SignalGraph
Matlab-based deep learning toolkit that supports arbitrary directed acyclic graphs (DAG). Support DNN, LSTM, CNN layers and many signal processing layers. Include recipes/examples of using the tool for various tasks.
SMIR-Generator
Spherical Microphone array Impulse Response generator (SMIRgen)
SqueezeNet
SqueezeNet: AlexNet-level accuracy with 50x fewer parameters
WSCM-MUSIC
Weighted Spatial Covariance Matrix Estimation for MUSIC based TDOA Estimation of Speech Source