ZhangZhaofeng

ZhangZhaofeng's repositories

bark-with-voice-clone

🔊 Text-prompted Generative Audio Model - With the ability to clone voices

Language: Jupyter Notebook | License: NOASSERTION | 000

BeamformIt

BeamformIt acoustic beamforming software

Language: C++ | 000

ConvNet

Convolutional Neural Networks for Matlab. Has versions for GPU and CPU, written in CUDA, C++ and Matlab. All versions work identically. The GPU version uses kernels from Alex Krizhevsky's library 'cuda-convnet2'.

Language: Cuda | 020

covarep

A Cooperative Voice Analysis Repository for Speech Technologies

Language: Matlab | License: NOASSERTION | 000

dist-keras

Distributed Deep Learning, with a focus on distributed training, using Keras and Apache Spark.

Language: Python | License: GPL-3.0 | 000

eesen

End-to-End Speech Recognition using Deep RNNs (Models), CTC (Training) and WFSTs (Decoding)

Language: C++ | License: Apache-2.0 | 000

EQ

Language: C++ | 000

improved_wgan_training

Code for reproducing experiments in "Improved Training of Wasserstein GANs"

Language: Python | License: MIT | 000

kaldi-ctc

Connectionist Temporal Classification (CTC) Automatic Speech Recognition

Language: C++ | License: NOASSERTION | 000

Fay_copy

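Fay is a complete open-source project comprising the Fay controller and digital human models, which can be flexibly combined for different application scenarios: virtual streamer, on-site product promotion, shopping guide, voice assistant, remote voice assistant, digital human interaction, digital human interviewer and psychological assessment, Jarvis, Her. This is an open-source project, not a product trial!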

License: GPL-3.0 | 000

keras

Deep Learning library for Python. Runs on TensorFlow, Theano, or CNTK.

Language: Python | License: NOASSERTION | 000

keras-kaldi

Keras Interface for Kaldi ASR

Language: Python | License: GPL-3.0 | 000

multi_atr

Language: Python | 000

RemoteBendo

Language: Python | 000

segan

Speech Enhancement Generative Adversarial Network in TensorFlow

Language: Python | License: MIT | 000

setk

Tools for Speech Enhancement integrated with Kaldi

License: Apache-2.0 | 000

SignalGraph

Matlab-based deep learning toolkit that supports arbitrary directed acyclic graphs (DAG). Supports DNN, LSTM, and CNN layers and many signal processing layers. Includes recipes and examples of using the tool for various tasks.

Language: Matlab | License: MIT | 000

simu_var_hilo

Language: Python | 000

SMIR-Generator

Spherical Microphone array Impulse Response generator (SMIRgen)

License: GPL-3.0 | 000

SqueezeNet

SqueezeNet: AlexNet-level accuracy with 50x fewer parameters

License: BSD-2-Clause | 000

WassersteinGAN

Language: Python | License: BSD-3-Clause | 000

WSCM-MUSIC

Weighted Spatial Covariance Matrix Estimation for MUSIC based TDOA Estimation of Speech Source

Language: Matlab | 000

abiapis

equalizer

algo_tra

algo_tra_v2
