gavin-pu's repositories
APOProject
A trial of developing a APO (Audio Processing Object), working on Windows 10.
ASR_Theory
语音识别理论,包括研一与研二期间部分所学,论文和PPT
Beamforming-for-speech-enhancement
simple delaysum, MVDR and CGMM-MVDR
btk20_documentation
btk 2.0 documentation
ComputeLibrary
The ARM Computer Vision and Machine Learning library is a set of functions optimised for both ARM CPUs and GPUs using SIMD technologies.
cosmoflow-sims
Running the simulations for the CosmoFlow project
dagger
Dagger 是一个基于 Loki 的日志查询和管理系统,它是由达闼科技( CloudMinds )云团队的`大禹基础设施平台`派生出来的一个项目。Dagger 运行在 Loki 前端,具备日志查询、搜索,保存和下载等特性,适用于云原生场景下的容器日志管理场景。
dancenet
DanceNet -💃💃Dance generator using Autoencoder, LSTM and Mixture Density Network. (Keras)
DeepLearning
深度学习入门教程, 优秀文章, Deep Learning Tutorial
distant_speech_recognition
spatial signal processing toolkit a.k.a beamforming toolkit 2.0 (BTK2.0)
EA-SVC
An implement of "Phonetic Posteriorgrams based Many-to-Many Singing Voice Conversion via Adversarial Training"
HyperFT
开源移动端快速视频人脸跟踪-移动端150FPS+
LPCNet
Efficient neural speech synthesis
MASP
Microphone Array Speech Processing
nara_wpe
Different implementations of "Weighted Prediction Error" for speech dereverberation
odas
ODAS: Open embeddeD Audition System
odas_web
A desktop visualization GUI for the ODAS library
online-offline-CGMM-for-MVDR
Offline CGMM and CGMM with spatial prior distribution in an online manner
pifuhd
High-Resolution 3D Human Digitization from A Single Image.
pyroomacoustics
Pyroomacoustics is a package for audio signal processing for indoor applications. It was developed as a fast prototyping platform for beamforming algorithms in indoor scenarios.
Sound_Localization_Algorithms
Classical algorithms of sound source localization with beamforming, TDOA and high-resolution spectral estimation.
speechbrain.github.io
The SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain users can easily create speech processing systems, ranging from speech recognition (both HMM/DNN and end-to-end), speaker recognition, speech enhancement, speech separation, multi-microphone speech processing, and many others.
Spherical-Harmonic-Transform
A collection of MATLAB routines for the Spherical Harmonic Transform and related manipulations in the spherical harmonic spectrum.
Tacotron2-Wavenet-Korean-TTS
Korean TTS, Tacotron2, Wavenet
TensorFlowTTS
:stuck_out_tongue_closed_eyes: TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for Tensorflow 2 (supported including English, Korean, Chinese)
ue4-mediapipe-plugin
UE4 MediaPipe plugin
voice-web
Common Voice is part of Mozilla's initiative to help teach machines how real people speak.
voicefilter
Unofficial PyTorch implementation of Google AI's VoiceFilter system