gavin-pu

gavin-pu's repositories

APOProject

An experiment in developing an APO (Audio Processing Object) for Windows 10.

Language: C++, License: GPL-2.0

ASR_Theory

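Speech recognition theory, including some of what I learned during my first and second years of graduate school, plus papers and slides (PPT)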

License: GPL-3.0

athena-signal

Language: C, License: Apache-2.0

Beamforming-for-speech-enhancement

Simple delay-and-sum, MVDR, and CGMM-MVDR beamforming

Language: Python

btk20_documentation

btk 2.0 documentation

Language: Python, License: MIT

ComputeLibrary

The ARM Computer Vision and Machine Learning library is a set of functions optimised for both ARM CPUs and GPUs using SIMD technologies.

Language: C++, License: MIT

cosmoflow-sims

Running the simulations for the CosmoFlow project

Language: C++, License: NOASSERTION

dagger

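Dagger is a log query and management system built on Loki. It is a project derived from the `大禹基础设施平台` (Dayu infrastructure platform) of the CloudMinds (达闼科技) cloud team. Dagger runs as a front end to Loki, supports querying, searching, saving, and downloading logs, and is suited to container log management in cloud-native environments.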

Language: Vue, License: Apache-2.0

dancenet

DanceNet - 💃💃 Dance generator using Autoencoder, LSTM and Mixture Density Network. (Keras)

License: MIT

DeepLearning

Introductory deep learning tutorials and outstanding articles (Deep Learning Tutorial)

License: Apache-2.0

distant_speech_recognition

Spatial signal processing toolkit, a.k.a. beamforming toolkit 2.0 (BTK2.0)

Language: C++, License: MIT

EA-SVC

An implementation of "Phonetic Posteriorgrams based Many-to-Many Singing Voice Conversion via Adversarial Training"

License: MIT

HyperFT

Open-source fast video face tracking for mobile devices, running at 150+ FPS on mobile


LPCNet

Efficient neural speech synthesis

License: BSD-3-Clause

MASP

Microphone Array Speech Processing

License: MIT

Microphone-Array-postfilter


nara_wpe

Different implementations of "Weighted Prediction Error" for speech dereverberation

License: MIT

odas

ODAS: Open embeddeD Audition System

License: GPL-3.0

odas_web

A desktop visualization GUI for the ODAS library

License: MIT

online-offline-CGMM-for-MVDR

Offline CGMM and CGMM with spatial prior distribution in an online manner


pifuhd

High-Resolution 3D Human Digitization from A Single Image.

License: NOASSERTION

pyroomacoustics

Pyroomacoustics is a package for audio signal processing for indoor applications. It was developed as a fast prototyping platform for beamforming algorithms in indoor scenarios.

License: MIT

Sound_Localization_Algorithms

Classical algorithms of sound source localization with beamforming, TDOA and high-resolution spectral estimation.

Language: MATLAB

speechbrain.github.io

The SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain users can easily create speech processing systems, ranging from speech recognition (both HMM/DNN and end-to-end), speaker recognition, speech enhancement, speech separation, multi-microphone speech processing, and many others.


Spherical-Harmonic-Transform

A collection of MATLAB routines for the Spherical Harmonic Transform and related manipulations in the spherical harmonic spectrum.

License: BSD-3-Clause

Tacotron2-Wavenet-Korean-TTS

Korean TTS, Tacotron2, Wavenet

License: MIT

TensorFlowTTS

😝 TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for TensorFlow 2 (supported languages include English, Korean, and Chinese)

License: Apache-2.0

ue4-mediapipe-plugin

UE4 MediaPipe plugin

License: Apache-2.0

voice-web

Common Voice is part of Mozilla's initiative to help teach machines how real people speak.

Language: TypeScript, License: MPL-2.0

voicefilter

Unofficial PyTorch implementation of Google AI's VoiceFilter system
