gavin-pu's repositories

APOProject

An experiment in developing an APO (Audio Processing Object) for Windows 10.

Language: C++ | License: GPL-2.0 | Stargazers: 0 | Issues: 0

ASR_Theory

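Speech recognition theory, including some of what I studied during the first and second years of my master's program, along with papers and slides (PPT).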

License: GPL-3.0 | Stargazers: 0 | Issues: 0
Language: C | License: Apache-2.0 | Stargazers: 0 | Issues: 0

Beamforming-for-speech-enhancement

Simple delay-and-sum, MVDR, and CGMM-MVDR beamforming.

Language: Python | Stargazers: 0 | Issues: 0

btk20_documentation

btk 2.0 documentation

Language: Python | License: MIT | Stargazers: 0 | Issues: 0

ComputeLibrary

The ARM Computer Vision and Machine Learning library is a set of functions optimised for both ARM CPUs and GPUs using SIMD technologies.

Language: C++ | License: MIT | Stargazers: 0 | Issues: 0

cosmoflow-sims

Running the simulations for the CosmoFlow project

Language: C++ | License: NOASSERTION | Stargazers: 0 | Issues: 0

dagger

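Dagger is a log query and management system based on Loki. It is a project derived from the `Dayu Infrastructure Platform` (大禹基础设施平台) built by the cloud team at CloudMinds (达闼科技). Dagger runs in front of Loki and provides log query, search, save, and download features, making it suited to container log management in cloud-native environments.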

Language: Vue | License: Apache-2.0 | Stargazers: 0 | Issues: 0

dancenet

DanceNet - 💃💃 Dance generator using Autoencoder, LSTM and Mixture Density Network. (Keras)

License: MIT | Stargazers: 0 | Issues: 0

DeepLearning

Introductory deep learning tutorials and recommended articles (Deep Learning Tutorial)

License: Apache-2.0 | Stargazers: 0 | Issues: 0

distant_speech_recognition

Spatial signal processing toolkit, a.k.a. beamforming toolkit 2.0 (BTK2.0)

Language: C++ | License: MIT | Stargazers: 0 | Issues: 0

EA-SVC

An implementation of "Phonetic Posteriorgrams based Many-to-Many Singing Voice Conversion via Adversarial Training"

License: MIT | Stargazers: 0 | Issues: 0

HyperFT

Open-source fast video face tracking for mobile devices, running at 150+ FPS on mobile.

Stargazers: 0 | Issues: 0

LPCNet

Efficient neural speech synthesis

License: BSD-3-Clause | Stargazers: 0 | Issues: 0

MASP

Microphone Array Speech Processing

License: MIT | Stargazers: 0 | Issues: 0
Stargazers: 0 | Issues: 0

nara_wpe

Different implementations of "Weighted Prediction Error" for speech dereverberation

License: MIT | Stargazers: 0 | Issues: 0

odas

ODAS: Open embeddeD Audition System

License: GPL-3.0 | Stargazers: 0 | Issues: 0

odas_web

A desktop visualization GUI for the ODAS library

License: MIT | Stargazers: 0 | Issues: 0

online-offline-CGMM-for-MVDR

Offline CGMM and CGMM with spatial prior distribution in an online manner

Stargazers: 0 | Issues: 0

pifuhd

High-Resolution 3D Human Digitization from A Single Image.

License: NOASSERTION | Stargazers: 0 | Issues: 0

pyroomacoustics

Pyroomacoustics is a package for audio signal processing for indoor applications. It was developed as a fast prototyping platform for beamforming algorithms in indoor scenarios.

License: MIT | Stargazers: 0 | Issues: 0

Sound_Localization_Algorithms

Classical algorithms of sound source localization with beamforming, TDOA and high-resolution spectral estimation.

Language: MATLAB | Stargazers: 0 | Issues: 0

speechbrain.github.io

The SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain users can easily create speech processing systems, ranging from speech recognition (both HMM/DNN and end-to-end), speaker recognition, speech enhancement, speech separation, multi-microphone speech processing, and many others.

Stargazers: 0 | Issues: 0

Spherical-Harmonic-Transform

A collection of MATLAB routines for the Spherical Harmonic Transform and related manipulations in the spherical harmonic spectrum.

License: BSD-3-Clause | Stargazers: 0 | Issues: 0

Tacotron2-Wavenet-Korean-TTS

Korean TTS, Tacotron2, Wavenet

License: MIT | Stargazers: 0 | Issues: 0

TensorFlowTTS

TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for TensorFlow 2 (supported languages include English, Korean, and Chinese)

License: Apache-2.0 | Stargazers: 0 | Issues: 0

ue4-mediapipe-plugin

UE4 MediaPipe plugin

License: Apache-2.0 | Stargazers: 0 | Issues: 0

voice-web

Common Voice is part of Mozilla's initiative to help teach machines how real people speak.

Language: TypeScript | License: MPL-2.0 | Stargazers: 0 | Issues: 0

voicefilter

Unofficial PyTorch implementation of Google AI's VoiceFilter system

Stargazers: 0 | Issues: 0