dttlgotv's repositories

AEC-Challenge

AEC Challenge

License:MITStargazers:0Issues:1Issues:0

AEC3

AEC3 Extracted From WebRTC

Language:C++Stargazers:0Issues:1Issues:0

BlackHole

BlackHole is a modern macOS virtual audio driver that allows applications to pass audio to other applications with zero additional latency.

Language:CLicense:GPL-3.0Stargazers:0Issues:1Issues:0

bssaec2020

A New Perspective of Auxiliary-Function-Based Independent Component Analysis in Acoustic Echo Cancellation

Language:MATLABStargazers:0Issues:1Issues:0
Language:HTMLLicense:Apache-2.0Stargazers:0Issues:1Issues:0

denoiser

Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020)We provide a PyTorch implementation of the paper Real Time Speech Enhancement in the Waveform Domain. In which, we present a causal speech enhancement model working on the raw waveform that runs in real-time on a laptop CPU. The proposed model is based on an encoder-decoder architecture with skip-connections. It is optimized on both time and frequency domains, using multiple loss functions. Empirical evidence shows that it is capable of removing various kinds of background noise including stationary and non-stationary noises, as well as room reverb. Additionally, we suggest a set of data augmentation techniques applied directly on the raw waveform which further improve model performance and its generalization abilities.

Language:PythonLicense:NOASSERTIONStargazers:0Issues:1Issues:0

distant_speech_recognition

spatial signal processing toolkit a.k.a beamforming toolkit 2.0 (BTK2.0)

Language:C++License:MITStargazers:0Issues:1Issues:0

ExoPlayer

An extensible media player for Android

Language:JavaLicense:Apache-2.0Stargazers:0Issues:1Issues:0

fma

FMA: A Dataset For Music Analysis

Language:Jupyter NotebookLicense:MITStargazers:0Issues:1Issues:0

Google-Voice-Separation-voicefilter

Unofficial PyTorch implementation of Google AI's VoiceFilter system

Stargazers:0Issues:0Issues:0

GSYVideoPlayer

视频播放器(IJKplayer、ExoPlayer、MediaPlayer),HTTPS,支持弹幕,外挂字幕,支持滤镜、水印、gif截图,片头广告、中间广告,多个同时播放,支持基本的拖动,声音、亮度调节,支持边播边缓存,支持视频自带rotation的旋转(90,270之类),重力旋转与手动旋转的同步支持,支持列表播放 ,列表全屏动画,视频加载速度,列表小窗口支持拖动,动画效果,调整比例,多分辨率切换,支持切换播放器,进度条小窗口预览,列表切换详情页面无缝播放,rtsp、concat、mpeg。

Language:JavaLicense:Apache-2.0Stargazers:0Issues:1Issues:0

ijkplayer-1

Android/iOS video player based on FFmpeg n3.4, with MediaCodec, VideoToolbox support.

Language:CLicense:GPL-2.0Stargazers:0Issues:1Issues:0

lite.ai.toolkit

🛠 A lite C++ toolkit of awesome AI models with ONNXRuntime, NCNN, MNN and TNN. YOLOX, YOLOP, YOLOv5, YOLOR, NanoDet, YOLOX, SCRFD, YOLOX . MNN, NCNN, TNN, ONNXRuntime, CPU/GPU.

Language:C++License:GPL-3.0Stargazers:0Issues:1Issues:0

LPCNet

Efficient neural speech synthesis

Language:CLicense:BSD-3-ClauseStargazers:0Issues:1Issues:0

lyra

A Very Low-Bitrate Codec for Speech Compression

Language:C++License:Apache-2.0Stargazers:0Issues:1Issues:0

MediaRender

android DLNA media render

Language:JavaLicense:MITStargazers:0Issues:1Issues:0

MetaAF

Control adaptive filters with neural networks.

Language:PythonStargazers:0Issues:0Issues:0

MOSNet

Implementation of "MOSNet: Deep Learning based Objective Assessment for Voice Conversion"

Language:PythonLicense:NOASSERTIONStargazers:0Issues:1Issues:0

NISQA

NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

NKF-AEC

Acoustic Echo Cancellation with Nerual Kalman Filtering

Language:HTMLStargazers:0Issues:0Issues:0

PercepNet

(Under construct) Unofficial implementation of PercepNet: A Perceptually-Motivated Approach for Low-Complexity, Real-Time Enhancement of Fullband Speech

License:BSD-3-ClauseStargazers:0Issues:0Issues:0

Percepnet-Keras

percepnet implemented using Keras, still need to be optimized and tuned.

License:BSD-3-ClauseStargazers:0Issues:0Issues:0

python-pesq

PESQ (Perceptual Evaluation of Speech Quality) Wrapper for Python Users (narrow band and wide band)

Language:CLicense:MITStargazers:0Issues:1Issues:0
Language:C++License:NOASSERTIONStargazers:0Issues:1Issues:0
Language:C++License:BSD-3-ClauseStargazers:0Issues:1Issues:0

rnnoise

Recurrent neural network for audio noise reduction

Language:CLicense:BSD-3-ClauseStargazers:0Issues:1Issues:0

speechmetrics

A wrapper around speech quality metrics MOSNet, BSSEval, STOI, PESQ, SRMR, SISDR

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

speexdsp

Speex audio processing library - THIS IS A MIRROR, DEVELOPMENT HAPPENS AT https://gitlab.xiph.org/xiph/speexdsp

Language:CLicense:NOASSERTIONStargazers:0Issues:1Issues:0

Subband_Kalman_AEC

Subband kalman filter for echo cancellation

Language:MATLABLicense:MITStargazers:0Issues:0Issues:0

TDengine

An open-source big data platform designed and optimized for the Internet of Things (IoT).

Language:CLicense:AGPL-3.0Stargazers:0Issues:1Issues:0