wangyang199609

0

followers

following

stars

wangyang199609's repositories

av-se

Deep-Learning-Based Audio-Visual Speech Enhancement and Separation

100

asteroid

The PyTorch-based audio source separation toolkit for researchers || Pretrained models available

Language:PythonMIT000

athena-signal

Language:CApache-2.0000

avobjects

Implementation for ECCV20 paper "Self-Supervised Learning of audio-visual objects from video"

Language:PythonMIT000

awesome-multimodal-ml

Reading list for research topics in multimodal machine learning

MIT000

ConferencingSpeech2022

Non-intrusive Objective Speech Quality Assessment (NISQA) Challenge in Online Conferencing Applications

Language:PythonApache-2.0000

dnn_aec_data_process

pre-process script for timit data for dnn-aec works

Language:Python000

Dual-Path-Transformer-Network-PyTorch

Unofficial implementation of Dual-Path Transformer Network (DPTNet) for speech separation (Interspeech 2020)

000

facenet-pytorch

Pretrained Pytorch face detection (MTCNN) and facial recognition (InceptionResnet) models

Language:PythonMIT000

fucking-algorithm

刷算法全靠套路，认准 labuladong 就够了！English version supported! Crack LeetCode, not only how, but also why.

000

learngit

Language:Python010

Lipreading_using_Temporal_Convolutional_Networks

ICASSP'21 Towards Practical Lipreading with Distilled and Efficient Models; ICASSP'20 Lipreading using Temporal Convolutional Networks

Language:PythonNOASSERTION000

LLMSurvey

The official GitHub page for the survey paper "A Survey of Large Language Models".

000

MuSE

000

ncnn

ncnn is a high-performance neural network inference framework optimized for the mobile platform

Language:C++NOASSERTION000

Real-Time-Voice-Cloning

Clone a voice in 5 seconds to generate arbitrary speech in real-time

NOASSERTION000

RIR-Generator

Generating room impulse responses

Language:C++MIT000

rir-generator-1

Language:PythonMIT000

rnnoise

Recurrent neural network for audio noise reduction

Language:CBSD-3-Clause000

speaker_extraction_SpEx

multi-scale time domain speaker extraction

GPL-3.0000

speech-demo.github.io

000

SpeechAlgorithms

Speech Algorithms ， from 语音算法组

Language:CApache-2.0000

speechbrain

A PyTorch-based Speech Toolkit

Apache-2.0000

traditional-speech-enhancement

语音增强传统方法

MIT000

Tutorial_Separation

This repo summarizes the tutorials, datasets, papers, codes and tools for speech separation and speaker extraction task. You are kindly invited to pull requests.

000

v2rayNvpn

翻墙、免费翻墙、免费科学上网、免费节点、免费梯子、免费ss/ssr/v2ray/trojan节点、蓝灯、谷歌商店、翻墙梯子、外网游戏、国外游戏、vpn、vpn推荐、每天更新、上外网、外网、V2rayN、Qv2ray、V2rayW、V2RayS、Mellow、V2rayX、V2rayU、ClashX、Kitsunebi、BifrostV、i2Ray 、Quantumult、Surge 4、winXray、Qv2ray、Kitsunebi、Trojan-Qt5、代理服务器、机场、马里奥、魔兽世界、poshMark、亚马逊、虾皮、煤炉、Mercari、外贸

000

VoViT

VoViT: Low Latency Graph-based Audio-Visual VoiceSeparation Transformer

Language:Python000

WebRTC_NS

Noise Suppression Module Port From WebRTC

Language:CBSD-3-Clause000

youtube-dl

Command-line program to download videos from YouTube.com and other video sites

Language:PythonUnlicense000

yt-dlp

A youtube-dl fork with additional features and fixes

Language:PythonUnlicense000