wangyang199609's repositories
asteroid
The PyTorch-based audio source separation toolkit for researchers || Pretrained models available
avobjects
Implementation for ECCV20 paper "Self-Supervised Learning of audio-visual objects from video"
awesome-multimodal-ml
Reading list for research topics in multimodal machine learning
ConferencingSpeech2022
Non-intrusive Objective Speech Quality Assessment (NISQA) Challenge in Online Conferencing Applications
dnn_aec_data_process
Preprocessing script for TIMIT data, for DNN-based AEC work
Dual-Path-Transformer-Network-PyTorch
Unofficial implementation of Dual-Path Transformer Network (DPTNet) for speech separation (Interspeech 2020)
facenet-pytorch
Pretrained PyTorch face detection (MTCNN) and facial recognition (InceptionResnet) models
fucking-algorithm
Solving algorithm problems is all about patterns, and labuladong is all you need! English version supported! Crack LeetCode, not only how, but also why.
Lipreading_using_Temporal_Convolutional_Networks
ICASSP'21 Towards Practical Lipreading with Distilled and Efficient Models; ICASSP'20 Lipreading using Temporal Convolutional Networks
LLMSurvey
The official GitHub page for the survey paper "A Survey of Large Language Models".
ncnn
ncnn is a high-performance neural network inference framework optimized for the mobile platform
Real-Time-Voice-Cloning
Clone a voice in 5 seconds to generate arbitrary speech in real-time
RIR-Generator
Generating room impulse responses
rnnoise
Recurrent neural network for audio noise reduction
speaker_extraction_SpEx
multi-scale time domain speaker extraction
SpeechAlgorithms
Speech algorithms, from the Speech Algorithms Group
speechbrain
A PyTorch-based Speech Toolkit
traditional-speech-enhancement
Traditional speech enhancement methods
Tutorial_Separation
This repo summarizes the tutorials, datasets, papers, codes and tools for speech separation and speaker extraction task. You are kindly invited to pull requests.
v2rayNvpn
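Firewall circumvention, free firewall circumvention, free access to the open internet, free nodes, free proxies, free ss/ssr/v2ray/trojan nodes, Lantern, Google Play Store, circumvention proxies, overseas online games, foreign games, VPN, VPN recommendations, updated daily, accessing the overseas internet, overseas internet, V2rayN, Qv2ray, V2rayW, V2RayS, Mellow, V2rayX, V2rayU, ClashX, Kitsunebi, BifrostV, i2Ray, Quantumult, Surge 4, winXray, Trojan-Qt5, proxy servers, proxy service providers, Mario, World of Warcraft, poshMark, Amazon, Shopee, Mercari, foreign trade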
VoViT
VoViT: Low Latency Graph-based Audio-Visual Voice Separation Transformer
WebRTC_NS
Noise Suppression Module Port From WebRTC
youtube-dl
Command-line program to download videos from YouTube.com and other video sites
yt-dlp
A youtube-dl fork with additional features and fixes