wangyang199609

wangyang199609's repositories

av-se

Deep-Learning-Based Audio-Visual Speech Enhancement and Separation

Stargazers: 1 | Issues: 0 | Issues: 0

asteroid

The PyTorch-based audio source separation toolkit for researchers || Pretrained models available

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0
Language: C | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

avobjects

Implementation for ECCV20 paper "Self-Supervised Learning of audio-visual objects from video"

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

awesome-multimodal-ml

Reading list for research topics in multimodal machine learning

License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

ConferencingSpeech2022

Non-intrusive Objective Speech Quality Assessment (NISQA) Challenge in Online Conferencing Applications

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

dnn_aec_data_process

Preprocessing script for TIMIT data for DNN-AEC work

Language: Python | Stargazers: 0 | Issues: 0 | Issues: 0

Dual-Path-Transformer-Network-PyTorch

Unofficial implementation of Dual-Path Transformer Network (DPTNet) for speech separation (Interspeech 2020)

Stargazers: 0 | Issues: 0 | Issues: 0

facenet-pytorch

Pretrained PyTorch face detection (MTCNN) and facial recognition (InceptionResnet) models

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

fucking-algorithm

Solving algorithm problems is all about patterns, and labuladong is all you need! English version supported! Crack LeetCode, not only how, but also why.

Stargazers: 0 | Issues: 0 | Issues: 0
Language: Python | Stargazers: 0 | Issues: 1 | Issues: 0

Lipreading_using_Temporal_Convolutional_Networks

ICASSP'21 Towards Practical Lipreading with Distilled and Efficient Models; ICASSP'20 Lipreading using Temporal Convolutional Networks

Language: Python | License: NOASSERTION | Stargazers: 0 | Issues: 0 | Issues: 0

LLMSurvey

The official GitHub page for the survey paper "A Survey of Large Language Models".

Stargazers: 0 | Issues: 0 | Issues: 0
Stargazers: 0 | Issues: 0 | Issues: 0

ncnn

ncnn is a high-performance neural network inference framework optimized for the mobile platform

Language: C++ | License: NOASSERTION | Stargazers: 0 | Issues: 0 | Issues: 0

Real-Time-Voice-Cloning

Clone a voice in 5 seconds to generate arbitrary speech in real-time

License: NOASSERTION | Stargazers: 0 | Issues: 0 | Issues: 0

RIR-Generator

Generating room impulse responses

Language: C++ | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0
Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

rnnoise

Recurrent neural network for audio noise reduction

Language: C | License: BSD-3-Clause | Stargazers: 0 | Issues: 0 | Issues: 0

speaker_extraction_SpEx

Multi-scale time-domain speaker extraction

License: GPL-3.0 | Stargazers: 0 | Issues: 0 | Issues: 0
Stargazers: 0 | Issues: 0 | Issues: 0

SpeechAlgorithms

Speech algorithms, from the Speech Algorithms Group (语音算法组)

Language: C | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

speechbrain

A PyTorch-based Speech Toolkit

License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

traditional-speech-enhancement

Traditional speech enhancement methods

License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

Tutorial_Separation

This repo collects tutorials, datasets, papers, code, and tools for the speech separation and speaker extraction tasks. Pull requests are welcome.

Stargazers: 0 | Issues: 0 | Issues: 0

v2rayNvpn

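Firewall circumvention, free firewall circumvention, free "scientific" internet access, free nodes, free "ladders", free ss/ssr/v2ray/trojan nodes, Lantern, Google Play Store, circumvention ladders, overseas-server games, foreign games, vpn, vpn recommendations, updated daily, accessing the overseas internet, overseas internet, V2rayN, Qv2ray, V2rayW, V2RayS, Mellow, V2rayX, V2rayU, ClashX, Kitsunebi, BifrostV, i2Ray, Quantumult, Surge 4, winXray, Qv2ray, Kitsunebi, Trojan-Qt5, proxy servers, "airports" (proxy providers), Mario, World of Warcraft, poshMark, Amazon, Shopee, Mercari (煤炉), Mercari, foreign trade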

Stargazers: 0 | Issues: 0 | Issues: 0

VoViT

VoViT: Low Latency Graph-based Audio-Visual Voice Separation Transformer

Language: Python | Stargazers: 0 | Issues: 0 | Issues: 0

WebRTC_NS

Noise Suppression Module Port From WebRTC

Language: C | License: BSD-3-Clause | Stargazers: 0 | Issues: 0 | Issues: 0

youtube-dl

Command-line program to download videos from YouTube.com and other video sites

Language: Python | License: Unlicense | Stargazers: 0 | Issues: 0 | Issues: 0

yt-dlp

A youtube-dl fork with additional features and fixes

Language: Python | License: Unlicense | Stargazers: 0 | Issues: 0 | Issues: 0