asdfs's repositories
ffmpeg-python
Python bindings for FFmpeg - with complex filtering support
AutoSpeech
[InterSpeech 2020] "AutoSpeech: Neural Architecture Search for Speaker Recognition" by Shaojin Ding*, Tianlong Chen*, Xinyu Gong, Weiwei Zha, Zhangyang Wang
cmake-examples
Useful CMake Examples
crfpp
CRF++: Yet Another CRF toolkit
face_recognition
The world's simplest facial recognition api for Python and the command line
gogozhifu
个人免签支付系统,GOGO支付,无手续费,免挂机,收款实时回调
janus-gateway
Janus WebRTC Server
k2
FSA/FST algorithms, differentiable, with PyTorch compatibility.
leaderboard
SpeechIO Leaderboard: a large, robust, comprehensive, benchmarking platform for Automatic Speech Recognition.
Open-IM-SDK-Flutter
OpenIM:由前微信技术专家打造的基于 Go 实现的即时通讯(IM)项目,Flutter版本IM SDK
Open-IM-Server
OpenIM:由前微信技术专家打造的基于 Go 实现的即时通讯(IM)项目,从服务端到客户端SDK开源即时通讯(IM)整体解决方案,可以轻松替代第三方IM云服务,打造具备聊天、社交功能的app。
openface
Face recognition with deep neural networks.
react-native-ip-sec-vpn
React Native IPSec VPN Module
RepVGG
RepVGG: Making VGG-style ConvNets Great Again
rnnoise_16k
implementation of rnnoise_16k
Speech-Emotion-Analyzer
The neural network model is capable of detecting five different male/female emotions from audio speeches. (Deep Learning, NLP, Python)
speechmetrics
A wrapper around speech quality metrics MOSNet, BSSEval, STOI, PESQ, SRMR, SISDR
SpeechSeparation
Using SepFormer
svoice
We provide a PyTorch implementation of the paper Voice Separation with an Unknown Number of Multiple Speakers In which, we present a new method for separating a mixed audio sequence, in which multiple voices speak simultaneously. The new method employs gated neural networks that are trained to separate the voices at multiple processing steps, while maintaining the speaker in each output channel fixed. A different model is trained for every number of possible speakers, and the model with the largest number of speakers is employed to select the actual number of speakers in a given sample. Our method greatly outperforms the current state of the art, which, as we show, is not competitive for more than two speakers.
TalkNet_ASD
TalkNet: Is Someone Speaking? Exploring Long-term Temporal Features for Audio-visual Active Speaker Detection
VoiceprintRecognition-Tensorflow
使用Tensorflow实现声纹识别
voxceleb_trainer
In defence of metric learning for speaker recognition