demonstan's starred repositories
deepMAR-Lite
Multi-attribute recognition net in an updated and containerised PyTorch version
pedestrian-attribute-recognition-pytorch
A simple baseline for pedestrian attribute recognition in surveillance scenarios
speaker-id
This repository contains audio samples and supplementary materials accompanying publications by the "Speaker, Voice and Language" team at Google.
speechbrain.github.io
The SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain users can easily create speech processing systems, ranging from speech recognition (both HMM/DNN and end-to-end), speaker recognition, speech enhancement, speech separation, multi-microphone speech processing, and many others.
Low-Latency-Android-iOS-Linux-Windows-tvOS-macOS-Interactive-Audio-Platform
🇸Superpowered Audio, Networking and Cryptographics SDKs. High performance and cross platform on Android, iOS, macOS, tvOS, Linux, Windows and modern web browsers.
libsndfile
A C library for reading and writing sound files containing sampled audio data.
r8brain-free-src
High-quality pro audio resampler / sample rate converter C++ library. Very fast, for both audio resampling and time-series interpolation.
libsamplerate
An audio Sample Rate Conversion library
DeepSpeech
DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.
voicefilter
Unofficial PyTorch implementation of Google AI's VoiceFilter system
VoiceIdentityBook
《声纹技术:从核心算法到工程实践》
awesome-diarization
A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.
Resemblyzer
A python package to analyze and compare voices with deep learning
DNS-Challenge
This repo contains the scripts, models, and required files for the Deep Noise Suppression (DNS) Challenge.
py-webrtcvad
Python interface to the WebRTC Voice Activity Detector
deep-speaker
Deep Speaker: an End-to-End Neural Speaker Embedding System.
pyannote-audio
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
pytorch_xvectors
Deep speaker embeddings in PyTorch, including x-vectors. Code used in this work: https://arxiv.org/abs/2007.16196