Beast code in Giters

demonstan's starred repositories

deepMAR-Lite

Multi-attribute recognition net in an updated and containerised PyTorch version

Language:Python700

pedestrian-attribute-recognition-pytorch

A simple baseline for pedestrian attribute recognition in surveillance scenarios

Language:Python32700

speaker-id

This repository contains audio samples and supplementary materials accompanying publications by the "Speaker, Voice and Language" team at Google.

Language:PythonApache-2.033500

The SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain users can easily create speech processing systems, ranging from speech recognition (both HMM/DNN and end-to-end), speaker recognition, speech enhancement, speech separation, multi-microphone speech processing, and many others.

Language:HTML36000

portaudio

PortAudio is a cross-platform, open-source C language library for real-time audio input and output.

Language:CNOASSERTION138700

vosk-api

Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node

Language:Jupyter NotebookApache-2.0748900

kfr

Fast, modern C++ DSP framework, FFT, Sample Rate Conversion, FIR/IIR/Biquad Filters (SSE, AVX, AVX-512, ARM NEON)

Language:C++GPL-2.0162400

Low-Latency-Android-iOS-Linux-Windows-tvOS-macOS-Interactive-Audio-Platform

🇸Superpowered Audio, Networking and Cryptographics SDKs. High performance and cross platform on Android, iOS, macOS, tvOS, Linux, Windows and modern web browsers.

Language:C++132900

essentia

C++ library for audio and music analysis, description and synthesis, including Python bindings

Language:C++AGPL-3.0277300

libsndfile

A C library for reading and writing sound files containing sampled audio data.

Language:CLGPL-2.1138900

r8brain-free-src

High-quality pro audio resampler / sample rate converter C++ library. Very fast, for both audio resampling and time-series interpolation.

Language:C++MIT55000

libsamplerate

An audio Sample Rate Conversion library

Language:CBSD-2-Clause58100

SFML

Simple and Fast Multimedia Library

Language:C++Zlib986400

DeepSpeech

DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.

Language:C++MPL-2.02484700

AudioFile

A simple C++ library for reading and writing audio files.

Language:C++MIT93300

NumCpp

C++ implementation of the Python Numpy library

Language:C++MIT348400

nlpaug

Data augmentation for NLP

Language:Jupyter NotebookMIT436600

voicefilter

Unofficial PyTorch implementation of Google AI's VoiceFilter system

Language:Python106100

VoiceIdentityBook

《声纹技术：从核心算法到工程实践》

14800

uis-rnn

This is the library for the Unbounded Interleaved-State Recurrent Neural Network (UIS-RNN) algorithm, corresponding to the paper Fully Supervised Speaker Diarization.

Language:PythonApache-2.0154800

awesome-diarization

A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.

Apache-2.0152300

Resemblyzer

A python package to analyze and compare voices with deep learning

Language:PythonApache-2.0268900

DNS-Challenge

This repo contains the scripts, models, and required files for the Deep Noise Suppression (DNS) Challenge.

Language:PythonCC-BY-4.0103400

vedadet

A single stage object detection toolbox based on PyTorch

Language:PythonApache-2.049700

py-webrtcvad

Python interface to the WebRTC Voice Activity Detector

Language:CNOASSERTION196500

MS-SNSD

The Microsoft Scalable Noisy Speech Dataset (MS-SNSD) is a noisy speech dataset that can scale to arbitrary sizes depending on the number of speakers, noise types, and Speech to Noise Ratio (SNR) levels desired.

Language:HTMLMIT46000

deep-speaker

Deep Speaker: an End-to-End Neural Speaker Embedding System.

Language:PythonMIT89700

pyannote-audio

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

Language:Jupyter NotebookMIT560700

pytorch_xvectors

Deep speaker embeddings in PyTorch, including x-vectors. Code used in this work: https://arxiv.org/abs/2007.16196

Language:PythonMIT30300

demonstan