bygreencn

China

http://bygreencn.wordpress.com/

Sui Libin's starred repositories

rnnoise

Recurrent neural network for audio noise reduction

Language: C · BSD-3-Clause · 377000

pulseaudio

Mirror of the PulseAudio sound server (for bug reports and pull requests go to the website!)

Language: C · NOASSERTION · 42000

FAST-LIVO

A Fast and Tightly-coupled Sparse-Direct LiDAR-Inertial-Visual Odometry (LIVO).

Language: C++ · GPL-2.0 · 86500

py-webrtcvad

Python interface to the WebRTC Voice Activity Detector

Language: C · NOASSERTION · 191200

demucs

Code for the paper Hybrid Spectrogram and Waveform Source Separation

Language: Python · MIT · 49400

Whisper

High-performance GPGPU inference of OpenAI's Whisper automatic speech recognition (ASR) model

Language: C++ · MPL-2.0 · 740300

pywhispercpp

Python bindings for whisper.cpp

Language: C++ · MIT · 12300

Speech-Translate

A real-time speech transcription and translation application using OpenAI Whisper and a free translation API. The interface is built with Tkinter, and the code is written entirely in Python.

Language: Python · MIT · 40400

python-rtmixer

Reliable low-latency audio playback and recording with Python

Language: C · MIT · 6000

libmingw32_extended

A library for mingw32 that provides some POSIX functions, with the goal of eventually covering all of them. The functions currently included are pipe, wait, mmap, munmap, msync, mlock, munlock, posix_madvise, madvise, shm_open, shm_unlink, readv, writev, process_vm_readv, process_vm_writev, dlopen, etc.

Language: C · MIT · 1200

Brick

The Brick sample-rate converter

Language: C++ · 1700

miniaudio

Audio playback and capture library written in C, in a single source file.

Language: C · NOASSERTION · 370100

sox

SoX, Swiss Army knife of sound processing

Language: C · NOASSERTION · 65000

libADLMIDI

A Software MIDI Synthesizer library with OPL3 (YMF262) emulator

Language: C++ · LGPL-3.0 · 17000

whisper-ctranslate2

Whisper command-line client based on CTranslate2, compatible with the original OpenAI client.

Language: Python · MIT · 78400

FastASR

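A project that implements ASR inference in C++. It has few dependencies, is simple to install, and runs inference quickly, including smoothly on ARM platforms such as the Raspberry Pi 4B. The supported models are optimized from Google's Transformer model and trained on the open-source wenetspeech dataset (10,000+ hours) or Alibaba's private dataset (60,000+ hours), so recognition quality is good and comparable to many commercial ASR products.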

Language: C · Apache-2.0 · 44600

whisper_real_time

Real-time transcription with OpenAI Whisper.

Language: Python · 196600

faster-whisper-livestream-translator

faster-whisper livestream translation, OBS noise reduction, dual language subtitles

Language: Python · MIT · 6800

stream-translator

Language: Python · MIT · 20900

pyAudioAnalysis

Python Audio Analysis Library: Feature Extraction, Classification, Segmentation and Applications

Language: Python · Apache-2.0 · 570900

Colaboratory-Notebook-for-Ultimate-Vocal-Remover

Colaboratory Notebook for Ultimate Vocal Remover

Language: Python · 8700

ultimatevocalremovergui

GUI for a Vocal Remover that uses Deep Neural Networks.

Language: Python · MIT · 1541500

spleeter

Deezer source separation library including pretrained models.

Language: Python · MIT · 2508700

whisper-jax-colab

Language: Jupyter Notebook · Unlicense · 4400

whisper-jax

JAX implementation of OpenAI's Whisper model for up to 70x speed-up on TPU.

Language: Jupyter Notebook · Apache-2.0 · 414600

vad

Voice activity detector (VAD) for the browser with a simple API

Language: TypeScript · NOASSERTION · 43000

snowboy

Future versions with model training module will be maintained through a forked version here: https://github.com/seasalt-ai/snowboy

Language: C++ · NOASSERTION · 300200

whisper_streaming

Whisper real-time streaming for long-form speech-to-text transcription and translation

Language: Python · MIT · 121000

whisper-playground

Build real-time speech-to-text web apps using OpenAI's Whisper https://openai.com/blog/whisper/

Language: Python · MIT · 76600

diart

A Python package to build AI-powered real-time audio applications

Language: Python · MIT · 84100