Sui Libin's starred repositories

rnnoise

Recurrent neural network for audio noise reduction

Language:CLicense:BSD-3-ClauseStargazers:3770Issues:0Issues:0

pulseaudio

Mirror of the PulseAudio sound server (for bug reports and pull requests go to the website!)

Language:CLicense:NOASSERTIONStargazers:420Issues:0Issues:0

FAST-LIVO

A Fast and Tightly-coupled Sparse-Direct LiDAR-Inertial-Visual Odometry (LIVO).

Language:C++License:GPL-2.0Stargazers:865Issues:0Issues:0

py-webrtcvad

Python interface to the WebRTC Voice Activity Detector

Language:CLicense:NOASSERTIONStargazers:1912Issues:0Issues:0

demucs

Code for the paper Hybrid Spectrogram and Waveform Source Separation

Language:PythonLicense:MITStargazers:494Issues:0Issues:0

Whisper

High-performance GPGPU inference of OpenAI's Whisper automatic speech recognition (ASR) model

Language:C++License:MPL-2.0Stargazers:7403Issues:0Issues:0

pywhispercpp

Python bindings for whisper.cpp

Language:C++License:MITStargazers:123Issues:0Issues:0

Speech-Translate

A realtime speech transcription and translation application using Whisper OpenAI and free translation API. Interface made using Tkinter. Code written fully in Python.

Language:PythonLicense:MITStargazers:404Issues:0Issues:0

python-rtmixer

:microphone: Reliable low-latency audio playback and recording with Python :snake:

Language:CLicense:MITStargazers:60Issues:0Issues:0

libmingw32_extended

A library for mingw32 that includes some POSIX functions but eventually all of the POSIX functions will be completed and right now the POSIX functions that are included in this repository are pipe, wait, mmap, munmap, msync, mlock, munlock, posix_madvise, madvise, shm_open, shm_unlink, readv, writev, process_vm_readv, process_vm_writev, dlopen, etc

Language:CLicense:MITStargazers:12Issues:0Issues:0

Brick

The Brick sample-rate converter

Language:C++Stargazers:17Issues:0Issues:0

miniaudio

Audio playback and capture library written in C, in a single source file.

Language:CLicense:NOASSERTIONStargazers:3701Issues:0Issues:0

sox

SoX, Swiss Army knife of sound processing

Language:CLicense:NOASSERTIONStargazers:650Issues:0Issues:0

libADLMIDI

A Software MIDI Synthesizer library with OPL3 (YMF262) emulator

Language:C++License:LGPL-3.0Stargazers:170Issues:0Issues:0

whisper-ctranslate2

Whisper command line client compatible with original OpenAI client based on CTranslate2.

Language:PythonLicense:MITStargazers:784Issues:0Issues:0

FastASR

这是一个用C++实现ASR推理的项目,它依赖很少,安装也很简单,推理速度很快,在树莓派4B等ARM平台也可以流畅的运行。 支持的模型是由Google的Transformer模型中优化而来,数据集是开源wenetspeech(10000+小时)或阿里私有数据集(60000+小时), 所以识别效果也很好,可以媲美许多商用的ASR软件。

Language:CLicense:Apache-2.0Stargazers:446Issues:0Issues:0

whisper_real_time

Real time transcription with OpenAI Whisper.

Language:PythonStargazers:1966Issues:0Issues:0

faster-whisper-livestream-translator

faster-whisper livestream translation, OBS noise reduction, dual language subtitles

Language:PythonLicense:MITStargazers:68Issues:0Issues:0
Language:PythonLicense:MITStargazers:209Issues:0Issues:0

pyAudioAnalysis

Python Audio Analysis Library: Feature Extraction, Classification, Segmentation and Applications

Language:PythonLicense:Apache-2.0Stargazers:5709Issues:0Issues:0

Colaboratory-Notebook-for-Ultimate-Vocal-Remover

Colaboratory Notebook for Ultimate Vocal Remover

Language:PythonStargazers:87Issues:0Issues:0

ultimatevocalremovergui

GUI for a Vocal Remover that uses Deep Neural Networks.

Language:PythonLicense:MITStargazers:15415Issues:0Issues:0

spleeter

Deezer source separation library including pretrained models.

Language:PythonLicense:MITStargazers:25087Issues:0Issues:0
Language:Jupyter NotebookLicense:UnlicenseStargazers:44Issues:0Issues:0

whisper-jax

JAX implementation of OpenAI's Whisper model for up to 70x speed-up on TPU.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:4146Issues:0Issues:0

vad

Voice activity detector (VAD) for the browser with a simple API

Language:TypeScriptLicense:NOASSERTIONStargazers:430Issues:0Issues:0

snowboy

Future versions with model training module will be maintained through a forked version here: https://github.com/seasalt-ai/snowboy

Language:C++License:NOASSERTIONStargazers:3002Issues:0Issues:0

whisper_streaming

Whisper realtime streaming for long speech-to-text transcription and translation

Language:PythonLicense:MITStargazers:1210Issues:0Issues:0

whisper-playground

Build real time speech2text web apps using OpenAI's Whisper https://openai.com/blog/whisper/

Language:PythonLicense:MITStargazers:766Issues:0Issues:0

diart

A python package to build AI-powered real-time audio applications

Language:PythonLicense:MITStargazers:841Issues:0Issues:0