There are 388 repositories under audio topic.
🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.
SRS is a simple, high-efficiency, real-time media server supporting RTMP, WebRTC, HLS, HTTP-FLV, HTTP-TS, SRT, MPEG-DASH, and GB28181.
GUI for a Vocal Remover that uses Deep Neural Networks.
Background Music, a macOS audio utility: automatically pause your music, set individual apps' volumes and record system audio.
BlackHole is a modern macOS audio loopback driver that allows applications to pass audio to other applications with zero additional latency.
💿 Free software that works great, and also happens to be open-source Python.
FFmpeg for browser, powered by WebAssembly
A hands-on introduction to video technology: image, video, codec (av1, vp9, h265) and more (ffmpeg encoding). Translations: 🇺🇸 🇨🇳 🇯🇵 🇮🇹 🇰🇷 🇷🇺 🇧🇷 🇪🇸
Code. Music. Live.
A PyTorch-based Speech Toolkit
openFrameworks is a community-developed cross platform toolkit for creative coding in C++.
AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head
EarTrumpet - Volume Control for Windows
A React component for playing a variety of URLs, including file paths, YouTube, Facebook, Twitch, SoundCloud, Streamable, Vimeo, Wistia and DailyMotion
Speech recognition module for Python, supporting several engines and APIs, online and offline.
AirPlay and AirPlay 2 audio player
JUCE is an open-source cross-platform C++ application framework for desktop and mobile applications, including VST, VST3, AU, AUv3, LV2 and AAX audio plug-ins.
Mumble is an open-source, low-latency, high quality voice chat software.
THIS REPO IS NOT MAINTAINED ANYMORE. Please see https://codeberg.org/tenacityteam/tenacity for Tenacity, which is maintained.
Full-featured audio/video downloader for Android using yt-dlp
3D engine with modern graphics