pveugen

@detail-co

Amsterdam

http://linkedin.com/in/pveugen

Paul Veugen's starred repositories

audiocraft

Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.

Language: Python | License: MIT | 20325 198 368

mlx

MLX: An array framework for Apple silicon

Language: C++ | License: MIT | 16007 139 477

whisperX

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

Language: Python | License: BSD-2-Clause | 10270 127 655

espnet

End-to-End Speech Processing Toolkit

Language: Python | License: Apache-2.0 | 8167 179 2335

WhisperKit

On-device Speech Recognition for Apple Silicon

Language: Swift | License: MIT | 2991 30 104

SwiftyGif

High performance GIF engine

Language: Swift | License: MIT | 1974 27 136

player

UI components and hooks for building video/audio players on the web. Robust, customizable, and accessible. Modern alternative to JW Player and Video.js.

Language: TypeScript | License: MIT | 1951 38 570

Facial-Expression-Recognition.Pytorch

A CNN-based PyTorch implementation of facial expression recognition (FER2013 and CK+), achieving 73.112% (state-of-the-art) on FER2013 and 94.64% on the CK+ dataset

Language: Python | License: MIT | 1769 31 142

resemble-enhance

AI-powered speech denoising and enhancement

Language: Python | License: MIT | 1140 16 37

WebRTC-iOS

A simple native WebRTC demo iOS app using Swift

Language: Swift | License: Apache-2.0 | 1094 36 122

DSWaveformImage

Generate waveform images from audio files on iOS, macOS & visionOS in Swift. Native SwiftUI & UIKit views.

Language: Swift | License: MIT | 1001 17 84

voicefixer

General Speech Restoration

Language: Python | License: MIT | 967 16 58

Swift-YouTube-Player

Swift library for embedding and controlling YouTube videos in your iOS applications via WKWebView!

Language: Swift | License: MIT | 872 39 175

MAXINE-AR-SDK

NVIDIA AR SDK - API headers and sample applications

Language: C | License: MIT | 732 370

SwiftSpeech

A speech recognition framework designed for SwiftUI.

Language: Swift | License: MIT | 447 10 15

TipKit-Examples

An example project for the TipKit framework

Language: Swift | 393 4 6

mayavoz

PyTorch-based speech enhancement toolkit.

Language: Python | License: MIT | 328 14 16

VSP-LLM

Language: Python | License: NOASSERTION | 285 6 3

Waveform

GPU-accelerated waveform view

Language: Swift | License: MIT | 188 7 6

libspecbleach

C library for audio noise reduction and other spectral effects

Language: C | License: LGPL-2.1 | 62 6 24

pyBK

Speaker diarization system in Python based on binary key speaker modelling

Language: Python | License: MIT | 61 9 6

syncstart

Calculate the cut needed at start to sync two media files

Language: Python | License: MIT | 57 4 11

koala

On-device noise suppression powered by deep learning

Language: Python | License: Apache-2.0 | 53 12 8

xcc

A CLI for Xcode Cloud

Language: Swift | License: CC0-1.0 | 44 4 2

FuzzyMatchingSwift

Fuzzy matching String extensions

Language: Swift | License: NOASSERTION | 44 4 39

ios-pro-camera

A Pro camera app for iOS

Language: Swift | 41 5 2

NumPy-iOS

Swift package for using NumPy in iOS apps

Language: Python | License: MIT | 30 3 8

geepeetto

Localize your iOS App strings automatically using ChatGPT 🤖 🤥!

Language: Python | License: MIT | 30 2 1

VoiceActivityDetector

WebRTC-based voice activity detection

Language: Swift | License: MIT | 16 2 2

detectSilence

A Swift script for detecting silence in audio files made with reactive programming in RxSwift

Language: Swift | 12 20