Paul Veugen's starred repositories
audiocraft
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.
WhisperKit
On-device Speech Recognition for Apple Silicon
Facial-Expression-Recognition.Pytorch
A CNN-based PyTorch implementation of facial expression recognition (FER2013 and CK+), achieving 73.112% (state-of-the-art) on FER2013 and 94.64% on the CK+ dataset
resemble-enhance
AI-powered speech denoising and enhancement
WebRTC-iOS
A simple native WebRTC demo iOS app using Swift
DSWaveformImage
Generate waveform images from audio files on iOS, macOS & visionOS in Swift. Native SwiftUI & UIKit views.
voicefixer
General Speech Restoration
Swift-YouTube-Player
Swift library for embedding and controlling YouTube videos in your iOS applications via WKWebView!
MAXINE-AR-SDK
NVIDIA AR SDK - API headers and sample applications
SwiftSpeech
A speech recognition framework designed for SwiftUI.
TipKit-Examples
An example project for the TipKit framework
libspecbleach
C library for audio noise reduction and other spectral effects
FuzzyMatchingSwift
Fuzzy matching String extensions
ios-pro-camera
A Pro camera app for iOS
VoiceActivityDetector
WebRTC-based voice activity detection
detectSilence
A Swift script for detecting silence in audio files, written in a reactive style with RxSwift