Marco Lee's starred repositories
whisper.cpp
Port of OpenAI's Whisper model in C/C++
ControlNet
Let us control diffusion models!
sd-webui-controlnet
WebUI extension for ControlNet
imaginAIry
Pythonic AI generation of images and videos
x-transformers
A concise but complete full-attention transformer with a set of promising experimental features from various papers
denoiser
Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020)We provide a PyTorch implementation of the paper Real Time Speech Enhancement in the Waveform Domain. In which, we present a causal speech enhancement model working on the raw waveform that runs in real-time on a laptop CPU. The proposed model is based on an encoder-decoder architecture with skip-connections. It is optimized on both time and frequency domains, using multiple loss functions. Empirical evidence shows that it is capable of removing various kinds of background noise including stationary and non-stationary noises, as well as room reverb. Additionally, we suggest a set of data augmentation techniques applied directly on the raw waveform which further improve model performance and its generalization abilities.
awesome-zkml
awesome-zkml repository
LIVE-Layerwise-Image-Vectorization
[CVPR 2022 Oral] Towards Layer-wise Image Vectorization
git-re-basin
Code release for "Git Re-Basin: Merging Models modulo Permutation Symmetries"
pyctcdecode
A fast and lightweight python-based CTC beam search decoder for speech recognition.
FullSubNet-plus
The official PyTorch implementation of "FullSubNet+: Channel Attention FullSubNet with Complex Spectrograms for Speech Enhancement".
mlsmpm-particles-rs
MLS-MPM fluid simulation in two dimensions, in Rust with Bevy.
Handwritten-Text-Recognition
Fast Writer Adaptation with Style Extractor Network for Handwritten Text Recognition