Florian Stègre's starred repositories
HierSpeechpp
The official implementation of HierSpeech++
pytorch-lightning
Pretrain, finetune and deploy AI models on multiple GPUs, TPUs with zero code changes.
Real-Time-Voice-Cloning
Clone a voice in 5 seconds to generate arbitrary speech in real-time
screenshot-to-code
Drop in a screenshot and convert it to clean code (HTML/Tailwind/React/Vue)
metavoice-src
Foundational model for human-like, expressive TTS
seamless_communication
Foundational Models for State-of-the-Art Speech and Text Translation
Deepfake-using-Wave2Lip
A deep learning model to lip-sync a given video with any given audio. It uses GAN architecture to orchestrate loss reconstruction or training.
sd-wav2lip-uhq
Wav2Lip UHQ extension for Automatic1111
DeepFaceLab
DeepFaceLab is the leading software for creating deepfakes.
pyannote-audio
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
inaSpeechSegmenter
CNN-based audio segmentation toolkit. Allows to detect speech, music, noise and speaker gender. Has been designed for large scale gender equality studies based on speech time per gender.
Retrieval-based-Voice-Conversion-WebUI
Easily train a good VC model with voice data <= 10 mins!
ultimatevocalremovergui
GUI for a Vocal Remover that uses Deep Neural Networks.
so-vits-svc-fork
so-vits-svc fork with realtime support, improved interface and more features.
MaskFreeVIS
Mask-Free Video Instance Segmentation [CVPR 2023]